eGeMAPS: Compute the eGeMAPS openSMILE feature set
In humlab-speech/superassp: Speech signal processing using various frameworks via a wrassp-like interface

eGeMAPS

R Documentation

Compute the eGeMAPS openSMILE feature set

Description

This function computes the extended Geneva Minimalistic Acoustic Parameter Set for voice research and affective computing (eGeMAPS v02; Eyben et al. 2015, doi:10.1109/taffc.2015.2457417) over a portion of a recording.

Usage

eGeMAPS(listOfFiles, beginTime = 0, endTime = 0, explicitExt = "ocp")

Arguments

`listOfFiles`	The full path to the sound file.
`beginTime`	The starting time of the section of the sound files that should be analysed.
`endTime`	The end time of the section of the sound files that should be analysed.
`explicitExt`	The file extension of the slice file where the results should be stored.
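
A minimal sketch of a call, following the Usage signature above. The file path is hypothetical, and treating `beginTime` and `endTime` as seconds is an assumption carried over from wrassp conventions rather than something this page states. The call is guarded so that nothing runs unless the package is installed and the file exists.

```r
# Hypothetical path; replace with a real recording on your system.
wav_file <- file.path("recordings", "speaker01.wav")

# Only attempt the call when the package is installed and the file exists.
if (requireNamespace("superassp", quietly = TRUE) && file.exists(wav_file)) {
  # Analyse an assumed window of the recording
  # (units assumed to be seconds, as in wrassp).
  feats <- superassp::eGeMAPS(
    listOfFiles = wav_file,
    beginTime   = 0.5,
    endTime     = 2.0,
    explicitExt = "ocp"
  )
  # Inspect the first few returned values.
  str(head(feats))
}
```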

Details

The eGeMAPS feature set consists of 88 static acoustic features, obtained by computing various functionals over low-level descriptors (LLDs). This function extracts them using the openSMILE acoustic feature extraction library (Eyben et al. 2010; Eyben et al. 2013, doi:10.1145/2502081.2502224).

Value

A list of 88 acoustic values, with names as reported by openSMILE. The extended acoustic parameter set contains the 18 low-level descriptors (LLDs) of the minimalistic set plus seven extension descriptors (MFCC 1-4, spectral flux, and formant 2-3 bandwidth), sorted below by parameter group:

Frequency related parameters:

Pitch, logarithmic f0 on a semitone frequency scale, starting at 27.5 Hz (semitone 0).
Jitter, deviations in individual consecutive f0 period lengths.
Formant 1, 2, and 3 frequency, centre frequency of the first, second, and third formant.
Formant 1 bandwidth, bandwidth of the first formant.
Formant 2-3 bandwidth, bandwidth of the second and third formant.

Energy/Amplitude related parameters:

Shimmer, difference of the peak amplitudes of consecutive f0 periods.
Loudness, estimate of perceived signal intensity from an auditory spectrum.
Harmonics-to-noise ratio (HNR), relation of energy in harmonic components to energy in noise-like components.
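
The semitone scale used for the pitch descriptor above can be sketched as follows. This is an illustrative conversion under the stated reference of 27.5 Hz, not openSMILE's own code; the function name `hz_to_semitone` is hypothetical, and mapping unvoiced frames (f0 of 0) to `NA` is an assumption made here to avoid taking the logarithm of zero.

```r
# Convert f0 in Hz to semitones relative to a reference frequency.
# Hypothetical helper; 27.5 Hz corresponds to semitone 0.
hz_to_semitone <- function(f0_hz, ref_hz = 27.5) {
  f0_hz[f0_hz <= 0] <- NA          # assumption: unvoiced frames become NA
  12 * log2(f0_hz / ref_hz)        # 12 semitones per octave
}

semitones <- hz_to_semitone(c(27.5, 55, 110, 440))
print(semitones)  # 0 12 24 48
```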

Spectral (balance) parameters:

Alpha Ratio, ratio of the summed energy from 50-1000 Hz and 1-5 kHz.
Hammarberg Index, ratio of the strongest energy peak in the 0-2 kHz region to the strongest peak in the 2-5 kHz region.
Spectral Slope 0-500 Hz and 500-1500 Hz, linear regression slope of the logarithmic power spectrum within the two given bands.
Formant 1, 2, and 3 relative energy, the ratio of the energy of the spectral harmonic peak at the first, second, and third formant's centre frequency to the energy of the spectral peak at F0.
Harmonic difference H1-H2, ratio of energy of the first f0 harmonic (H1) to the energy of the second f0 harmonic (H2).
Harmonic difference H1-A3, ratio of energy of the first f0 harmonic (H1) to the energy of the highest harmonic in the third formant range (A3).
MFCC 1-4, Mel-Frequency Cepstral Coefficients 1-4.
Spectral flux, difference of the spectra of two consecutive frames.
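
As an illustration of the two band-ratio descriptors in this group, the sketch below computes an Alpha Ratio and a Hammarberg Index from a toy power spectrum. The helper names, the toy data, and the treatment of band edges (lower edge inclusive, upper edge exclusive) are assumptions; the values are plain linear ratios, and openSMILE may report these descriptors on a different scale.

```r
# Sum of spectral power within [lo, hi) Hz (band-edge handling is an assumption).
band_energy <- function(freq_hz, power, lo, hi) {
  sum(power[freq_hz >= lo & freq_hz < hi])
}

# Strongest spectral peak within [lo, hi) Hz.
band_peak <- function(freq_hz, power, lo, hi) {
  max(power[freq_hz >= lo & freq_hz < hi])
}

# Summed energy in 50-1000 Hz relative to 1-5 kHz.
alpha_ratio <- function(freq_hz, power) {
  band_energy(freq_hz, power, 50, 1000) / band_energy(freq_hz, power, 1000, 5000)
}

# Strongest peak in 0-2 kHz relative to the strongest peak in 2-5 kHz.
hammarberg_index <- function(freq_hz, power) {
  band_peak(freq_hz, power, 0, 2000) / band_peak(freq_hz, power, 2000, 5000)
}

# Toy spectrum: bin centre frequencies (Hz) and linear power values.
freq_hz <- c(100, 500, 1500, 3000, 6000)
power   <- c(4, 4, 1, 1, 10)

alpha <- alpha_ratio(freq_hz, power)       # (4 + 4) / (1 + 1) = 4
hamm  <- hammarberg_index(freq_hz, power)  # 4 / 1 = 4
```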

These descriptors are summarised by their arithmetic mean and coefficient of variation; pitch and loudness are additionally summarised by their 20th, 50th (median), and 80th percentiles. The set also includes the arithmetic mean of the Alpha Ratio, the Hammarberg Index, and the spectral slopes from 0-500 Hz and 500-1500 Hz over all unvoiced segments, as well as the equivalent sound level.
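
The idea of applying functionals to a low-level descriptor contour can be sketched in base R. The function name `summarise_lld` is hypothetical, and the exact estimators are assumptions: this sketch uses R's sample standard deviation and default quantile interpolation, which may differ from what openSMILE computes internally.

```r
# Summarise one low-level descriptor contour (one value per frame)
# with the functionals named above. Hypothetical helper.
summarise_lld <- function(x) {
  x <- x[!is.na(x)]                                   # drop missing frames
  pct <- quantile(x, probs = c(0.2, 0.5, 0.8), names = FALSE)
  c(
    mean = mean(x),
    cv   = sd(x) / mean(x),                           # coefficient of variation
    p20  = pct[1],
    p50  = pct[2],
    p80  = pct[3]
  )
}

lld_summary <- summarise_lld(c(1, 2, 3, 4, 5))
print(lld_summary)
```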

Temporal features:

the rate of loudness peaks, i.e., the number of loudness peaks per second,
the mean length and the standard deviation of continuously voiced regions (f0 > 0),
the mean length and the standard deviation of unvoiced regions (f0 = 0; approximating pauses),
the number of continuous voiced regions per second (pseudo syllable rate).
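
The voicing-based temporal features above can be approximated from a frame-wise f0 track using run-length encoding. This is a sketch, not openSMILE's implementation: the function name `temporal_features`, the 10 ms frame step, and the assumption that the track contains no missing values are all illustrative choices.

```r
# Derive voiced/unvoiced region statistics from a frame-wise f0 track.
# Hypothetical helper; frame_step is the assumed frame shift in seconds.
temporal_features <- function(f0, frame_step = 0.01) {
  runs <- rle(f0 > 0)                                   # TRUE runs = voiced regions
  voiced_len   <- runs$lengths[runs$values] * frame_step
  unvoiced_len <- runs$lengths[!runs$values] * frame_step
  list(
    mean_voiced    = mean(voiced_len),
    sd_voiced      = sd(voiced_len),
    mean_unvoiced  = mean(unvoiced_len),
    sd_unvoiced    = sd(unvoiced_len),
    # number of voiced regions per second (pseudo syllable rate)
    voiced_per_sec = length(voiced_len) / (length(f0) * frame_step)
  )
}

# Toy track: two voiced regions (3 frames and 1 frame) in 10 frames.
f0 <- c(0, 0, 100, 110, 120, 0, 0, 0, 130, 0)
tf <- temporal_features(f0)
```

With the toy track, the two voiced regions last 0.03 s and 0.01 s, so the mean voiced length is 0.02 s and the pseudo syllable rate is 20 regions per second.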

Please consult Eyben et al. (2015, doi:10.1109/taffc.2015.2457417) for a full description of the features.
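
Because the return value is a named list, it can be reshaped into a table for inspection or filtering. The sketch below uses a mock list standing in for the real result; the three names are in the style of openSMILE's eGeMAPS output and the numeric values are invented for illustration only.

```r
# Mock result standing in for the list returned by eGeMAPS();
# names are illustrative and the values are made up.
feats <- list(
  "F0semitoneFrom27.5Hz_sma3nz_amean" = 31.2,
  "loudness_sma3_amean"               = 0.85,
  "jitterLocal_sma3nz_amean"          = 0.021
)

# One row per feature: name and value.
feature_table <- data.frame(
  feature = names(feats),
  value   = unlist(feats, use.names = FALSE),
  stringsAsFactors = FALSE
)

# Select the pitch-related rows by name prefix.
pitch_rows <- feature_table[grepl("^F0", feature_table$feature), ]
print(pitch_rows)
```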

References

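Eyben, F., Scherer, K. R., Schuller, B. W., et al. (2015). The Geneva Minimalistic Acoustic Parameter Set (GeMAPS) for Voice Research and Affective Computing. IEEE Transactions on Affective Computing. doi:10.1109/taffc.2015.2457417

Eyben, F., Weninger, F., Gross, F., and Schuller, B. (2013). Recent developments in openSMILE, the Munich open-source multimedia feature extractor. Proceedings of the ACM International Conference on Multimedia. doi:10.1145/2502081.2502224

Eyben, F., Wöllmer, M., and Schuller, B. (2010). openSMILE: The Munich versatile and fast open-source audio feature extractor. Proceedings of the ACM International Conference on Multimedia.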

humlab-speech/superassp documentation built on June 10, 2025, 3:02 p.m.
