Phoneme: Phonetic Time Series.
In classiFunc: Classification of Functional Data


A data set containing phonetic time series extracted from audio recordings of English words.

Phoneme

An object of class data.frame with 100 rows (observations) and 65 columns (variables).

This data set is a subsample of the data used in Hamooni and Mueen (2014). Each series is extracted from segmented audio collected from Google Translate, oxforddictionaries.com and the Merriam-Webster online dictionary. Each of these sources has different features. Audio files collected from Google Translate, Oxford and Merriam-Webster are recorded at 22050, 44100 and 11025 samples per second, respectively. All of them have male and female speakers in different ratios. The Oxford dictionary includes British and American pronunciations of each word. After data collection, the authors segmented the waveforms of the words into phonemes using the Forced Aligner tool from the Penn Phonetics Laboratory.

col 1:64: the values of one functional observation (one observation per row).
target: the label encoding the word of the functional observation.
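
The format described above can be illustrated with a minimal sketch that separates the 64 functional columns from the label column. It assumes the data set can be loaded with `data("Phoneme", package = "classiFunc")`. If the package is not installed, the sketch falls back to a synthetic stand-in of the same shape (random numbers, not real phoneme data). The names `curves` and `labels` are illustrative only.

```r
# Load the real data if classiFunc is available; otherwise build a
# synthetic stand-in with the documented shape (100 rows, 64 + 1 columns).
# The stand-in holds random values and is NOT real phoneme data.
if (requireNamespace("classiFunc", quietly = TRUE)) {
  data("Phoneme", package = "classiFunc")
} else {
  set.seed(1)
  Phoneme <- as.data.frame(matrix(rnorm(100 * 64), nrow = 100))
  Phoneme$target <- factor(sample(c("a", "b"), 100, replace = TRUE))
}

# Columns 1 to 64 hold the functional observations, one per row.
curves <- as.matrix(Phoneme[, 1:64])

# Column 65 is the label column described above as `target`.
labels <- Phoneme[[65]]

dim(curves)     # number of observations and of values per observation
length(labels)  # one label per observation
```

Selecting the label by position rather than by name keeps the sketch valid even if the column name differs from the one listed in the format description.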

http://timeseriesclassification.com/description.php?Dataset=Phoneme

Hamooni, Hossein, and Mueen, Abdullah. "Dual-domain hierarchical classification of phonetic time series." Data Mining (ICDM), 2014 IEEE International Conference on. IEEE, 2014.

classiFunc documentation built on May 2, 2019, 2:04 a.m.