durationsGe: Durational measurements on the Dutch prefix ge-

Description Usage Format References Examples

Description

Durational measurements on the Dutch prefix ge- in the Spoken Dutch Corpus.

Usage

1

Format

A data frame with 428 observations on the following 8 variables.

Word

a factor with the words as levels.

Frequency

a numeric vector with the word's absolute frequency in the Spoken Dutch Corpus.

Speaker

a factor with the speakers as levels.

Sex

a factor with levels female and male, this information is missing for one speaker.

YearOfBirth

a numeric vector with years of birth.

DurationOfPrefix

a numeric vector with the duration of the prefix -ont in seconds.

SpeechRate

a numeric vector coding speech rate in number of syllables per second.

NumberSegmentsOnset

a numeric vector for the number of segments in the onset of the stem.

References

Pluymaekers, M., Ernestus, M. and Baayen, R. H. (2005) Frequency and acoustic length: the case of derivational affixes in Dutch, Journal of the Acoustical Society of America, 118, 2561-2569.

Examples

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
	## Not run: 
    data(durationsGe)
    durationsGe$Frequency = log(durationsGe$Frequency + 1)
    durationsGe$YearOfBirth = durationsGe$YearOfBirth - 1900

    durationsGe.lm = lm(DurationOfPrefix ~ Frequency+SpeechRate, data = durationsGe)
    summary(durationsGe.lm)

    # ---- model criticism
    
    plot(durationsGe.lm)
    outliers = c(271, 392, 256, 413, 118, 256)
    durationsGe.lm = lm(DurationOfPrefix ~ Frequency + SpeechRate, 
      data = durationsGe[-outliers, ])
    summary(durationsGe.lm)
  
## End(Not run)

languageR documentation built on May 2, 2019, 10:02 a.m.