| english | R Documentation |
This data set gives mean visual lexical decision latencies and word naming latencies to 2284 monomorphemic English nouns and verbs, averaged for old and young subjects, with various predictor variables.
data(english)
A data frame with 4568 observations on the following variables.
RTlexdecnumeric vector of log RT in visual lexical decision.
RTnamingnumeric vector of log RT in word naming.
Familiaritynumeric vector of subjective familiarity ratings.
Worda factor with 2284 words.
AgeSubjecta factor with as levels the age group of the subject:
young versus old.
WordCategorya factor with as levels the word categories
N (noun) and V (verb).
WrittenFrequencynumeric vector with log frequency in the CELEX lexical database.
WrittenSpokenFrequencyRationumeric vector with the logged ratio of written frequency (CELEX) to spoken frequency (British National Corpus).
FamilySizenumeric vector with log morphological family size.
DerivationalEntropynumeric vector with derivational entropy.
InflectionalEntropynumeric vector with inflectional entropy.
NumberSimplexSynsetsnumeric vector with the log-transformed count of synonym sets in WordNet in which the word is listed.
NumberComplexSynsetsnumeric vector with the log-transformed count of synonym sets in WordNet in which the word is listed as part of a compound.
LengthInLettersnumeric vector with length of the word in letters.
Ncountnumeric vector with orthographic neighborhood density, defined as the number of lemmas in CELEX with the same length (in letters) at Hamming distance 1.
MeanBigramFrequencynumeric vector with mean log bigram frequency.
FrequencyInitialDiphonenumeric vector with log frequency of initial diphone.
ConspelVnumeric vector with type count of orthographic neighbors.
ConspelNnumeric vector with token count of orthographic neighbors.
ConphonVnumeric vector with type count of phonological neighbors.
ConphonNnumeric vector with token count of phonological neighbors.
ConfriendsVnumeric vector with type counts of consistent words.
ConfriendsNnumeric vector with token counts of consistent words.
ConffVnumeric vector with type count of forward inconsistent words
ConffNnumeric vector with token count of forward inconsistent words
ConfbVnumeric vector with type count of backward inconsistent words
ConfbNnumeric vector with token count of backward inconsistent words
NounFrequencynumeric vector with the frequency of the word used as noun.
VerbFrequencynumeric vector with the frequency of the word used as verb.
CVfactor specifying whether the initial phoneme of
the word is a consonant (C) or a vowel (V).
Obstruentfactor specifying whether the initial phoneme
of the word is a continuant (cont) or an obstruent (obst).
Fricationfactor specifying whether the initial phoneme
has a burst (burst) or frication (frication) for
consonant-initial words, and for vowel-initial words whether the vowel is
long or short.
Voicefactor indicating whether the initial phoneme is voiced
or voiceless.
FrequencyInitialDiphoneWordnumeric vector with the log-transformed frequency of the initial diphone given that it is syllable-initial.
FrequencyInitialDiphoneSyllablenumeric vector with the log-transformed frequency of the initial diphone given that it is word initial.
CorrectLexdecnumeric vector with the proportion of subjects that accepted the item as a word in lexical decision.
Balota, D.A., Cortese, M.J. and Pilotti, M. (1999) Visual lexical decision latencies for 2906 words,
and
Spieler, D. H. and Balota, D. A. (1998) Naming latencies for 2820 words.
Available at http://psychnet.wustl.edu/coglab/?page_id=208
Balota, D., Cortese, M., Sergent-Marshall, S., Spieler, D. and Yap, M. (2004) Visual word recognition for single-syllable words, Journal of Experimental Psychology:General, 133, 283-316.
Baayen, R.H., Feldman, L. and Schreuder, R. (2006) Morphological influences on the recognition of monosyllabic monomorphemic words, Journal of Memory and Language, 53, 496-512.
## Not run:
data(english)
# ---- orthogonalize orthographic consistency measures
items = english[english$AgeSubject == "young",]
items.pca = prcomp(items[ , c(18:27)], center = TRUE, scale = TRUE)
x = as.data.frame(items.pca$rotation[,1:4])
items$PC1 = items.pca$x[,1]
items$PC2 = items.pca$x[,2]
items$PC3 = items.pca$x[,3]
items$PC4 = items.pca$x[,4]
items2 = english[english$AgeSubject != "young", ]
items2$PC1 = items.pca$x[,1]
items2$PC2 = items.pca$x[,2]
items2$PC3 = items.pca$x[,3]
items2$PC4 = items.pca$x[,4]
english = rbind(items, items2)
# ---- add Noun-Verb frequency ratio
english$NVratio = log(english$NounFrequency+1)-log(english$VerbFrequency+1)
# ---- build model with ols() from rms
library(rms)
english.dd = datadist(english)
options(datadist = 'english.dd')
english.ols = ols(RTlexdec ~ Voice + PC1 + MeanBigramFrequency +
rcs(WrittenFrequency, 5) + rcs(WrittenSpokenFrequencyRatio, 3) +
NVratio + WordCategory + AgeSubject +
rcs(FamilySize, 3) + InflectionalEntropy +
NumberComplexSynsets + rcs(WrittenFrequency, 5) : AgeSubject,
data = english, x = TRUE, y = TRUE)
# ---- plot partial effects
plot(Predict(english.ols))
# ---- validate the model
validate(english.ols, bw = TRUE, B = 200)
## End(Not run)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.