| writtenVariationLijk | R Documentation |
This dataset documents variation in the use of the 80 most frequent words ending in the suffix -lijk in written Dutch.
data(writtenVariationLijk)
A data frame with 560 observations on the following 5 variables.
Corpusa factor with as levels the sampled newspapers:
belang (Het Belang van Limburg),
gazet (De Gazet van Antwerpen),
laatnieu (Het Laatste Nieuws),
limburg (De Limburger),
nrc (NRC Handelsblad),
stand (De Standaard), and
tele (De Telegraaf).
Worda factor with the 80 most frequent words ending in -lijk.
Counta numeric vector with token counts in the CONDIV corpus.
Countrya factor with levels Flanders and
Netherlands.
Registera factor with levels National,
Quality and Regional coding the type of newspaper.
Keune, K., Ernestus, M., Van Hout, R. and Baayen, R.H. (2005) Social, geographical, and register variation in Dutch: From written 'mogelijk' to spoken 'mok', Corpus Linguistics and Linguistic Theory, 1, 183-223.
## Not run:
data(writtenVariationLijk)
require(lme4)
require(lmerTest)
require(lme4)
writtenVariationLijk.lmer = glmer(Count ~ Country * Register + (1|Word),
control=glmerControl(optimizer="optimx",optCtrl=list(method="nlminb")),
data = writtenVariationLijk, family = "poisson")
writtenVariationLijk.lmerA = glmer(Count ~ Country * Register + (Country|Word),
control=glmerControl(optimizer="optimx",optCtrl=list(method="nlminb")),
data = writtenVariationLijk, family = "poisson")
anova(writtenVariationLijk.lmer, writtenVariationLijk.lmerA)
summary(writtenVariationLijk.lmerA)
## End(Not run)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.