Description Format Source Examples
This data set was a case study in the COMURE project ("corpus-based, multivariate research of register variation in translated and non-translated Belgian Dutch") which was conducted at the Department of Translation, Interpreting and Communication of Ghent University between 2010 and 2014.
A data frame with 3762 rows and 5 variables.
Variant
The linguistic variant used in a set of alternatives (27 levels).
Variable
The linguistic variable specifying a set of alternatives (13 levels).
Variety
The dichotomization of Variant
into standard and non-standard.
Register
The register or "Text type" of the data (6 levels).
Language
The language (and source language) of the data (3 levels).
Delaere, I., G. De Sutter and K. Plevoets (2012) Is translated language more standardized than non-translated language? Target 24 (2), 203–224.
1 2 3 4 5 6 7 8 9 10 | data(COMURE)
# The execution of corregp may be slow, due to bootstrapping:
comure.crg <- corregp(Variant ~ Register * Language, data = COMURE, part = "Variable", b = 3000)
comure.crg
summary(comure.crg, parm = "b", add_ci = TRUE)
screeplot(comure.crg, add_ci = TRUE)
anova(comure.crg, nf = 2)
comure.col <- ifelse( xtabs(~ Variant + Variety, data = COMURE)[, "Standard"] > 0, "blue", "red")
plot(comure.crg, x_ell = TRUE, xsub = c("Register", "Language"), col_btm = comure.col,
col_top = "black")
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.