Description Usage Format References Examples
Corpus data for measuring the productivity of German word formation affixes -bar, -lich, -sam, -ös, -tum, Klein-, -chen and -lein (Evert & Lüdeling 2001). Data were extracted from two volumes of the German daily newspaper Stuttgarter Zeitung, then manually cleaned and normalized.
1 |
A list of 8 character vectors for the different affixes, with names
klein
(Klein-), bar
(-bar),
chen
(-chen), lein
(-lein),
lich
(-lich), oes
(-ös),
sam
(-sam), tum
(-tum).
Each vector contains all relevant tokens from the corpus in their original (chronological) ordering, so vocabulary growth curves can be determined from the vectors in addition to type frequency lists and frequency spectra.
Evert, Stefan and Lüdeling, Anke (2001). Measuring morphological productivity: Is automatic preprocessing sufficient? In Proceedings of the Corpus Linguistics 2001 Conference, pages 167–175, Lancaster, UK.
1 2 3 4 5 6 7 | str(EvertLuedeling2001)
# tokens and type counts for the different affixes
sapply(EvertLuedeling2001, function (x) {
y <- vec2tfl(x)
c(N=N(y), V=V(y))
})
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.