Frequency spectra included as examples in Baayen (2001).
A list of 23 frequency spectra, i.e. objects of class
List elements are named according to the original files, but without the extension
See Baayen (2001, pp. 249-277) for details.
In particular, the following spectra are included:
Lewis Carroll, Alice's Adventures in Wonderland
Lewis Carroll, Through the Looking-Glass and What Alice Found There
H. G. Wells, War of the Worlds
Arthur Conan-Doyle, Hound of the Baskervilles
E. Douwes Dekker, Max Havelaar
An archeology text (Turkish)
A. H. Tammsaare, Truth and Justice (Estonian)
The context-governed subcorpus of the British National Corpus (BNC)
Sample of 1 million tokens from The Independent
Sample of 8 million tokens from The Independent
Nouns in -heid in the CELEX database (Dutch)
Nouns in -iteit in the CELEX database (Dutch)
Nouns in -ster in the CELEX database (Dutch)
Nouns in -in in the CELEX database (Dutch)
Simplex nouns in the CELEX database (Dutch)
Singular nouns in M. Innes, The Bloody Wood
Plural nouns in M. Innes, The Bloody Wood
Nouns in -ness in the written subcorpus of the BNC
Nouns in -ness in the context-governed subcorpus of the BNC
Nouns in -ness in the demographic subcorpus of the BNC
Counts of filarial worms in mites on rats
Context-vowel patterns in the TIMIT speech database
Word pairs in E. Douwes Dekker, Max Havelaar
Baayen, R. Harald (2001). Word Frequency Distributions. Kluwer, Dordrecht.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.