This data set provides some basic quantiative measures for all texts in the LOB corpus of written British English (Johansson et al. 1978).
A data frame with 500 rows and the following columns:
number of distinct types
number of tokens (including punctuation)
number of sentences
mean word length in characters, averaged over tokens
mean word length in characters, averaged over types
Marco Baroni <firstname.lastname@example.org>
Johansson, Stig; Leech, Geoffrey; Goodluck, Helen (1978). Manual of information to accompany the Lancaster-Oslo/Bergen corpus of British English, for use with digital computers. Technical report, Department of English, University of Oslo, Oslo.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.