A dataset with year-by-year information on the corpus. It is shipped with the package so that it can be used in the data report of the package vignette.
A data.frame with 22 rows and 6 variables providing summary
statistics on the GermaParl corpus on a year-by-year basis.
year reported on in the row (integer value)
total number of protocols included in the corpus for the respective year (integer value)
number of protocols prepared based on plain text versions of the protocols (integer value)
number of protocols prepared based on PDF versions of the protocols (integer value)
number of tokens in the subcorpus for the respective year (integer value)
share of words that cannot be lemmatized, resulting in the #unknown# tag (numeric value)
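The layout described above can be checked programmatically. The following is a minimal sketch, not part of the package: `check_yearly_stats()` is a hypothetical helper, and it assumes that the columns appear in the order in which they are listed above (five integer columns followed by one numeric column). Since the variable names are not given here, the check relies on column positions and types only. The placeholder data frame uses made-up column names and dummy values, not real corpus statistics.

```r
# Hypothetical helper (not part of the package): returns TRUE if a data.frame
# matches the documented layout of 22 rows and 6 variables, where the first
# five columns are integer and the sixth is a non-integer numeric.
check_yearly_stats <- function(df) {
  stopifnot(is.data.frame(df))
  # Check dimensions first so that the positional checks below are safe.
  if (nrow(df) != 22L || ncol(df) != 6L) {
    return(FALSE)
  }
  first_five_integer <- all(vapply(df[1:5], is.integer, logical(1)))
  sixth_numeric <- is.numeric(df[[6]]) && !is.integer(df[[6]])
  first_five_integer && sixth_numeric
}

# Placeholder data: column names and values are assumptions for illustration
# only and do not reflect the actual dataset.
placeholder <- data.frame(
  year = 1:22,
  protocols_total = rep(0L, 22),
  protocols_txt = rep(0L, 22),
  protocols_pdf = rep(0L, 22),
  tokens = rep(0L, 22),
  unknown_share = rep(0, 22)
)

check_yearly_stats(placeholder)
```

The same function can be applied to the dataset itself once it has been loaded, to confirm that the installed version still matches this description.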
The table is based on v1.0.6 of the corpus. To prepare the table, the script available at data-raw/stats_for_vignette.R was used.
A data.frame.