germaparl_by_lp: Table with information on GermaParl by legislative period

Description Usage Format Value

Description

A dataset with information on the corpus by legislative period is included in the package to be included in the data report of the package vignette.

Usage

1

Format

A data.frame with 5 rows and 6 variables with summary statistics on the GermaParl corpus on a year-by-year basis.

lp

legislative period (integer value)

protocols

total number of protocols included in the corpus for the respective legislative period (integer value)

first

date of the first plenary protocol in the legislative period (Date class)

last

date of the last plenary protocol in the legislative period (Date class)

size

number of tokens in subcorpus for the respective legislative period (integer value)

unknown_total

total number of words that cannot be lemmatized, resulting in #unknown# tag (numeric value)

unknown_share

share of words that cannot be lemmatized, resulting in #unknown# tag (numeric value)

The table is based on v1.0.6 of the corpus. To prepare the table, the script available at data-raw/stats_for_vignette.R has been used.

Value

A data.frame.


GermaParl documentation built on Oct. 23, 2020, 8:27 p.m.