lang.support.en: Language support for English

Description Usage Arguments Details References Examples

View source: R/lang.support-en.R

Description

This function adds support for English to the koRpus package. You should not need to call it manually, as that is done automatically when this package is being loaded.

Usage

1

Arguments

...

Optional arguments for set.lang.support.

Details

The POS tags cover tag definitions from multiple sources. Please note that there is one tag, "PRP", that is defined in both PENN[3] and BNC[4] tagsets, but with different meanings: The PENN tag marks personal pronouns, whereas the BNC tag marks prepositions (except "of"). Since the conflicting tag is not being used by TreeTagger's PENN parameter set, but in its BNC set, koRpus also uses the BNC definition. Keep this in mind if you use this language support package with alternative taggers.

In particular, this function adds the following:

Hyphenation patterns are provided by means of the sylly.en package.

References

[1] http://www.cis.uni-muenchen.de/~schmid/tools/TreeTagger/

[2] http://www.cis.uni-muenchen.de/~schmid/tools/TreeTagger/data/Penn-Treebank-Tagset.pdf

[3] https://www.ling.upenn.edu/courses/Fall_2003/ling001/penn_treebank_pos.html

[4] http://www.natcorp.ox.ac.uk/docs/c5spec.html

Examples

1

koRpus.lang.en documentation built on Oct. 24, 2020, 5:07 p.m.