list_supported_pos: Listing of parts of speech to include in stopword lists you...

View source: R/list_supported_pos.R

list_supported_posR Documentation

Listing of parts of speech to include in stopword lists you generate by generate_stoplist().

Description

The function gives you a character vector of supported parts of speech (e.g. prepositions). They are represented by abbreviations.

Usage

list_supported_pos()

Details

The stopwoRds package relies on multilingual_stoplist, a large multilingual table with individual word forms as rows, derived from the Universal Dependencies treebanks. Each word form comes along with its lemma and part of speech, as well as with the language name and its ISO-639-nnnn code. This function gives you unique values from the POS column of multilingual_stoplist. The parts of speech (POS) are common for all supported languages ("Universal Part-of-Speech tags").

Value

A character vector.

Author(s)

Silvie Cinkova, Maciej Eder

References

http://universaldependencies.org

Nivre, Joakim; Agić, Željko; Ahrenberg, Lars; et al., 2017, Universal Dependencies 2.1, LINDAT/CLARIN digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University, http://hdl.handle.net/11234/1-2515.

See Also

list_supported_language_names, list_supported_language_ids, generate_stoplist, multilingual_stoplist


computationalstylistics/stopwoRds documentation built on April 8, 2024, 11:04 p.m.