sw_nltk_ru: Stopwords List of Python's NLTK Library (augmented)

Description Usage Format License Source References

Description

A dataset containing a character vector of stopwords based on Python's NLTK library augmented by Michail Kalinskiy using filtered stopwords-ru list.

Usage

1

Format

A character vector with \Sexpr{length(rulexicon::sw_nltk_ru)} elements.

License

The Python's NLTK library and JavaScript's stopwords-iso NPM package stopwords lists are published under MIT License.

Source

Complex source, see References

References

Python's NLTK library stopwords list: https://github.com/mitmedialab/DataBasic/blob/master/nltk_data/corpora/stopwords/russian

JavaScript's stopwords-iso NPM package stopwords list: https://github.com/stopwords-iso/stopwords-ru

Filtered by Michail Kalinskiy version of JavaScript's stopwords-iso NPM package stopwords list: https://dev.kmint21.info/posts/python-summa


dmafanasyev/rulexicon documentation built on Jan. 25, 2022, 4:18 p.m.