Description Usage Format Details License Source References
A lemmatization table generated from filtered OpenCorpora morphological dictionary. This table can be useful for simple and fast
"word form-to-lemma" replacement, for example using lemmatize_strings
from textstem package.
Table contains \Sexpr{nrow(rulexicon::hash_lemmas_opencorpora)} words forms of
\Sexpr{length(unique(rulexicon::hash_lemmas_opencorpora$lemma))} lemmas.
1 |
A data table with \Sexpr{nrow(rulexicon::hash_lemmas_opencorpora)} rows and \Sexpr{ncol(rulexicon::hash_lemmas_opencorpora)} variables:
a textual token (word) inflected by affixes
a base form or lemma
The lemmatization table was generated from original OpenCorpora morphological dictionary after filtering unique pairs of token-lemma and unique tokens (first occurrence for both).
The base morphological dictionary of OpenCorpora is published under Creative Commons "Attribution-ShareAlike" 3.0 Unported License (CC BY-SA 3.0).
http://opencorpora.org/files/export/dict/dict.opcorpora.txt.zip
OpenCorpora project web-page: http://opencorpora.org
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.