Description Usage Arguments Value
View source: R/speed_set_vocabulary.R
A function the reorgaizes vocabulary to speed up document term matrix formation using a string stem dictionary.
1 | speed_set_vocabulary(vocab, term_frequency_threshold = 0, cores = 1)
|
vocab |
A vocabulary list object returned by the count_words() function. |
term_frequency_threshold |
A threshold below which all words appearing fewer than that many times in the corpus will be removed. Defaults to 0 in which case no words will be removed. |
cores |
The number of cores we wish to use for parallelization to speed up computation. Defaults to 1. |
A vocabulary list object.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.