Description Usage Arguments Value Examples
Similar to get_sparse but looks at word frequency not doc count. If X is whole number, returns the X least frequent terms. If X is decimal returns the X
1 | get_bottom_terms(wf, nterms, count)
|
wf |
A data table containing the word and document frequencies accross the corpus. |
nterms |
A number specifying the total number of unique words in the corpus. |
count |
A number either decimal or whole; interpreted as percent, whole as count. |
words A character vector of the least frequent terms
1 2 3 4 5 | ## Not run:
infreq = get_bottom_terms(wf, 100000, 5000) #returns 5000 least common terms
infreq = get_bottom_terms(wf, 100000, .05) #returns the bottom 5% of terms
## End(Not run)
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.