Description Usage Arguments Value Examples
A function to analyze the output of the summary_corpus similar to get_spare. Returns words that appeared in more than or equal to X percent of documents, if you pass X as a decimal. Otherwise, if X is a whole number returns the words that appeared in X or more documents.
1 | get_abundant(wf, ndocs, abundance)
|
wf |
A data table containing the word and document frequencies accross the corpus. |
ndocs |
A number specifying the total number of unique documents in the corpus. |
abundance |
A number either decimal or whole; interpreted as percent, whole as count. |
words A character vector of all the abundant terms.
1 2 3 4 5 | ## Not run:
sparse = get_abundant(wf, 100, .95)
sparse = get_abundant(wf, 100, 95)
## End(Not run)
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.