Description Usage Arguments Value Examples
A function to analyze the output of the summary_corpus. Returns words that appeared in less than or equal to X percent of documents, if you pass X as a decimal. Otherwise, if X is a whole number returns the words that appeared in X or less documents.
1 | get_sparse(wf, ndocs, sparsity)
|
wf |
A data table containing the word and document frequencies accross the corpus. |
ndocs |
A number specifying the total number of unique documents in the corpus. |
sparsity |
A number either decimal or whole; interpreted as percent, whole as count. |
words A character vector of all the sparse terms.
1 2 3 4 5 | ## Not run:
sparse = get_sparse(wf, 100, .03)
sparse = get_sparse(wf, 100, 3)
## End(Not run)
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.