List terms with the highest number of occurrences in the document-term matrix of a corpus, possibly grouped by the levels of a variable.
An optional vector of values giving the groups for which most frequent terms should be reported.
The maximal number of terms to report (for each group, if applicable).
A list of matrices, one for each level of the variable, with columns:
"\ (rather than in other levels).
"Level": the number of occurrences of the term in the level ("internal").
"Global": the number of occurrences of the term in the corpus.
"t value": the quantile of a normal distribution corresponding the probability "Prob.".
"Prob.": the probability of observing such an extreme (high or low) number of occurrences of the term in the level, under an hypergeometric distribution.
1 2 3 4 5
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.