get_abundant: Get List of Abundant Terms
In avkoehl/textprocessingDSI: Clean an arbitrarily large corpus for topic modelling over many cores


Analyzes the output of summary_corpus, similar to get_spare. If abundance is a decimal, returns the words that appear in at least that proportion of the documents (for example, .95 means 95 percent). If abundance is a whole number, returns the words that appear in at least that many documents.

get_abundant(wf, ndocs, abundance)

`wf`	A data table containing the word and document frequencies across the corpus.
`ndocs`	A number specifying the total number of unique documents in the corpus.
`abundance`	A number, either decimal or whole. A decimal is interpreted as a proportion of documents, a whole number as a document count.

words A character vector of all the abundant terms.

## Not run: 
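# Terms that appear in at least 95 percent of the 100 documents
abundant = get_abundant(wf, 100, .95)
# Terms that appear in at least 95 documents
abundant = get_abundant(wf, 100, 95)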

## End(Not run)
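
The two ways of reading `abundance` can be illustrated with a small stand-in function. This is only a sketch of the behavior described above, not the package's implementation: the function name `sketch_abundant`, the column names `word` and `doc_freq`, and the rule that values strictly below 1 count as proportions are all assumptions made for illustration. A plain data frame is used in place of a data table so the example runs without extra packages.

```r
# Hypothetical stand-in for get_abundant(); NOT the package's code.
# Assumes `wf` has columns `word` and `doc_freq` (names are assumptions).
sketch_abundant <- function(wf, ndocs, abundance) {
  # Assumption: a value below 1 is a proportion of documents,
  # anything else is an absolute document count.
  min_docs <- if (abundance < 1) abundance * ndocs else abundance
  # Keep the words whose document frequency meets the threshold.
  as.character(wf$word[wf$doc_freq >= min_docs])
}

# Toy word-frequency table for a corpus of 100 documents.
wf <- data.frame(
  word = c("the", "of", "corpus", "topic"),
  doc_freq = c(100, 96, 40, 3),
  stringsAsFactors = FALSE
)

by_proportion <- sketch_abundant(wf, 100, 0.95)  # threshold as a proportion
by_count <- sketch_abundant(wf, 100, 95)         # threshold as a count
print(by_proportion)
print(by_count)
```

With 100 documents, both calls use the same effective threshold, so they select the same words. How the real function treats an `abundance` of exactly 1 is not stated in this documentation.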

avkoehl/textprocessingDSI documentation built on June 5, 2019, 7:41 p.m.
