transform_filter_commons: Remove terms from a document-term matrix

Description Usage Arguments See Also

View source: R/transformers.R

Description

This function removes very common and very uncommon words from a document-term matrix.

Usage

1
transform_filter_commons(dtm, term_freq = c(uncommon = 0.001, common = 0.975))

Arguments

dtm

a document-term matrix of class dgCMatrix or dgTMatrix.

term_freq

numeric vector of 2 values in between 0 and 1. The first element corresponds to frequency of uncommon words; the second element corresponds to the frequency of common words. Terms which are observed less than first value or frequency or more than second will be filtered out.

See Also

prune_vocabulary, transform_tf, transform_tfidf, transform_binary


text2vec documentation built on May 29, 2017, 9:09 a.m.

Search within the text2vec package
Search all R packages, documentation and source code