Trim Word Frequency Data

Description

Removes low-frequency words and optionally subsamples the words that remain

Usage

  trim(wfm, min.count=5, min.doc=5, sample=NULL)

Arguments

wfm

an object of class wfm, or a data matrix

min.count

the smallest permissible word count

min.doc

the fewest permissible documents a word can appear in

sample

how many words to randomly retain

Value

If sample is a number, then that many words are retained after the min.count and min.doc filters have been applied.
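
Example

The filtering described above can be sketched in base R. This is an illustration only, not the package's own code: trim_sketch is a hypothetical helper, and it assumes a plain numeric matrix with words in rows and documents in columns. It applies the two thresholds first and then, if sample is given, randomly retains that many of the surviving words.

```r
# Hypothetical helper, not part of the package.
# Assumes words are in rows and documents are in columns.
trim_sketch <- function(m, min.count = 5, min.doc = 5, sample = NULL) {
  total <- rowSums(m)        # total count of each word across all documents
  ndocs <- rowSums(m > 0)    # number of documents each word appears in
  keep <- which(total >= min.count & ndocs >= min.doc)
  # Subsample only after both filters, and only if fewer words are requested
  if (!is.null(sample) && sample < length(keep)) {
    keep <- sort(keep[sample.int(length(keep), sample)])
  }
  m[keep, , drop = FALSE]
}

# Toy data: 4 words by 3 documents
m <- matrix(c(3, 2, 4,
              1, 0, 0,
              5, 5, 0,
              2, 2, 2),
            nrow = 4, byrow = TRUE,
            dimnames = list(c("the", "rare", "tax", "vote"),
                            c("d1", "d2", "d3")))

trimmed <- trim_sketch(m, min.count = 5, min.doc = 2)
rownames(trimmed)   # "the" "tax" "vote"; "rare" fails both thresholds

set.seed(1)
sampled <- trim_sketch(m, min.count = 5, min.doc = 2, sample = 2)
nrow(sampled)       # 2 words retained at random from the 3 survivors
```

With the package itself, the equivalent call would have the shape trim(wfm, min.count=5, min.doc=2, sample=2), as given under Usage.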

Author(s)

Will Lowe

See Also

wfm
