trim: Trim Word Frequency Data

Description

Removes low-frequency words and, optionally, randomly subsamples the words that remain.

Usage

  trim(wfm, min.count=5, min.doc=5, sample=NULL)

Arguments

wfm

an object of class wfm, or a data matrix

min.count

the smallest permissible word count

min.doc

the fewest permissible documents a word can appear in

sample

how many words to randomly retain

Value

If sample is a number then this many words will be retained after the min.count and min.doc filters have been applied.
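The filtering described above can be illustrated with a minimal base-R sketch. This is not the package's own code: `trim_sketch` is a hypothetical helper, and it assumes a plain count matrix with words as rows and documents as columns, with both thresholds treated as inclusive.

```r
# Hypothetical helper for illustration only; not the package's trim().
# Assumes `counts` is a numeric matrix: words as rows, documents as columns.
trim_sketch <- function(counts, min.count = 5, min.doc = 5, sample = NULL) {
  total  <- rowSums(counts)       # total occurrences of each word
  n.docs <- rowSums(counts > 0)   # number of documents containing each word

  # Keep words that satisfy both thresholds (assumed inclusive).
  keep <- which(total >= min.count & n.docs >= min.doc)

  # Optionally retain a random subset of the surviving words.
  if (!is.null(sample) && sample < length(keep)) {
    # Sample positions, not values, so a single surviving index
    # is never misread as the range 1:index.
    keep <- sort(keep[sample.int(length(keep), sample)])
  }

  counts[keep, , drop = FALSE]
}

# Toy data: four words counted across three documents.
counts <- matrix(c(3, 4, 0,
                   1, 0, 0,
                   2, 2, 2,
                   0, 6, 1),
                 nrow = 4, byrow = TRUE,
                 dimnames = list(words = c("tax", "zebra", "vote", "budget"),
                                 docs  = c("d1", "d2", "d3")))

trimmed <- trim_sketch(counts, min.count = 5, min.doc = 2)
rownames(trimmed)   # "tax" "vote" "budget"; "zebra" occurs only once

# Retain two of the surviving words at random.
sampled <- trim_sketch(counts, min.count = 5, min.doc = 2, sample = 2)
```

In this sketch the random subsample is drawn only from words that have already passed both filters, which matches the order of operations described above.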

Author(s)

Will Lowe

See Also

wfm


