speed_set_vocabulary: A function the reorgaizes vocabulary to speed up document...

Description Usage Arguments Value

View source: R/speed_set_vocabulary.R

Description

A function the reorgaizes vocabulary to speed up document term matrix formation using a string stem dictionary.

Usage

1
speed_set_vocabulary(vocab, term_frequency_threshold = 0, cores = 1)

Arguments

vocab

A vocabulary list object returned by the count_words() function.

term_frequency_threshold

A threshold below which all words appearing fewer than that many times in the corpus will be removed. Defaults to 0 in which case no words will be removed.

cores

The number of cores we wish to use for parallelization to speed up computation. Defaults to 1.

Value

A vocabulary list object.


matthewjdenny/SpeedReader documentation built on March 25, 2020, 5:32 p.m.