Description Usage Arguments Value
A text cleanup utility. takes in a dataset of documents, cleans it, removes stopwords and performs Porter stemming using the SnowballC package. It outputs a tidy corpus with stemmed words. Optionally is takes a custom stopwords list and optionally transforms the stemmed words to a readable form
1 | tm_make_corpus(documents, custom_sw = NULL, stem2readable = TRUE)
|
documents |
object. Must contain a 'text' column |
custom |
stopwords object. Must contain a 'word' column. Optional |
stem |
to readable flag. defaults to TRUE |
a corpus that includes the original word and the stemmed word in the 'stem' column
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.