prep | R Documentation |
Prep a dfm doing or don't doing certain preprocessing steps
prep( x, remove_punct, remove_num, lowercase, stem, remove_stop, infrequent_terms, tfidf, use_ngrams, stopwords = stopwords::stopwords(language = "en"), pb = NULL )
x |
Preferably a corpus object but can contain everything accepted by quanteda::tokens. |
remove_punct, remove_num, lowercase, stem, remove_stop, infrequent_terms, tfidf, use_ngrams |
Logical. Should a preprocessing step be included or not. |
stopwords |
A character vector of stopwords. |
pb |
A progress_bar environment from the progress package. |
a dfm.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.