prep: Prep a dfm doing or don't doing certain preprocessing steps

View source: R/prep.R

prepR Documentation

Prep a dfm doing or don't doing certain preprocessing steps

Description

Prep a dfm doing or don't doing certain preprocessing steps

Usage

prep(
  x,
  remove_punct,
  remove_num,
  lowercase,
  stem,
  remove_stop,
  infrequent_terms,
  tfidf,
  use_ngrams,
  stopwords = stopwords::stopwords(language = "en"),
  pb = NULL
)

Arguments

x

Preferably a corpus object but can contain everything accepted by quanteda::tokens.

remove_punct, remove_num, lowercase, stem, remove_stop, infrequent_terms, tfidf, use_ngrams

Logical. Should a preprocessing step be included or not.

stopwords

A character vector of stopwords.

pb

A progress_bar environment from the progress package.

Value

a dfm.


JBGruber/smlhelper documentation built on Oct. 7, 2022, 3:43 p.m.