process_text: Preprocessing of raw or lemmatized text data

View source: R/process_text.R

process_textR Documentation

Preprocessing of raw or lemmatized text data

Description

Preprocessing of raw or lemmatized text data

Usage

process_text(
  src_docs = NULL,
  src_texts,
  target_docs = "titles_processed.txt",
  target_texts = "corpus_processed.txt",
  replace_from = NULL,
  replace_to = NULL,
  collocations = TRUE,
  min_nchar = NULL,
  remove_punct = TRUE,
  remove_symbols = TRUE,
  remove_numbers = TRUE,
  remove_url = TRUE,
  remove_separators = TRUE,
  split_hyphens = FALSE,
  padding = FALSE,
  stopwords = NULL,
  min_termfreq = NULL,
  max_termfreq = NULL,
  termfreq_type = c("count", "prop", "rank", "quantile"),
  min_docfreq = NULL,
  max_docfreq = NULL,
  docfreq_type = c("count", "prop", "rank", "quantile")
)

hlageek/rhSBM documentation built on Jan. 2, 2023, 8:02 p.m.