cleantext: Text Cleaner

View source: R/ngramComponents.R

cleantextR Documentation

Text Cleaner

Description

background function to load.

Usage

cleantext(
  text,
  language = "english",
  punct = FALSE,
  stop.words = TRUE,
  number.words = TRUE
)

Arguments

text

character Vector of strings to clean.

language

character Language to use for cleaning. Default is "english".

punct

logical Should punctuation be kept as tokens? Default is TRUE.

stop.words

logical Should stop words be kept? default is TRUE.

number.words

logical Should numbers be converted to words? default is TRUE.

Value

character Vector of cleaned strings.


doc2concrete documentation built on June 29, 2022, 1:05 a.m.