RandomActsofPizza: Documenting an Analysis for Kaggle's Random Acts of Pizza Competition

Documented in ProcessText

#' Preprocess text fields
#'
#' Preprocesses text fields so they can be used for further analysis. The
#' preprocessing follows these steps:
#' \itemize{
#' \item Convert to corpus
#' \item Convert to lowercase
#' \item Convert to plain text
#' \item Remove punctuation
#' \item Remove English stopwords and any additional words given in \code{remove}
#' \item Stem words
#' \item Convert to document term matrix
#' }
#'
#' @param x character vector to preprocess; each element is treated as one document
#' @param remove character vector of additional words to remove (default NULL)
#' @return A document term matrix
#' @export
#'
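ProcessText <- function(x, remove = NULL) {
    # Build a corpus with one document per element of x
    Corp <- Corpus(VectorSource(x))
    # Lowercase first so the lowercase stopword list matches regardless of case
    Corp <- tm_map(Corp, content_transformer(tolower))
    Corp <- tm_map(Corp, PlainTextDocument)
    Corp <- tm_map(Corp, removePunctuation)
    # Drop caller-supplied words together with the English stopword list
    Corp <- tm_map(Corp, removeWords, c(remove, stopwords("english")))
    Corp <- tm_map(Corp, stemDocument)
    # The document term matrix is the return value
    DocumentTermMatrix(Corp)
}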
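
To make the steps above concrete, here is a minimal base-R sketch of what lowercasing, punctuation removal, stopword removal, and term counting do to two short texts. It is an illustration only, not the package's implementation: it does not use tm, it skips stemming, and the names docs, stop_words, clean, tokens, vocab, and dtm, along with the example sentences and the tiny stopword list, are all hypothetical.

```r
# Illustrative base-R sketch; not the package's implementation.
# `docs` and `stop_words` are made-up example data.
docs <- c("Pizza, please! The kids are hungry.",
          "Will pay it forward: pizza for the family?")
stop_words <- c("the", "are", "it", "for", "will")

# Lowercase, then strip punctuation
clean <- gsub("[[:punct:]]", "", tolower(docs))

# Split on whitespace and drop the stopwords
tokens <- lapply(strsplit(clean, "[[:space:]]+"),
                 function(w) w[!w %in% stop_words])

# Build the vocabulary, then count each term per document
vocab <- sort(unique(unlist(tokens)))
dtm <- t(sapply(tokens, function(w) table(factor(w, levels = vocab))))
rownames(dtm) <- paste0("doc", seq_along(docs))

print(dtm)
```

Each row of dtm is a document and each column a term, which is the same shape of result that a document term matrix represents, although tm stores it in a sparse format.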

kuhnrl30/RandomActsofPizza documentation built on May 20, 2019, 7:06 p.m.

R/ProcessText.R

Defines functions ProcessText
