format_data: takes imported one-line-per-row data and prepares it for...


View source: R/format-data.R

Description

Takes imported data with one line of text per row and prepares it for later analysis.

Usage

format_data(
  .data,
  lemmatize = TRUE,
  stopwords = TRUE,
  sw_lexicon = "snowball",
  addl_stopwords = NA
)
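
A call using the signature above might look like the following. This is a minimal sketch, not taken from the package's own examples: it assumes the package has been installed from the jcai849/iNZightTA repository, that `format_data` is exported, and that the input text column is named "text". The sample sentences and the extra stopword "loudly" are made up for illustration.

```r
library(tibble)

# Made-up input: one line of text per row.
# Assumption: the text column is named "text"; the page does not say so.
imported <- tibble(text = c("The cats were sitting on the mats",
                            "Dogs bark loudly"))

# Only call the function if the package is available.
# Assumption: format_data is exported from iNZightTA.
if (requireNamespace("iNZightTA", quietly = TRUE)) {
  formatted <- iNZightTA::format_data(
    imported,
    lemmatize = TRUE,            # documented default
    stopwords = TRUE,            # documented default
    sw_lexicon = "snowball",     # documented default
    addl_stopwords = c("loudly") # hypothetical extra stopword
  )
  print(formatted)
}
```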

Arguments

.data

tibble with one line of text per row

lemmatize

logical, whether to lemmatize the tokens

stopwords

logical, whether to remove stopwords

sw_lexicon

string, name of the stopword lexicon used to remove stopwords

addl_stopwords

character vector of additional, user-supplied stopwords

Value

a tibble with one token per row; removed stopwords are left as NA values, and the column for analysis is named "text"
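
To make the shape of the return value concrete, the base R sketch below mimics it: one token per row, with stopwords replaced by NA rather than dropped, in a column named "text". The helper `sketch_one_token_per_row` is hypothetical and is not the package's implementation. It uses a naive tokenizer and a hand-written stopword list, skips lemmatization, and returns a plain data frame instead of a tibble.

```r
# Hypothetical stand-in for illustration only; NOT the package's implementation.
sketch_one_token_per_row <- function(lines, stopword_list) {
  # Lowercase, then split each line on runs of non-letter characters.
  tokens <- unlist(strsplit(tolower(lines), "[^a-z]+"))
  # Drop empty strings produced by leading separators.
  tokens <- tokens[nzchar(tokens)]
  # Replace stopwords with NA instead of removing their rows.
  tokens[tokens %in% stopword_list] <- NA_character_
  data.frame(text = tokens, stringsAsFactors = FALSE)
}

result <- sketch_one_token_per_row(
  c("The cat sat on the mat", "A dog barked"),
  stopword_list = c("the", "on", "a") # made-up stopword list
)
print(result)
```

Keeping the NA rows preserves each token's position in the original text, which is presumably why the function leaves them in place instead of filtering them out.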


jcai849/iNZightTA documentation built on Dec. 2, 2020, 6:59 a.m.