tidify_to_bigrams: Prepare data for text analysis.

Description

Create a tidy data.frame of bigrams.

Usage

tidify_to_bigrams_at(data = NULL, text = NULL, rgx_pattern, rgx_replacement,
  stopwords = TRUE, stopwords_lexicon, rgx_ignore_custom)

tidify_to_bigrams(..., text)

Arguments

data

data.frame.

text

bare for NSE; character for SE. Name of column in data to parse.

rgx_pattern

character. Regular expression to substitute.

rgx_replacement

character. Replacement for text matched by rgx_pattern.

stopwords

logical. Whether or not to remove stopwords. Default: TRUE

stopwords_lexicon

character. If not specified, uses all stop words available in the tidytext package. Otherwise, must be the name of a lexicon available in the tidytext package.

rgx_ignore_custom

character. Custom regular expression to remove.

...

dots. Parameters to pass directly to tidify_to_bigrams_at().

Details

Heavily influenced by Text Mining with R. Creates output columns 'word', 'word1', and 'word2'. ('word' is simply 'word1' and 'word2' together, separated by a space.)
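
The bigram workflow from Text Mining with R that this function draws on can be sketched roughly as below. This is an illustration only, not the package's actual implementation: the input data.frame docs, its column body, the digit-stripping pattern, and the choice of the "snowball" lexicon are all assumptions made for the example. Only dplyr, tidyr, and tidytext functions are used.

```r
library(dplyr)
library(tidyr)
library(tidytext)

# Hypothetical input: one row per document, with the text in a column `body`.
docs <- tibble(
  id = 1:2,
  body = c("The quick brown fox jumps 3 times", "A lazy dog sleeps all day")
)

# Stop words from a single lexicon of tidytext::stop_words (assumed choice).
stop_snowball <- stop_words %>% filter(lexicon == "snowball")

bigrams <- docs %>%
  # Assumed analogue of rgx_pattern / rgx_replacement: strip digits first.
  mutate(body = gsub("[0-9]+", "", body)) %>%
  # Tokenize into bigrams; the combined bigram goes in `word`.
  unnest_tokens(word, body, token = "ngrams", n = 2) %>%
  # Split each bigram into its two words, keeping the `word` column.
  separate(word, into = c("word1", "word2"), sep = " ", remove = FALSE) %>%
  # Drop any bigram in which either word is a stop word.
  filter(!word1 %in% stop_snowball$word, !word2 %in% stop_snowball$word)

bigrams
```

The result has the three columns described above: 'word' holds the full bigram, and 'word1' and 'word2' hold its two parts. Whether the package filters stop words before or after splitting is not stated here, so the order shown is a guess.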

Value

data.frame.


tonyelhabr/tetext documentation built on May 14, 2019, 8:03 a.m.