Description
This function is a wrapper around unnest_tokens(token = "regex").
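Because it is a thin wrapper, a call to unnest_regex() can be rewritten as an equivalent unnest_tokens() call. A minimal sketch of that equivalence, using made-up sample data (the column and token names here are illustrative, not from the package docs):

```r
library(dplyr)
library(tidytext)

d <- tibble(txt = c("one;two", "three;four"))

# These two calls produce the same result:
d %>% unnest_regex(word, txt, pattern = ";")
d %>% unnest_tokens(word, txt, token = "regex", pattern = ";")
```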
Usage

unnest_regex(
  tbl,
  output,
  input,
  pattern = "\\s+",
  format = c("text", "man", "latex", "html", "xml"),
  to_lower = TRUE,
  drop = TRUE,
  collapse = NULL,
  ...
)
Arguments

tbl
    A data frame.

output
    Output column to be created as string or symbol.

input
    Input column that gets split as string or symbol. The output/input arguments are passed by expression and support quasiquotation; you can unquote strings and symbols.

pattern
    A regular expression that defines the split.

format
    Either "text", "man", "latex", "html", or "xml". If not "text", this uses the hunspell tokenizer and can only tokenize by "word".

to_lower
    Whether to convert tokens to lowercase. If tokens include URLs (such as with token = "tweets"), such converted URLs may no longer be correct.

drop
    Whether the original input column should be dropped. Ignored if the original input and new output column have the same name.

collapse
    Whether to combine text with newlines first, in case tokens (such as sentences or paragraphs) span multiple lines. If NULL, collapses when the token method is "ngrams", "skip_ngrams", "sentences", "lines", "paragraphs", or "regex".

...
    Extra arguments passed on to tokenizers.
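To illustrate the drop argument described above, a small sketch on toy data (the data frame and column names are invented for this example):

```r
library(dplyr)
library(tidytext)

d <- tibble(txt = "a-b-c")

# drop = FALSE keeps the original txt column alongside the new word column
d %>%
  unnest_regex(word, txt, pattern = "-", drop = FALSE)
```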
See Also

unnest_tokens()
Examples

library(dplyr)
library(janeaustenr)
library(tidytext)

d <- tibble(txt = prideprejudice)

d %>%
  unnest_regex(word, txt, pattern = "Chapter [\\d]")