bind_tf_idf: Bind the term frequency and inverse document frequency of a...
In igorscarvalho/tidytext: Text Mining using 'dplyr', 'ggplot2', and Other Tidy Tools


Calculate the term frequency and inverse document frequency of a tidy text dataset, along with their product, tf-idf, and bind them to the dataset. Each of these values is added as a column. This function supports non-standard evaluation through the tidyeval framework.

bind_tf_idf(tbl, term, document, n)

`tbl`	A tidy text dataset with one-row-per-term-per-document
`term`	Column containing terms as string or symbol
`document`	Column containing document IDs as string or symbol
`n`	Column containing document-term counts as string or symbol

The arguments term, document, and n are passed by expression and support quasiquotation; you can unquote strings and symbols.
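As a minimal sketch of what unquoting looks like in practice, the column names can be held in variables and spliced in with `!!`. The toy data frame `counts` and its column names (`doc_id`, `token`, `freq`) are made up for illustration, and `rlang::sym()` is used here as one way to turn a string into a symbol.

```r
library(dplyr)
library(tidytext)

# Hypothetical toy data: one row per document-term combination
counts <- tibble(
  doc_id = c("a", "a", "b"),
  token  = c("cat", "dog", "cat"),
  freq   = c(2, 1, 3)
)

# Column names stored as strings (illustrative names, not part of the package)
term_col <- "token"
doc_col  <- "doc_id"
n_col    <- "freq"

# Unquote the strings directly ...
by_string <- counts %>%
  bind_tf_idf(!!term_col, !!doc_col, !!n_col)

# ... or convert them to symbols first and unquote those
by_symbol <- counts %>%
  bind_tf_idf(!!rlang::sym(term_col), !!rlang::sym(doc_col), !!rlang::sym(n_col))

by_string
```

Both calls should add the same `tf`, `idf`, and `tf_idf` columns, which is convenient when the column names are only known at run time, for example inside a wrapper function.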

If the dataset is grouped, the groups are ignored during the calculation but retained in the result.

The dataset must have exactly one row per document-term combination for this to work.
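To make the three added columns concrete, here is a rough hand calculation with dplyr alone. It assumes the usual definitions: tf is a term's count divided by the total count in its document, and idf is the natural log of the number of documents divided by the number of documents containing the term. The data frame `toy_counts` and the names `n_docs` and `manual_tf_idf` are hypothetical, and this is a sketch of the idea rather than the package's own code.

```r
library(dplyr)

# Hypothetical toy data: exactly one row per document-term combination
toy_counts <- tibble(
  document = c("a", "a", "b", "b"),
  term     = c("cat", "dog", "cat", "fish"),
  n        = c(3, 1, 2, 2)
)

# Total number of distinct documents in the dataset
n_docs <- n_distinct(toy_counts$document)

manual_tf_idf <- toy_counts %>%
  # tf: share of the document's total count taken by this term
  group_by(document) %>%
  mutate(tf = n / sum(n)) %>%
  # idf: natural log of (documents overall / documents containing the term)
  group_by(term) %>%
  mutate(idf = log(n_docs / n_distinct(document))) %>%
  ungroup() %>%
  mutate(tf_idf = tf * idf)

manual_tf_idf
```

A term that appears in every document ("cat" here) gets an idf of zero, so its tf-idf is zero as well. The sketch also shows why the one-row-per-combination requirement matters: a duplicated document-term row would be counted again in the document total and distort tf.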

library(dplyr)
library(janeaustenr)
library(tidytext)  # provides unnest_tokens() and bind_tf_idf()

book_words <- austen_books() %>%
  unnest_tokens(word, text) %>%
  count(book, word, sort = TRUE)

book_words

# find the words most distinctive to each document
book_words %>%
  bind_tf_idf(word, book, n) %>%
  arrange(desc(tf_idf))

igorscarvalho/tidytext documentation built on Aug. 23, 2020, 12:44 a.m.
