View source: R/token_helpers.R
token_most_common | R Documentation
Returns a data frame of the most common n-grams
token_most_common(
  .v,
  n_range = DEFAULT_RANGE_NGRAM_RANGE,
  token = "ngrams",
  n_ngrams_returns = DEFAULT_NUM_NGRAMS_RETURN,
  tokenizer = tokenizer_basic,
  ...
)
.v | a vector of strings
n_range | range of n-gram sizes to return. Default: 1:4
token | passed to tokenizer_basic, which passes it to tidytext::unnest_tokens. Default: 'ngrams'
n_ngrams_returns | how many of the most common n-grams to return for each n. Default: 12
tokenizer | a function that tokenizes a column of a data frame. Default: tokenizer_basic
... | passed to tokenizer_basic
A data frame showing the most common tokens.
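# Most common n-grams in the car names stored as row names of mtcars
mtcars |>
  tibble::rownames_to_column() |>
  dplyr::pull(rowname) |>
  token_most_common()

# Most common n-grams in the mobydick text from the tokenizers package,
# split into sentences on full stops
library(tokenizers)
mobydick |>
  stringr::str_split('\\.') |>
  magrittr::extract2(1) |>
  token_most_common()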
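To make the roles of n_range and n_ngrams_returns more concrete, the following is a rough sketch of the kind of computation the argument descriptions suggest. It is not the package's implementation: the helper name sketch_most_common, its n_return argument, and the output columns ngram_size, ngram, and freq are all hypothetical, and the defaults 1:4 and 12 are copied from the argument list above. It assumes dplyr, tibble, and tidytext are installed and calls tidytext::unnest_tokens directly rather than going through tokenizer_basic.

```r
library(dplyr)
library(tidytext)

# Hypothetical helper (not part of the package): for each n-gram size in
# n_range, count n-grams across all strings and keep the top n_return.
sketch_most_common <- function(.v, n_range = 1:4, n_return = 12) {
  text_df <- tibble::tibble(text = .v)

  per_size <- lapply(n_range, function(size) {
    text_df |>
      # Split each string into n-grams of the current size
      tidytext::unnest_tokens(ngram, text, token = "ngrams", n = size) |>
      # Strings shorter than `size` words can yield NA; drop those rows
      dplyr::filter(!is.na(ngram)) |>
      # Count occurrences, most frequent first
      dplyr::count(ngram, sort = TRUE, name = "freq") |>
      dplyr::slice_head(n = n_return) |>
      dplyr::mutate(ngram_size = size, .before = 1)
  })

  # Stack the per-size results into one data frame
  dplyr::bind_rows(per_size)
}

sketch_most_common(c("the cat sat", "the cat ran"), n_range = 1:2, n_return = 2)
```

Under these assumptions, each value of n_range contributes at most n_return rows to the result, which is how the two arguments are described above.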