R/show_tokens.R

#' Show token output of recipe
#'
#' Returns the tokens of a recipe variable as a list of character vectors.
#' This function is useful for diagnostics during recipe construction but
#' should not be used in final recipe steps. Note that this function will both
#' prep() and bake() the recipe it is used on.
#'
#' @param rec A recipe object.
#' @param var Name of the variable to show tokens for.
#' @param n Number of elements to return.
#'
#' @return A list of character vectors
#' @export
#'
#' @examples
#' text_tibble <- tibble(text = c("This is words", "They are nice!"))
#'
#' recipe(~text, data = text_tibble) %>%
#'   step_tokenize(text) %>%
#'   show_tokens(text)
#'
#' library(modeldata)
#' data(tate_text)
#'
#' recipe(~., data = tate_text) %>%
#'   step_tokenize(medium) %>%
#'   show_tokens(medium)
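#'
#' # A minimal sketch of the `n` argument, which caps how many elements are
#' # returned (default 6). The value 2 is only an illustrative choice.
#' recipe(~., data = tate_text) %>%
#'   step_tokenize(medium) %>%
#'   show_tokens(medium, n = 2)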
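show_tokens <- function(rec, var, n = 6L) {
  # Prep the recipe, bake it on its training data, and extract the tokens
  # of the selected variable.
  res <- rec %>%
    prep() %>%
    bake(new_data = NULL) %>%
    dplyr::pull({{ var }}) %>%
    get_tokens()

  # Return at most the first `n` elements.
  res[seq_len(min(length(res), n))]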
}

textrecipes documentation built on Nov. 16, 2023, 5:06 p.m.