costumer: COmprehensive Searches ThroUgh Machine learning for systEmatic Reviews

Documented in tok_token

#' Tokenizator of token
#'
#' The function's aim is to access tokens at their single-words component level
#' maintaining the information of the original token structure
#'
#' @param doc (chr) A character vector representing a tokenized tocument
#'
#' @return (list) of character vector each element representing
#'         the word-components of each original token
#'
#' @export
#'
#' @examples
#' tokened_document <- c('this is', 'is a', 'a beautiful', 'beautiful day')
#'
#' tok_token(tokened_document)
tok_token <- function(doc) {

  stats::setNames(
    purrr::map(doc,
      ~ stringi::stri_extract_all_words(.) %>%
        unlist  # do not use "simplify = TRUE" because it returns a 1-row matrix
    ),
    doc
  )
}

UBESP-DCTV/costumer documentation built on Feb. 1, 2023, 4:52 a.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

UBESP-DCTV/costumer
COmprehensive Searches ThroUgh Machine learning for systEmatic Reviews

R/tok_token.R
In UBESP-DCTV/costumer: COmprehensive Searches ThroUgh Machine learning for systEmatic Reviews

Defines functions tok_token

Documented in tok_token

R Package Documentation

Browse R Packages

We want your feedback!

UBESP-DCTV/costumer COmprehensive Searches ThroUgh Machine learning for systEmatic Reviews

R/tok_token.R In UBESP-DCTV/costumer: COmprehensive Searches ThroUgh Machine learning for systEmatic Reviews

Defines functions tok_token

Documented in tok_token

R Package Documentation

Browse R Packages

We want your feedback!

UBESP-DCTV/costumer
COmprehensive Searches ThroUgh Machine learning for systEmatic Reviews

R/tok_token.R
In UBESP-DCTV/costumer: COmprehensive Searches ThroUgh Machine learning for systEmatic Reviews