rakeR: An implementation of rapid automatic keyword extraction for R.

#' Stem without tokenizing
#'
#' A helper that stems every word in a character vector and returns a
#' character vector in which each element's stems are re-joined with single
#' spaces. Stemming can reduce the total vocabulary considerably.
#' @export
#' @param x A character vector.
#' @param language The stemming language, one of "danish", "dutch", "english",
#'   "finnish", "french", "german", "hungarian", "italian", "norwegian",
#'   "porter", "portuguese", "romanian", "russian", "spanish", "swedish",
#'   "turkish".
#' @return A character vector of stemmed text, one element per element of `x`.
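stem_in_place <- function(x, language = "english"){
  # Tokenize each element into word stems (a list of character vectors).
  x <- tokenizers::tokenize_word_stems(x, language = language)
  # Re-join the stems of each element into a single space-separated string.
  x <- purrr::map(x, stringr::str_c, collapse = " ")
  # Flatten the list of strings back into a character vector.
  x <- purrr::as_vector(x)

  x
}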
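
The claim that stemming shrinks the vocabulary can be checked with a small sketch like the one below. It assumes the `tokenizers`, `SnowballC`, `purrr`, and `stringr` packages are installed, repeats the function definition only so the snippet runs on its own, and uses made-up sample sentences (`docs`) that are not part of the package.

```r
# Local copy of stem_in_place() so this snippet is self-contained.
stem_in_place <- function(x, language = "english"){
  x <- tokenizers::tokenize_word_stems(x, language = language)
  x <- purrr::map(x, stringr::str_c, collapse = " ")
  purrr::as_vector(x)
}

# Hypothetical sample input, for illustration only.
docs <- c("The runner runs while running", "Cats and a cat")

# Vocabulary before stemming: unique word tokens across all documents.
vocab_before <- unique(unlist(tokenizers::tokenize_words(docs)))

# Vocabulary after stemming: split the stemmed strings on the single
# spaces that stem_in_place() inserted, then take the unique stems.
stemmed <- stem_in_place(docs, language = "english")
vocab_after <- unique(unlist(strsplit(stemmed, " ", fixed = TRUE)))

print(stemmed)
print(c(before = length(vocab_before), after = length(vocab_after)))
```

Because several inflected forms can map to one stem, the stemmed vocabulary should never be larger than the unstemmed one; how much it shrinks depends on the text and the chosen `language`.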

lmkirvan/rakeR documentation built on May 14, 2019, 1:46 p.m.

R/stem_in_place.R