textmineR: Functions for Text Mining and Topic Modeling

# Corpus is just a character vector

docs <- c("This is my first document.",
          "My 2nd document!",
          "skills, son, skills. Skillz!",
          "all my documents have skills")

# Single function call to make a DTM
d <- CreateDtm(doc_vec = docs[2:4], doc_names = seq_along(docs)[2:4],
               ngram_window = c(1,2),
               stopword_vec = "the", 
               lower = TRUE,
               remove_punctuation = TRUE,
               remove_numbers = TRUE,
               cpus = 2)

# Fit a model
m <- FitLsaModel(dtm = d, k = 2)

# Make a DTM for a new document
d2 <- CreateDtm(doc_vec = docs[1], doc_names = 1,
                ngram_window = c(1,2),
                stopword_vec = "the", 
                lower = TRUE,
                remove_punctuation = TRUE,
                remove_numbers = TRUE,
                cpus = 2)

# Single call to predict
p <- predict(m, d2)

TommyJones/textmineR documentation built on July 26, 2023, 9:51 p.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

TommyJones/textmineR
Functions for Text Mining and Topic Modeling

extra_functions/_presentation_example.R
In TommyJones/textmineR: Functions for Text Mining and Topic Modeling

R Package Documentation

Browse R Packages

We want your feedback!

TommyJones/textmineR Functions for Text Mining and Topic Modeling

extra_functions/_presentation_example.R In TommyJones/textmineR: Functions for Text Mining and Topic Modeling

R Package Documentation

Browse R Packages

We want your feedback!

TommyJones/textmineR
Functions for Text Mining and Topic Modeling

extra_functions/_presentation_example.R
In TommyJones/textmineR: Functions for Text Mining and Topic Modeling