textmineR: Functions for Text Mining and Topic Modeling
Version 2.0.5

An aid for text mining in R, with a syntax that should be familiar to experienced R users. Provides a wrapper for several topic models that take similarly-formatted input and give similarly-formatted output. Has additional functionality for analyzing and diagnostics for topic models.

AuthorThomas Jones [aut, cre], William Doane [ctb]
Date of publication2017-04-07 06:07:04 UTC
MaintainerThomas Jones <jones.thos.w@gmail.com>
LicenseGPL (>= 3)
Version2.0.5
URL https://github.com/TommyJones/textmineR
Package repositoryView on CRAN
InstallationInstall the latest version of this package by entering the following in R:
install.packages("textmineR")

Getting started

README.md

Popular man pages

CalcHellingerDist: Calculate Hellinger Distance
CalcJSDivergence: Calculate Jensen-Shannon Divergence
FitLdaModel: Fit a topic model using Latent Dirichlet Allocation
FitLsaModel: Fit a topic model using Latent Semantic Analysis
FormatRawLdaOutput: Format Raw Output from 'lda.collapsed.gibbs.sampler'
GetVocabFromDtm: Reconstruct a 'text2vec::vocabulary' object from a document...
JSD: Jensen-Shannon Divergence
See all...

All man pages Function index File listing

Man pages

CalcHellingerDist: Calculate Hellinger Distance
CalcJSDivergence: Calculate Jensen-Shannon Divergence
CalcLikelihood: Calculate the log likelihood of a document term matrix given...
CalcPhiPrime: Calculate a matrix whose rows represent P(topic_i|tokens)
CalcProbCoherence: Probailistic coherence of topics
CalcTopicModelR2: Calculate the R-squared of a topic model.
Cluster2TopicModel: Represent a document clustering as a topic model
CorrectS: Function to remove some forms of pluralization.
CreateDtm: Convert a character vector to a document term matrix.
CreateTcm: Convert a character vector to a term co-occurence matrix.
DepluralizeDtm: Run the CorrectS function on columns of a document term...
Dtm2Docs: Convert a DTM to a Character Vector of documents
Dtm2Tcm: Turn a document term matrix into a term co-occurence matrix
Files2Vec: Function for reading text files into R
FitCtmModel: Fit a Correlated Topic Model
FitLdaModel: Fit a topic model using Latent Dirichlet Allocation
FitLsaModel: Fit a topic model using Latent Semantic Analysis
FormatRawLdaOutput: Format Raw Output from 'lda.collapsed.gibbs.sampler'
GetPhiPrime: Calculate a matrix whose rows represent P(topic_i|tokens)
GetProbableTerms: Get cluster labels using a "more probable" method of terms
GetTopTerms: Get Top Terms for each topic from a topic model
GetVocabFromDtm: Reconstruct a 'text2vec::vocabulary' object from a document...
HellDist: Hellinger Distance
InternalFunctions: Internal helper functions for 'textmineR'
JSD: Jensen-Shannon Divergence
LabelTopics: Get some topic labels using a "more probable" method of terms
nih: Abstracts and metadata from NIH research grants awarded in...
RecursiveRbind: Recursively call rBind from the Matrix package.
TermDocFreq: Get term frequencies and document frequencies from a document...
TmParallelApply: An OS-independent parallel version of 'lapply'
Vec2Dtm: Convert a character vector to a document term matrix of class...

Functions

CalcHellingerDist Man page Source code
CalcJSDivergence Man page Source code
CalcLikelihood Man page Source code
CalcLikelihoodC Man page Source code
CalcPhiPrime Man page Source code
CalcProbCoherence Man page Source code
CalcSumSquares Man page Source code
CalcTopicModelR2 Man page Source code
Cluster2TopicModel Man page Source code
CorrectS Man page Source code
CreateDtm Man page Source code
CreateTcm Man page Source code
DepluralizeDtm Man page Source code
Dtm2Docs Man page Source code
Dtm2DocsC Man page Source code
Dtm2Tcm Man page Source code
Files2Vec Man page Source code
FitCtmModel Man page Source code
FitLdaModel Man page Source code
FitLsaModel Man page Source code
FormatRawLdaOutput Man page Source code
GetPhiPrime Man page Source code
GetProbableTerms Man page Source code
GetTopTerms Man page Source code
GetVocabFromDtm Man page Source code
HellDist Man page Source code
HellingerMat Man page Source code
Hellinger_cpp Man page Source code
JSD Man page Source code
JSD_cpp Man page Source code
JSDmat Man page Source code
LabelTopics Man page Source code
RecursiveRbind Man page Source code
TermDocFreq Man page Source code
TmParallelApply Man page Source code
Vec2Dtm Man page Source code
nih Man page
nih_sample Man page
nih_sample_dtm Man page
nih_sample_topic_model Man page

Files

src
src/CalcSumSquares.cpp
src/textmineR_init.cpp
src/JSD_cpp.cpp
src/HellingerMat.cpp
src/Dtm2DocsC.cpp
src/JSDmat.cpp
src/CalcLikelihoodC.cpp
src/RcppExports.cpp
src/Hellinger_cpp.cpp
NAMESPACE
data
data/nih_sample_topic_model.rda
data/nih_sample.rda
data/nih_sample_dtm.rda
R
R/DepluralizeDtm.R
R/CalcHellingerDist.R
R/CreateTcm.R
R/CalcTopicModelR2.R
R/Dtm2Tcm.R
R/LabelTopics.R
R/Files2Vec.R
R/GetTopTerms.R
R/CalcJSDivergence.R
R/JSD.R
R/FitLdaModel.R
R/HellDist.R
R/GetPhiPrime.R
R/FitCtmModel.R
R/FitLsaModel.R
R/FormatRawLdaOutput.R
R/CalcProbCoherence.R
R/Vec2Dtm.R
R/CorrectS.R
R/RcppExports.R
R/Dtm2Docs.R
R/Cluster2TopicModel.R
R/CalcLikelihood.R
R/RecursiveRbind.R
R/TermDocFreq.R
R/GetProbableTerms.R
R/CreateDtm.R
R/CalcPhiPrime.R
R/GetVocabFromDtm.R
R/TmParallelApply.R
README.md
MD5
DESCRIPTION
man
man/nih.Rd
man/GetVocabFromDtm.Rd
man/JSD.Rd
man/InternalFunctions.Rd
man/CalcProbCoherence.Rd
man/DepluralizeDtm.Rd
man/CreateDtm.Rd
man/TermDocFreq.Rd
man/Cluster2TopicModel.Rd
man/CalcPhiPrime.Rd
man/GetProbableTerms.Rd
man/CalcTopicModelR2.Rd
man/CorrectS.Rd
man/Dtm2Tcm.Rd
man/FitLsaModel.Rd
man/GetTopTerms.Rd
man/TmParallelApply.Rd
man/Files2Vec.Rd
man/CalcJSDivergence.Rd
man/HellDist.Rd
man/LabelTopics.Rd
man/FitLdaModel.Rd
man/CalcHellingerDist.Rd
man/RecursiveRbind.Rd
man/CalcLikelihood.Rd
man/CreateTcm.Rd
man/Dtm2Docs.Rd
man/FormatRawLdaOutput.Rd
man/Vec2Dtm.Rd
man/FitCtmModel.Rd
man/GetPhiPrime.Rd
textmineR documentation built on May 19, 2017, 1:52 p.m.