Man pages for dselivanov/text2vec
Modern Text Mining Framework for R

as.lda_c: Converts a sparse document-term matrix to 'lda_c' format
BNS: BNS
check_analogy_accuracy: Checks accuracy of word embeddings on the analogy task
coherence: Coherence metrics for topic models
Collocations: Collocations model
combine_vocabularies: Combines multiple vocabularies into one
create_dtm: Document-term matrix construction
create_tcm: Term-co-occurrence matrix construction
create_vocabulary: Creates a vocabulary of unique terms
distances: Pairwise Distance Matrix Computation
GlobalVectors: Creates Global Vectors word-embeddings model
glove: Fit a GloVe word-embedding model
ifiles: Creates an iterator over text files on disk
itoken: Iterators (and parallel iterators) over input objects
LatentDirichletAllocation: Creates Latent Dirichlet Allocation model
LatentSemanticAnalysis: Latent Semantic Analysis model
movie_review: IMDB movie reviews
normalize: Matrix normalization
perplexity: Perplexity of a topic model
prepare_analogy_questions: Prepares list of analogy questions
prune_vocabulary: Prune vocabulary
reexports: Objects exported from other packages
RelaxedWordMoversDistance: Creates model which can be used for calculation of "relaxed...
similarities: Pairwise Similarity Matrix Computation
split_into: Split a vector for parallel processing
text2vec: text2vec
TfIdf: TfIdf
tokenizers: Simple tokenization functions for string splitting
vectorizers: Vocabulary and hash vectorizers
dselivanov/text2vec documentation built on June 15, 2018, 8:17 a.m.