topicModeling: Topic Modeling
In ivanliu1989/RQuant: A collection of generic functions

Description Usage Arguments Details Examples

Topic Modeling based on Correlated Topic Model (Estimate a CTM model using for example the VEM algorithm.) and Latent Dirichlet Allocation (Estimate a LDA model using for example the VEM algorithm or Gibbs Sampling.)

1	topicModeling(corpus, k = 10)

`corpus`	the words
`k`	number of topics

LDA:
The C code for LDA from David M. Blei and co-authors is used to estimate and fit a latent dirichlet allocation model with the VEM algorithm. For Gibbs Sampling the C++ code from Xuan-Hieu Phan and co-authors is used.
When Gibbs sampling is used for fitting the model, seed words with their additional weights for the prior parameters can be specified in order to be able to fit seeded topic models.

CTM:
The C code for CTM from David M. Blei and co-authors is used to estimate and fit a correlated topic model.

setupTwitterConn()
tweets <- tweet_corpus(search = "audusd", n = 100, since = as.character(Sys.Date()-7), until = as.character(Sys.Date()))
tweets <- text_clean(tweets$v, rmDuplicates = FALSE, cores = 6, stems = NULL)
wordCloudVis(tweets$corpus)
topicModeling(tweets$corpus, k = 5)