topics: Topic Coherence
In news-r/gensimr: Topic Modelling

Description Usage Arguments Details Examples

Calculate topic coherence for topic models.

1	model_coherence(models, ...)

`models`	A model, i.e.: LDA or LSI, or a `list` of the latter.
`...`	Any other options, from the official documentation.

A greater coherence is preferred: a higher value on the get_coherence method, see example.

# preprocess the corpus
texts <- prepare_documents(corpus)
dictionary <- corpora_dictionary(texts)
corpus <- doc2bow(dictionary, texts)

# create 2 models to compare
good_lda_model <- model_lda(
  corpus = corpus, 
  id2word = dictionary, 
  iterations = 50L, 
  num_topics = 2L
)
bad_lda_model <- model_lda(
  corpus = corpus, 
  id2word = dictionary, 
  iterations = 1L, 
  num_topics = 5L
)

# create coherence models
good_cm <- model_coherence(
  model = good_lda_model, 
  corpus = corpus, 
  dictionary = dictionary, 
  coherence = 'u_mass'
)
bad_cm <- model_coherence(
  model = bad_lda_model, 
  corpus = corpus, 
  dictionary = dictionary, 
  coherence = 'u_mass'
)

# compare coherence
good_cm$get_coherence()
bad_cm$get_coherence()