coherence | R Documentation |
Computes various coherence-based metrics for topic models. It
assesses the quality of estimated topics based on co-occurrences of words.
For best results, consider cleaning the initial tokens object with padding = TRUE.
coherence(
  x,
  nWords = 10,
  method = c("C_NPMI", "C_V"),
  window = NULL,
  NPMIs = NULL
)
x |
a model created from the |
nWords |
the number of words in each topic used for evaluation. |
method |
the coherence method used. |
window |
optional. If |
NPMIs |
optional NPMI matrix. If provided, the computation of NPMI between words is skipped, substantially decreasing computing time. |
Currently, only C_NPMI and C_V are documented. The implementation follows Röder et al. (2015). The sliding window size is 10 for C_NPMI and 110 for C_V.
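To make the NPMI-based scoring described above more concrete, here is a minimal base-R sketch of the idea behind C_NPMI. It is not the package's implementation: the helper npmi_matrix(), the toy tokens, the window size of 3 and the smoothing constant eps are all assumptions made for illustration. Word probabilities are estimated as the share of sliding windows containing a word (or a pair of words), and a topic's score is taken as the average NPMI over pairs of its top words. C_V combines NPMI values differently and is not shown.

```r
# Toy tokenized document (hypothetical data, for illustration only).
tokens <- c("rate", "inflation", "bank", "rate", "growth", "inflation",
            "bank", "policy", "rate", "inflation")

# Hypothetical helper: NPMI between all pairs of `words`. Probabilities are
# the share of sliding windows of size `window` that contain a word or pair.
# Assumes every word occurs at least once in `tokens`.
npmi_matrix <- function(tokens, words, window = 3, eps = 1e-12) {
  n_win <- max(length(tokens) - window + 1, 1)
  # Rows are windows, columns are words; 1 when the word occurs in the window.
  present <- t(vapply(seq_len(n_win), function(i) {
    as.numeric(words %in% tokens[i:min(i + window - 1, length(tokens))])
  }, numeric(length(words))))
  p_single <- colMeans(present)            # P(w_i)
  p_joint <- crossprod(present) / n_win    # P(w_i, w_j)
  pmi <- log((p_joint + eps) / outer(p_single, p_single))
  npmi <- pmi / (-log(p_joint + eps))      # normalise PMI to roughly [-1, 1]
  dimnames(npmi) <- list(words, words)
  npmi
}

# Score one topic from its top words (here three hypothetical top words).
topic_words <- c("rate", "inflation", "bank")
npmi <- npmi_matrix(tokens, topic_words, window = 3)
c_npmi <- mean(npmi[upper.tri(npmi)])      # average over distinct word pairs
print(round(npmi, 3))
print(c_npmi)
```

Because the pairwise matrix depends only on the corpus and the window size, not on the topics being scored, it can be computed once and reused. This is the kind of saving the NPMIs argument is meant to provide.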
A vector or matrix containing the coherence score of each topic.
Olivier Delmarcelle
Röder, M., Both, A., & Hinneburg, A. (2015). Exploring the Space of Topic Coherence Measures. In Proceedings of the Eighth ACM International Conference on Web Search and Data Mining, 399–408.