clusterHDBSCAN    R Documentation
Summarize HDBSCAN data per cluster
clusterHDBSCAN(m)
m
Tibble with one token per row and HDBSCAN information. The
Tibble with one row per cluster and various HDBSCAN-derived values:
Minimum, mean and maximum, as well as standard deviation, of the number of first-order context words per token in that cluster.
Minimum, mean and maximum, as well as standard deviation, of the ε value of the tokens in that cluster.
Absolute number of tokens in the cluster and proportion of modelled tokens covered by the cluster.
Proportion of tokens in that cluster with an ε value lower than the minimum ε of noise tokens in that model.
Union of first-order context words of tokens in that cluster: number of types and of tokens and type-token ratio.
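The values listed above can be illustrated with a minimal dplyr sketch. This is not the package's implementation. The column names (`cluster`, `eps`, `cws`), the toy data, and the convention that cluster 0 holds the noise tokens are all assumptions made for illustration, and the real columns of `m` may differ.

```r
library(dplyr)

# Hypothetical input: one row per token. All column names are assumed.
m <- tibble(
  cluster = c(0, 0, 1, 1, 1, 2, 2),            # 0 = noise (assumed convention)
  eps     = c(0.9, 0.7, 0.2, 0.4, 0.8, 0.3, 0.5),
  cws     = list(                               # first-order context words per token
    c("a", "b"), c("b"),
    c("a", "c", "d"), c("a", "c"), c("c"),
    c("e", "f"), c("e")
  )
) %>%
  mutate(n_cws = lengths(cws))                  # number of context words per token

# Smallest epsilon among noise tokens, used as the comparison threshold.
min_noise_eps <- min(m$eps[m$cluster == 0])

per_cluster <- m %>%
  group_by(cluster) %>%
  summarise(
    # Context words per token: minimum, mean, maximum, standard deviation
    min_cws = min(n_cws), mean_cws = mean(n_cws),
    max_cws = max(n_cws), sd_cws = sd(n_cws),
    # Epsilon values: minimum, mean, maximum, standard deviation
    min_eps = min(eps), mean_eps = mean(eps),
    max_eps = max(eps), sd_eps = sd(eps),
    # Absolute cluster size
    n_tokens = n(),
    # Share of tokens whose epsilon is below the minimum noise epsilon
    prop_below_noise = mean(eps < min_noise_eps),
    # Union of context words: types, tokens, and type-token ratio
    cw_types  = n_distinct(unlist(cws)),
    cw_tokens = length(unlist(cws)),
    .groups = "drop"
  ) %>%
  mutate(
    rel_size = n_tokens / sum(n_tokens),        # proportion of modelled tokens
    cw_ttr   = cw_types / cw_tokens
  )

print(per_cluster)
```

In this sketch the relative size and the type-token ratio are computed after `summarise()` because they depend on totals or on other summary columns. Whether the noise cluster is kept as its own row in the real output is not stated in this page.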