cluster_tsne_mclust | R Documentation |
This function performs Mclust clustering on the 2D t-SNE results, which are derived from high-dimensional data. It includes an initial outlier detection step using DBSCAN, and the user can specify whether to exclude outliers from the clustering process. Silhouette scores are computed to evaluate the quality of the clustering, and cluster centroids are returned for visualization, with outliers handled separately.
cluster_tsne_mclust(info.norm, tsne.norm, settings)
info.norm |
A data frame containing the normalized data on which the t-SNE analysis was carried out. |
tsne.norm |
The t-SNE results object, including the 2D t-SNE coordinates ( |
settings |
A list of settings for the clustering analysis, including:
|
The function first uses DBSCAN to detect outliers (marked as cluster "100") and then applies Mclust clustering on the t-SNE results. Outliers can be either included or excluded from the clustering, depending on the settings. Silhouette scores are calculated to assess the quality of the clustering. Cluster centroids are returned, along with the sizes of each cluster, and outliers are handled separately in the centroid calculation.
A list containing:
info.norm
: The input data frame with an additional pandora_cluster
column for cluster assignments.
cluster_data
: A data frame with cluster centroids and labeled clusters.
avg_silhouette_score
: The average silhouette score, providing a measure of clustering quality.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.