| two_different_clusters_data | R Documentation |
Two gaussians with equal size but unequal bandwidths, from "How to Use t-SNE Effectively".
two_different_clusters_data(n, dim = 50, scale = 10)
n |
Number of points per gaussian. |
dim |
Dimension of the gaussians. |
scale |
Amount to reduce the standard deviation of the second cluster, relative to the first. |
Creates a dataset consisting of two symmetric gaussian distributions with
equal number of points, but different standard deviations: the standard
deviations of the second cluster will be 1/scale of the other.
Clusters are separated by 20 units. Points are colored depending on which
cluster they belong to.
Data frame with coordinates in the X1, X2 ...
Xdim columns, and color in the color column.
http://distill.pub/2016/misread-tsne/
Other distill functions:
circle_data(),
cube_data(),
gaussian_data(),
grid_data(),
link_data(),
long_cluster_data(),
long_gaussian_data(),
ortho_curve(),
random_circle_cluster_data(),
random_circle_data(),
random_jump(),
random_walk(),
simplex_data(),
subset_clusters_data(),
three_clusters_data(),
trefoil_data(),
two_clusters_data(),
unlink_data()
df <- two_different_clusters_data(n = 50, dim = 2, scale = 5)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.