two_different_clusters_data | R Documentation |
Two gaussians with equal size but unequal bandwidths, from "How to Use t-SNE Effectively".
two_different_clusters_data(n, dim = 50, scale = 10)
n | Number of points per gaussian. |
dim | Dimension of the gaussians. |
scale | Factor by which the standard deviation of the second cluster is reduced, relative to the first. |
Creates a dataset consisting of two symmetric gaussian distributions with
an equal number of points but different standard deviations: the standard
deviation of the second cluster is 1/scale times that of the first.
Clusters are separated by 20 units. Points are colored depending on which
cluster they belong to.
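The construction described above can be sketched in base R. This is an illustrative approximation, not the package's implementation: the function name sketch_two_different_clusters is hypothetical, the first cluster is assumed to have unit standard deviation, the 20-unit separation is assumed to lie along the first axis, and the color values are placeholders.

```r
# Hypothetical re-implementation for illustration only; not the package's code.
sketch_two_different_clusters <- function(n, dim = 50, scale = 10) {
  # First cluster: assumed to be standard normal in `dim` dimensions.
  first <- matrix(rnorm(n * dim), nrow = n, ncol = dim)
  # Second cluster: standard deviation reduced by a factor of `scale`.
  second <- matrix(rnorm(n * dim, sd = 1 / scale), nrow = n, ncol = dim)
  # Assumption: the 20-unit separation is applied along the first axis.
  second[, 1] <- second[, 1] + 20

  df <- as.data.frame(rbind(first, second))
  names(df) <- paste0("X", seq_len(dim))
  # Assumption: placeholder color names, one per cluster.
  df$color <- rep(c("blue", "orange"), each = n)
  df
}

set.seed(42)
sketch <- sketch_two_different_clusters(n = 200, dim = 2, scale = 5)
# Per-cluster standard deviation of the second coordinate;
# the ratio of the two values should be close to `scale`.
tapply(sketch$X2, sketch$color, sd)
```

The sketch returns 2 * n rows, one block per cluster, which matches the "points per gaussian" meaning of n in the arguments.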
Data frame with coordinates in the X1, X2 ... Xdim columns, and color in
the color column.
http://distill.pub/2016/misread-tsne/
Other distill functions:
circle_data(), cube_data(), gaussian_data(), grid_data(), link_data(),
long_cluster_data(), long_gaussian_data(), ortho_curve(),
random_circle_cluster_data(), random_circle_data(), random_jump(),
random_walk(), simplex_data(), subset_clusters_data(),
three_clusters_data(), trefoil_data(), two_clusters_data(), unlink_data()
df <- two_different_clusters_data(n = 50, dim = 2, scale = 5)