| two_clusters_data | R Documentation |
Two gaussians with equal size and bandwidth, from "How to Use t-SNE Effectively".
two_clusters_data(n, dim = 50)
n |
Number of points per gaussian. |
dim |
Dimension of the gaussians. You may pass a vector of length 2 to create clusters of different dimensionalities, with the smaller cluster having zeros in the extra dimensions. |
Creates a dataset consisting of two symmetric gaussian distributions with equal number of points and standard deviation 1, separated by a distance of 10 units. Points are colored depending on which cluster they belong to.
Data frame with coordinates in the X1, X2 ...
Xdim columns, and color in the color column.
http://distill.pub/2016/misread-tsne/
Other distill functions:
circle_data(),
cube_data(),
gaussian_data(),
grid_data(),
link_data(),
long_cluster_data(),
long_gaussian_data(),
ortho_curve(),
random_circle_cluster_data(),
random_circle_data(),
random_jump(),
random_walk(),
simplex_data(),
subset_clusters_data(),
three_clusters_data(),
trefoil_data(),
two_different_clusters_data(),
unlink_data()
df <- two_clusters_data(n = 50, dim = 2)
# two clusters with 10 members each, first 10 sampled from a 3D gaussian,
# second 10 are sampled from a 4D gaussian
df <- two_clusters_data(n = 10, dim = c(3, 4))
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.