three_clusters_data: Three Gaussian Clusters With Equal Standard Deviations

View source: R/misread-tsne.R

three_clusters_data    R Documentation

Description

Three Gaussian clusters of equal size and bandwidth, from "How to Use t-SNE Effectively".

Usage

three_clusters_data(n, dim = 50)

Arguments

n

Number of points per Gaussian.

dim

Dimension of the Gaussians.

Details

Creates a dataset of three symmetric Gaussian distributions, each with the same number of points and standard deviation 1. The clusters are arranged in a line: the second cluster is 10 units from the first, and the third is 50 units from the first. Points are colored according to the cluster they belong to.
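
The construction described above can be approximated in base R. The following is only a sketch under stated assumptions, not the snedata source: the function name sketch_three_clusters, the offsets argument, and the color names are hypothetical, and placing the line of clusters along the first coordinate axis is an assumption.

```r
# Illustrative sketch only; this is not the snedata implementation.
# Assumptions: clusters are shifted along the first axis, and the
# color values below are placeholders chosen for this example.
sketch_three_clusters <- function(n, dim = 50,
                                  offsets = c(0, 10, 50),
                                  colors = c("red", "green", "blue")) {
  blocks <- lapply(seq_along(offsets), function(i) {
    # n points from a symmetric Gaussian with standard deviation 1
    x <- matrix(rnorm(n * dim), nrow = n, ncol = dim)
    # Move the cluster center along the first coordinate
    x[, 1] <- x[, 1] + offsets[i]
    out <- as.data.frame(x)
    names(out) <- paste0("X", seq_len(dim))
    # Label every point with the color of its cluster
    out$color <- colors[i]
    out
  })
  do.call(rbind, blocks)
}

set.seed(42)
sketch <- sketch_three_clusters(n = 100, dim = 2)

# Per-cluster mean of X1; each should lie close to its cluster's offset
tapply(sketch$X1, sketch$color, mean)
```

Because the second and third clusters sit 10 and 50 units from the first while every cluster has standard deviation 1, the gaps between clusters are very unequal relative to the cluster width.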

Value

Data frame with coordinates in the X1, X2, ..., Xdim columns and the point color in the color column.
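
Since the coordinates and the color share one data frame, the numeric columns usually need to be separated out before they are passed to an embedding routine. A minimal sketch of that split follows; the stand-in data frame is built by hand to mimic the documented layout and is not output from three_clusters_data(), and the literal color values are placeholders.

```r
# Hand-built stand-in with the documented column layout (X1..Xdim, color).
# The values and color names here are placeholders for illustration.
set.seed(1)
df <- data.frame(
  X1 = rnorm(6),
  X2 = rnorm(6),
  color = rep(c("red", "green", "blue"), each = 2)
)

# Select the coordinate columns by name pattern: X followed by digits
coord_cols <- grep("^X[0-9]+$", names(df), value = TRUE)

# Numeric matrix of coordinates, one row per point
coords <- as.matrix(df[, coord_cols, drop = FALSE])

# Per-point color labels, kept aside for plotting the embedding later
colors <- df$color

dim(coords)
```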

References

http://distill.pub/2016/misread-tsne/

See Also

Other distill functions: circle_data(), cube_data(), gaussian_data(), grid_data(), link_data(), long_cluster_data(), long_gaussian_data(), ortho_curve(), random_circle_cluster_data(), random_circle_data(), random_jump(), random_walk(), simplex_data(), subset_clusters_data(), trefoil_data(), two_clusters_data(), two_different_clusters_data(), unlink_data()

Examples

df <- three_clusters_data(n = 50, dim = 2)

jlmelville/snedata documentation built on Jan. 13, 2024, 2:06 a.m.