random_circle_cluster_data: Fuzzy 2D Circle

View source: R/misread-tsne.R

random_circle_cluster_dataR Documentation

Fuzzy 2D Circle

Description

Clusters arranged in a circle from "How to Use t-SNE Effectively".

Usage

random_circle_cluster_data(n)

Arguments

n

Number of points to define the circle. These are then jittered as described in 'Details', so the total number of observations in the data frame will be 20 * n.

Details

Creates a dataset where the points are located on the circumference of a circle of unit radius via circle_data, where the angle subtended by the point at the center of the circle has been uniformly sampled.

Each point is then replicated 20 times, with a slight perturbation to the original coordinate (using a normal distribution with a standard deviation of 0.1). Only the twenty jittered points are added to the data frame. Each point has a color linearly mapped from the angle of the original unjittered point to a rainbow color scheme.

Value

Data frame with x and y coordinates in the columns x and y respectively, and color in the color column.

References

http://distill.pub/2016/misread-tsne/

See Also

Other distill functions: circle_data(), cube_data(), gaussian_data(), grid_data(), link_data(), long_cluster_data(), long_gaussian_data(), ortho_curve(), random_circle_data(), random_jump(), random_walk(), simplex_data(), subset_clusters_data(), three_clusters_data(), trefoil_data(), two_clusters_data(), two_different_clusters_data(), unlink_data()

Examples

df <- random_circle_data(n = 50)

jlmelville/snedata documentation built on March 5, 2025, 12:22 p.m.