sim2: Synthetic dataset used in section 5.1.2 of the reference...

sim2R Documentation

Synthetic dataset used in section 5.1.2 of the reference paper.

Description

Dataset used for testing clustering with HMM-VB. The data dimension is 5. Data points were drawn from a 10-component Gaussian Mixture Model. By specific choice of the means, the data contains 10 distinct clusters. For details see the references.

Usage

sim2

Format

A data frame with 5000 rows and 5 variables. Last column contains ground truth cluster labels.

References

Lin Lin and Jia Li, "Clustering with hidden Markov model on variable blocks," Journal of Machine Learning Research, 18(110):1-49, 2017.


HDclust documentation built on Sept. 20, 2024, 5:09 p.m.