DMdata: A Dataset Simulating Single-Cell Epigenomic Data

Description Usage Format Source Examples

Description

A simulated dataset used in the 'SWKM' vignette for illustration to perform weighted K-Means clustering.

Usage

1

Format

A list with three items:

data

a 100 by 5000 matrix, indicating 100 observations with 5000 features. Each element is a non-negative integer, which could be regarded as read counts.

nCluster

true number of clusters.

true.label

a 100-dimension integer vector indicating the cluster each observation should be assigned to.

noisy.label

a 20-dimension integer vector indicating the positions of noisy observations. 90% of the features in the noisy observations are set to be 0, simulating missing values in real experiments.

Source

Jason D Buenrostro, et al. (2015) Single-cell chromatin accessibility reveals principles of regulatory variation. Nature, 523(7561), 486-490.

Examples

1
2
3
data <- DMdata$data
true.label <- DMdata$true.label
noisy.label <- DMdata$noisy.label

Van1yu3/SWKM documentation built on Sept. 3, 2019, 7:50 a.m.