DMdata: A Dataset Simulating Single-Cell Epigenomic Data

DMdataR Documentation

A Dataset Simulating Single-Cell Epigenomic Data

Description

A simulated dataset used in the 'SWKM' vignette for illustration to perform weighted K-Means clustering.

Usage

DMdata

Format

A list with three items:

data

a 100 by 5000 matrix, indicating 100 observations with 5000 features. Each element is a non-negative integer, which could be regarded as read counts.

nCluster

true number of clusters.

true.label

a 100-dimension integer vector indicating the cluster each observation should be assigned to.

noisy.label

a 20-dimension integer vector indicating the positions of noisy observations. 90% of the features in the noisy observations are set to be 0, simulating missing values in real experiments.

Source

Jason D Buenrostro, et al. (2015) Single-cell chromatin accessibility reveals principles of regulatory variation. Nature, 523(7561), 486-490.

Examples

data <- DMdata$data
true.label <- DMdata$true.label
noisy.label <- DMdata$noisy.label

cuhklinlab/SWKM documentation built on Aug. 5, 2022, 2:27 a.m.