Description Usage Arguments Details Value Examples
Generate a simulated scRNA-seq dataset of 2 biological groups with several batches per group, a defined variability within and between batches, and a defined proportion and amount of differentially distributed genes.
1 2 3 4 | SimulateData(k = 4.2, k_CV = 0.25, k_CV2 = 0.08, a = 1.9, a_CV = 0.35,
a_CV2 = 0.07, b = 3.5, b_CV = 0.4, b_CV2 = 0.09, Km = 4.5,
n = 3.4, pZero_SD = 0.1, nBatches = 3, nGenes = 10000, nCells = 100,
nCells_CV = 0.3, ddP = 0, ddM = 1)
|
k, a, b |
|
Km, n |
|
pZero_SD |
|
nBatches |
|
nGenes |
|
nCells |
|
ddP |
|
ddM |
|
_CV |
|
_CV2 |
|
nCells_SD |
|
Count tables, phenotypic- and feature-information tables are generated to resemble a scRNA-seq experiment. Count distributions are modelled using a 3 parameter Beta-Poisson distributions: Pois(k * Beta(a, b)).
For low average expression (genewise) additional 0's are introduced to better resemble real dropout rates. This is done according to a Hill equation, which defines the propability of a zero (dropout) from the average expression by P_zero = x^n / (Km + x^n) where x is the average expression.
The defaults resemble the Tung 2016 dataset. In general, these parameters fit well to datasets which were produced using UMI's and the fluidigm platform. For raw readcounts the absolute count values and variabilities are higher. Count tables produced using Drop-Seq usually have more cells and a much larger dropout rate.
list of generated count table, phenotypic- (cell-) and feature- (gene-) information tables, and a list which holds the Parameters for each batch and each gene.
1 2 | ds <- ds <- SimulateData()
str(ds)
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.