stochprofML-package: Stochastic Profiling using Maximum Likelihood Estimation

Description Details Author(s) References Examples

Description

This package accompanies the paper "Parameterizing cell-to-cell regulatory heterogeneities via stochastic transcriptional profiles" by Sameer S Bajikar, Christiane Fuchs, Andreas Roller, Fabian J Theis and Kevin A Janes (PNAS 2014, 111(5), E626-635). In that work, we measure expression profiles from small heterogeneous populations of cells. Each cell is assumed to be from a mixture of lognormal and exponential distributions (see details). We perform maximum likelihood estimation in order to infer the mixture ratio and the parameters of the lognormal/exponential distributions from the cumulated expression measurements.

Details

Package: stochprofML
Type: Package
Version: 2.0
Date: 2018-05-07
License: GPL (>= 2)

There are three stochastic profiling models: The lognormal-lognormal (LN-LN) model assumes that each cell is from a mixture of one or more lognormal distributions with different log-means but identical log-standard deviations. In the relaxed lognormal-lognormal (rLN-LN) model, the log-standard deviations are not necessarily identical. The exponential-lognormal (EXP-LN) model considers the mixture of zero, one or more lognormal distributions and one exponential distribution.

The user can immediately start with data generation and model estimation using the two functions stochasticProfilingData and stochasticProfilingML, respectively. These functions prompt the user to input the settings/data. Typical analyses are then performed without the user having to delve into the structure of this package.

When not using one the above two functions, parameters can be estimated calling stochprof.loop, which again utilizes three other functions: stochprof.search.LNLN/ stochprof.search.rLNLN/ stochprof.search.EXPLN in order to calculate and locally optimize the likelihood function;
stochprof.results.LNLN/ stochprof.results.rLNLN/ stochprof.results.EXPLN for evaluating these results; and calculate.ci.LNLN/ calculate.ci.rLNLN/ calculate.ci.EXPLN for calculating confidence intervals.

Two essential functions are r.sum.of.mixtures.LNLN/ r.sum.of.mixtures.rLNLN/
r.sum.of.mixtures.EXPLN and d.sum.of.mixtures.LNLN/ d.sum.of.mixtures.rLNLN/
d.sum.of.mixtures.EXPLN for the density and random number generation of the distribution assumed for all measurements in the stochastic profiling model.

Version 2.0 shows one significant extension to previous versions: The number of cells n in the input data does no longer need to be the same over all samples.

The package provides four datasets: sod2, containing real measurements for one gene, and
toycluster.LNLN/ toycluster.rLNLN/ toycluster.EXPLN, containing artificial data for 12 genes generated with the three stochastic profiling models.

Examples for typical analyses are given below.

Author(s)

Lisa Amrhein, Christiane Fuchs

Maintainer: Lisa Amrhein <amrheinlisa@gmail.com>

References

"Parameterizing cell-to-cell regulatory heterogeneities via stochastic transcriptional profiles" by Sameer S Bajikar*, Christiane Fuchs*, Andreas Roller, Fabian J Theis^ and Kevin A Janes^: PNAS 2014, 111(5), E626-635 (* joint first authors, ^ joint last authors) <doi:10.1073/pnas.1311647111>

"Pheno-seq - linking visual features and gene expression in 3D cell culture systems" by Stephan M. Tirier, Jeongbin Park, Friedrich Preusser, Lisa Amrhein, Zuguang Gu, Simon Steiger, Jan-Philipp Mallm, Teresa Krieger, Marcel Waschow, Bjoern Eismann, Marta Gut, Ivo G. Gut, Karsten Rippe, Matthias Schlesner, Fabian Theis, Christiane Fuchs, Claudia R. Ball, Hanno Glimm, Roland Eils & Christian Conrad: Sci Rep 9, 12367 (2019) <doi:10.1038/s41598-019-48771-4>

Examples

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
## Not run: 
# Generate synthetic data. The user is prompted to input all settings.
stochasticProfilingData()

# Estimate a stochastic profiling model. The user is prompted to input the data
# and other parameters.
stochasticProfilingML()

# Generate a synthetic dataset (without measurement error) for one gene
# and estimate the parameters from this data.
generate.toydata()

# Estimate the model parameters for the SOD2 dataset.
analyze.sod2()

# Estimate the model parameters for the 12-gene toycluster.
analyze.toycluster()

## End(Not run)

stochprofML documentation built on July 1, 2020, 5:18 p.m.