EMDomics: Earth Mover's Distance for Differential Analysis of Genomics Data
The EMDomics algorithm is used to perform a supervised multi-class analysis to measure the magnitude and statistical significance of observed continuous genomics data between groups. Usually the data will be gene expression values from array-based or sequence-based experiments, but data from other types of experiments can also be analyzed (e.g. copy number variation). Traditional methods like Significance Analysis of Microarrays (SAM) and Linear Models for Microarray Data (LIMMA) use significance tests based on summary statistics (mean and standard deviation) of the distributions. This approach lacks power to identify expression differences between groups that show high levels of intra-group heterogeneity. The Earth Mover's Distance (EMD) algorithm instead computes the "work" needed to transform one distribution into another, thus providing a metric of the overall difference in shape between two distributions. Permutation of sample labels is used to generate q-values for the observed EMD scores. This package also incorporates the Komolgorov-Smirnov (K-S) test and the Cramer von Mises test (CVM), which are both common distribution comparison tests.
- Sadhika Malladi [aut, cre], Daniel Schmolze [aut, cre], Andrew Beck [aut], Sheida Nabavi [aut]
- Date of publication
- Sadhika Malladi <firstname.lastname@example.org> and Daniel Schmolze <email@example.com>
- MIT + file LICENSE
- Cramer von Mises for differential analysis of genomics data
- Calculate CVM score for a single gene
- Earth Mover's Distance for differential analysis of genomics...
- Calculate EMD score for a single gene
- Calculate the Komolgorov-Smirnov test statistic and q-values...
- Calculate KS score for a single gene
- Create an CVMomics object
- Create an EMDomics object
- Earth Mover's Distance algorithm for differential analysis of...
- Create an KSomics object
- Plot distributions and CVM score for a gene.
- Plot null distribution of permuted cvm scores vs. calculated...
- Plot histogram of CVM scores calculated via random...
- Plot distributions and EMD score for a gene.
- Plot null distribution of permuted EMD scores vs. calculated...
- Plot histogram of EMD scores calculated via random...
- Plot distributions and KS score for a gene.
- Plot null distribution of permuted ks scores vs. calculated...
- Plot histogram of KS scores calculated via random...
Files in this package