databio/PCRSA: Coordinate Covariation Analysis

COCOA is a method for understanding variation among samples and can be used with data that includes genomic coordinates such as DNA methylation. On a high level, COCOA uses a database of "region sets" and principal component analysis (PCA) of your data to identify sources of variation among samples. A region set is a set of genomic regions that share a biological annotation, for instance, transcription factor binding regions, histone modification regions, or open chromatin regions. COCOA works in both supervised (known groups of samples) and unsupervised (no groups) situations and can be used as a complement to "differential" methods that find discrete differences between groups. COCOA can identify biologically meaningful sources of variation between samples and increase understanding of variation in your data.

Getting started

Package details

Bioconductor views ChIPSeq DNAMethylation Epigenetics FunctionalGenomics GeneRegulation GenomeAnnotation GenomicVariation ImmunoOncology MethylSeq PrincipalComponent Sequencing SystemsBiology
Package repositoryView on GitHub
Installation Install the latest version of this package by entering the following in R:
databio/PCRSA documentation built on Dec. 7, 2018, 8:57 a.m.