M3C: Monte Carlo Consensus Clustering
Version 1.0.0

A central task in genomic data analysis for stratified medicine is class discovery, which is accomplished through clustering. However, an unresolved problem with current clustering algorithms is that they do not test the null hypothesis or derive p values. To solve this, we developed a novel hypothesis testing framework based on consensus clustering, called Monte Carlo Consensus Clustering (M3C). M3C uses a multi-core enabled Monte Carlo simulation to generate a distribution of stability scores for each number of clusters, using null datasets with the same gene-gene correlation structure as the real one. These distributions are used to derive p values, and a beta distribution is fitted to the data to cheaply estimate p values beyond the limits of the simulation. M3C improves accuracy, allows rejection of the null hypothesis, removes systematic bias, and uses p values to make class number decisions. We believe M3C deals with a major pitfall in current automated class discovery tools.
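
The p value step described above can be illustrated with a minimal base R sketch. This is not the M3C implementation or its API: the helper null_p_values is hypothetical, the null scores are stand-in random draws rather than scores from simulated datasets, and it assumes scores lie strictly between 0 and 1 with lower values meaning more stable clustering. The beta distribution is fitted here by the method of moments, which is a choice made for the sketch and may differ from what the package does.

```r
# Hypothetical helper, not part of the M3C package.
# real_score:  stability score of the real data for one number of clusters
# null_scores: stability scores from Monte Carlo null datasets for the same K
# Assumption: scores lie in (0, 1) and lower scores mean more stable clustering.
null_p_values <- function(real_score, null_scores) {
  n <- length(null_scores)

  # Empirical Monte Carlo p value; the +1 terms keep it above zero,
  # so it can never be smaller than 1 / (n + 1).
  p_empirical <- (sum(null_scores <= real_score) + 1) / (n + 1)

  # Method-of-moments beta fit to the null scores.
  m <- mean(null_scores)
  v <- var(null_scores)
  common <- m * (1 - m) / v - 1
  shape1 <- m * common
  shape2 <- (1 - m) * common

  # The fitted lower tail can go below the simulation limit of 1 / (n + 1).
  p_beta <- pbeta(real_score, shape1, shape2)

  list(p_empirical = p_empirical, p_beta = p_beta,
       shape1 = shape1, shape2 = shape2)
}

set.seed(1)
null_scores <- rbeta(100, 20, 10)  # stand-in for simulated null scores
res <- null_p_values(0.3, null_scores)
print(res)
```

With 100 simulations the empirical p value cannot fall below 1/101, whereas the fitted beta tail can, which is the sense in which the fit extends p value estimates beyond the limits of the simulation.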

Package details

Author: Christopher John [aut, cre]
Bioconductor views: Clustering, GeneExpression, RNASeq, Sequencing, Transcription
Maintainer: Christopher John <[email protected]>
License: AGPL-3
Version: 1.0.0
Package repository: Bioconductor

Installation

Install the latest version of this package by entering the following in R:
source("https://bioconductor.org/biocLite.R")
biocLite("M3C")

M3C documentation built on Nov. 17, 2017, 1:30 p.m.