In CrossICC: An Interactive Consensus Clustering Framework for Multi-platform Data Analysis

Brief instructions

To evaluate a clustering result, we implemented a tool that analyzes the agreement between the clusters produced by two clustering systems. By default, the consensus molecular subtypes (CMSs) of colorectal cancer (CRC) samples (Guinney, Dienstmann, et al., 2015) are compared with the subtypes that CrossICC assigns to the same data.

The input file should be in CSV (comma-separated) format, with three columns:

| ID      | Which cluster? (by method 1) | Which cluster? (by method 2) |
|---------|------------------------------|------------------------------|
| XXXXXXX | C1                           | K5                           |
| XXXXXXX | C7                           | K8                           |
| XXXXXXX | C4                           | K2                           |
| ...     | ...                          | ...                          |
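As a rough illustration of the expected layout, the sketch below parses such a file into three parallel lists. It is written in Python purely for illustration and is not part of CrossICC; the function name `read_cluster_table`, the header names, and the sample IDs are hypothetical, and the presence of a header row is an assumption based on the table above.

```python
import csv
import io


def read_cluster_table(handle):
    """Return (ids, labels_1, labels_2) from a three-column CSV.

    Assumes the first row is a header, as in the table above.
    """
    reader = csv.reader(handle)
    next(reader)  # skip the header row (assumed to be present)
    ids, labels_1, labels_2 = [], [], []
    for row in reader:
        if not row:
            continue  # ignore blank lines
        if len(row) < 3:
            raise ValueError(f"expected three columns, got: {row}")
        ids.append(row[0].strip())
        labels_1.append(row[1].strip())
        labels_2.append(row[2].strip())
    return ids, labels_1, labels_2


# Hypothetical file contents; real IDs and cluster names will differ.
example = io.StringIO("ID,method1,method2\nS1,C1,K5\nS2,C7,K8\nS3,C4,K2\n")
ids, m1, m2 = read_cluster_table(example)
```

The two label lists stay aligned by position, so element `i` of each list refers to the same sample.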

The comparison is evaluated with the following statistics:

Rand index

The Rand index is a measure of the similarity between two data clusterings. Every pair of samples is treated as one decision, and two samples count as similar when the reference clustering places them in the same cluster. A true positive (TP) decision assigns two similar samples to the same cluster, and a true negative (TN) decision assigns two dissimilar samples to different clusters. Two types of error are possible: a false positive (FP) decision assigns two dissimilar samples to the same cluster, and a false negative (FN) decision assigns two similar samples to different clusters. The Rand index is the fraction of decisions that are correct.

$$RI=\frac{TP+TN}{TP+FP+FN+TN}$$
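The pair-counting definition can be sketched directly. The Python snippet below is illustrative only, not the CrossICC implementation; it treats method 1 as the reference clustering, which is an assumption (the Rand index itself is symmetric, so the choice only decides which disagreements are called FP and which FN). The function names and labels are hypothetical.

```python
from itertools import combinations


def pair_counts(labels_1, labels_2):
    """Count (TP, FP, FN, TN) over all unordered pairs of samples."""
    if len(labels_1) != len(labels_2):
        raise ValueError("both label lists must cover the same samples")
    tp = fp = fn = tn = 0
    for i, j in combinations(range(len(labels_1)), 2):
        same_1 = labels_1[i] == labels_1[j]  # together under method 1
        same_2 = labels_2[i] == labels_2[j]  # together under method 2
        if same_1 and same_2:
            tp += 1
        elif same_2:   # apart in method 1, together in method 2
            fp += 1
        elif same_1:   # together in method 1, apart in method 2
            fn += 1
        else:
            tn += 1
    return tp, fp, fn, tn


def rand_index(labels_1, labels_2):
    """Fraction of sample pairs on which the two clusterings agree."""
    tp, fp, fn, tn = pair_counts(labels_1, labels_2)
    total = tp + fp + fn + tn
    if total == 0:
        raise ValueError("need at least two samples")
    return (tp + tn) / total


method_1 = ["C1", "C1", "C2", "C2"]
method_2 = ["K1", "K1", "K1", "K2"]
print(pair_counts(method_1, method_2))  # (1, 2, 1, 2)
print(rand_index(method_1, method_2))   # 0.5
```

Enumerating pairs is quadratic in the number of samples, which is fine for a sketch but slow for large cohorts.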

Adjusted Rand index

The adjusted Rand index is the corrected-for-chance version of the Rand index. It equals 1 for identical clusterings and has an expected value of 0 for random labelings.

$$ARI=\frac{RI-\text{Expected }RI}{\max(RI)-\text{Expected }RI}$$
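One common way to evaluate this ratio is the Hubert and Arabie form, which works with counts of co-clustered pairs taken from the contingency table of the two labelings. The Python sketch below assumes that form; whether CrossICC computes the expected value in exactly this way is not stated here, and the function name and labels are hypothetical.

```python
from collections import Counter
from math import comb


def adjusted_rand_index(labels_1, labels_2):
    """Chance-corrected agreement between two labelings of the same samples."""
    n = len(labels_1)
    if n != len(labels_2):
        raise ValueError("both label lists must cover the same samples")
    if n < 2:
        raise ValueError("need at least two samples")
    # Number of sample pairs placed together by both methods,
    # by method 1 alone, and by method 2 alone.
    both = sum(comb(c, 2) for c in Counter(zip(labels_1, labels_2)).values())
    in_1 = sum(comb(c, 2) for c in Counter(labels_1).values())
    in_2 = sum(comb(c, 2) for c in Counter(labels_2).values())
    expected = in_1 * in_2 / comb(n, 2)  # agreement expected by chance
    maximum = (in_1 + in_2) / 2          # upper bound on the agreement
    if maximum == expected:
        # Degenerate case: both labelings are a single cluster,
        # or both are all singletons, so they are identical.
        return 1.0
    return (both - expected) / (maximum - expected)


method_1 = ["C1", "C1", "C2", "C2"]
method_2 = ["K1", "K1", "K1", "K2"]
print(adjusted_rand_index(method_1, method_2))  # 0.0
```

In this small example the two methods agree on exactly as many pairs as chance would predict, so the adjusted value is 0 even though the unadjusted Rand index of the same labelings is 0.5.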

Jaccard index

The Jaccard index is a statistic for comparing the similarity and diversity of sample sets. It measures the similarity between two finite sample sets and is defined as the size of their intersection divided by the size of their union:

$$J(A,B)=\frac{|A\cap B|}{|A\cup B|}=\frac{|A\cap B|}{|A|+|B|-|A\cap B|}$$
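A minimal Python sketch of the set definition follows. It is illustrative only: the function name and sample IDs are hypothetical, and how CrossICC chooses the sets A and B when comparing two clusterings is not specified here. Raising an error for two empty sets is a choice made for this sketch, since the ratio is undefined in that case.

```python
def jaccard_index(a, b):
    """Size of the intersection divided by the size of the union."""
    a, b = set(a), set(b)
    union = a | b
    if not union:
        raise ValueError("Jaccard index is undefined for two empty sets")
    return len(a & b) / len(union)


# Hypothetical members of one cluster from each method.
cluster_c1 = {"S1", "S2", "S3"}
cluster_k5 = {"S2", "S3", "S4"}
print(jaccard_index(cluster_c1, cluster_k5))  # 0.5
```

Here the intersection has 2 samples and the union has 3 + 3 - 2 = 4, which matches the second form of the formula.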

CrossICC documentation built on April 29, 2020, 4:40 a.m.
