The package constructs the mutually exclusively mutated gene networks from mutation associations and gene interactions with the graph clustering technique and identifies sub-networks with distinct biological functions with the canonical pathways. The sub-networks reveal crucial genes in the canonical pathway related to cancers, which could used to build better prediction models for clinical response and survival. gcMECM also provides informative visualization functionality of mutual exclusivity and network. The associated publication was "gcMECM: graph clustering of mutual exclusivity of cancer mutations" on BMC Bioinformatics 2021.
For additional visualization and clustering analysis, the packages, (NCIRASPathway, OmicPath, scCorr, SubPath, and GCluster), could be used.
The mis-sense mutation and clinical outcome datasets for BRCA (Breast invasive carcinoma) in The Cancer Genome Atlas (TCGA) were obtained from The NCI Genomic Data Commons (GDC, version 6). RAS pathway v2.0 is obtained from NCI Ras Initiative. The pathway structure and gene coordinates were created manually for the visualization. KEGG pathway images and gene relationships were from KEGG database.
Generate the pairwise gene-gene adjacency distance matrix from the p-value of one-tailed Fisher’s exact test and generalized linear models (glm). Select gene pairs below the p-value cutoff and negative correlation in glm. The Fisher’s exact test p values are used as the distance in the matrix.
Convert the distance matrix into a weighted graph or network using R package igraph. The resulting graph was clustered into modules with the Louvain algorithm.
Overlay the modules onto the canonical pathways.
library(devtools)
install_github("CBIIT-CGBB/gcMECM")
library(gcMECM)
## load the output of Fisher's test
dat <- read.table(gzfile(infile), header=T);
## convert the p values to distance matrix
dat.d <- p2dist(dat.s[,c(gene.i,gene.j)], as.numeric(dat.s[,pvalue.l]));
## construct clusters from the distance matrix
clu <- dist2cluster(dat.d, wt=wt, method="louvain");
Download the exmaple codes (02p2cluster.R)
library(gcMECM);
## load the pathway data
library(NCIRASPathway);
## retrieve genes in the pathway
g.xy <- get_node_layout();
## load gene relationships in the pathway
pdat <- get_relations()
## read cluster data from dist2cluster
clu.d <- read.table(gzfile(infile), header=T);
Download the example codes (R codes) for NCI RAS pathway in NCIRASPathway package. The output figures are as the follows (three sub-networks in the pathway and plots).
The option example codes (R codes) were for Ras signaling pathway of KEGG with OmicPath package. The output figures are as the follows (two sub-networks in the pathway and plots).
For mutation plot (R codes) and sub-networks (R codes)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.