README.md

CiberAMP | An R package to identify differential mRNA expression linked to somatic copy number variations in cancer datasets

CiberAMP is an R package that uses differential expression analyses to stablish accurate correlations between specific SCNVs and changes in expression in the genes affected by them. The algorithm has been designed to be an easy-to-access tool for the TCGA, the largest database in the world with genomic and transcriptomic data ofr more than 10,000 samples of 33 different human cancers.

Unlike other methods, CiberAMP can yield information on: (i) SCNV-DEGs (somatic copy number variations associated differentially expressed genes) in a cohort of TCGA tumor samples (ii) The type of copy number variation associated with each SCNV-DEG in terms of expression pattern and genomic context (iii) Insights on the potential functional relevance of each identified SCNV-DEG

Installation from GitHub

# Install devtools from CRAN
install.packages("devtools")

# Or the development version from GitHub:
# install.packages("devtools")
devtools::install_github("r-lib/devtools")

# Install ciberAMP by devtools
devtools::install_github("vqf/ciberAMP", dependencies = TRUE)

Usage

# Load the library
library(ciberAMP)

# Write your function
x <- ciberAMP(genes = c(), cohorts = c(), pat.percentage = 0, writePath = "PATH_TO_FOLDER")

Where:

Looking into CiberAMP results

CiberAMP returns a list of 3 data frames:

The first data frame contains all SCNV-DEGs and genes differentially expressed between tumor and normal samples exclusively. The secon data frame contains all the SCNV-DE known cancer drivers. These two data frames have the same format and in each column we can find:

The third data frame contains the information about the significant co-occurring amplification or deletions between the SCNV-DEGs and known cancer drivers:

Looking into CiberAMP's logic classifier results

The logic classification algorithm integrated in CiberAMP's package allows the user to rate the potential candidates subdividing them into four subgroups.

First, the SCN-associated DEGs reported from the previous step are divided based on their significant genomic interactions with any COSMIC CGC oncogene in each cohort. Secondly, these genes are further subdivided regarding their genomic location inside or outside enriched genomic regions. Finally, within each of the four resulting subgroups, genes are rated based on, first, their recurrency and, secondly, their SCN-associated FDR adjusted p-value.

# Load the library
library(ciberAMP)

# Write your function
x <- CiberAMP.classifier(res1 = NULL, res3 = NULL, width.window = 10000000)

Where: res1 The first data frame reported from the previous function res3 The third data frame reported from the previous function width.window* The window length in base pairs used for genomic enriched clusters calculation.

The outcomes of this function is a list of 4 data frames. The first data frame contains all the SCNV-DEGs that are not co-amplified or co-deleted with any known cancer driver gene and outside any enriched cluster. The second data frame conatins all SCNV-DEGs that are not co-amplified or co-deleted with any known cancer driver gene and located within an enriched cluster. The third data frame containes all SCNV-DEGs that are co-amplified or co-deleted with a known cancer driver gene and outside any enriched cluster. The fourth data frame contains all SCNV-DEGs that are co-amplified or co-deleted with a known cancer driver gene and within an enriched gene cluster.



vqf/ciberAMP documentation built on April 12, 2022, 12:45 p.m.