Description Usage Arguments Details Author(s) References Examples
This function allows the identification of pathways that are differentially enriched across two sub-populations of samples of the same input dataset. Similarly to differential gene expression analysis, the two sub-populations to be contrasted are defined through a contrast matrix.
1 2 3 4 5 6 7 8 9 10 11 | SLAPE.diff_SLAPE_analysis(EM, contrastMatrix,
positiveCondition, negativeCondition,
show_progress = TRUE, display = TRUE,
correctionMethod = "fdr",
path_probability = "Bernoulli",
NSAMPLES = 1, NGENES = 1,
accExLength = TRUE,
BACKGROUNDpopulation = NULL,
GeneLenghts,
PATH_COLLECTION,
SLAPE.FDRth = 5, PATH = "./")
|
EM |
A sparse binary matrix, or a sparse matrix with integer non-null entries. In this matrix the column names are sample identifiers, and the row names official HUGO gene symbols. A non-zero entry in position i,j of this matrix indicates the presence of a somatic mutations harbored by the j-sample in the i-gene. If the matrix contains integer entries then these values are deemed as the number of somatic point mutations harbored by a given sample in a given gene (these values will be considered if the analysis takes into account of the gene exonic lengths, or converted in binary values otherwise). |
contrastMatrix |
A binary matrix specifying which sample is included in which sub-population. The row names of this matrix are sample identifiers (and must match the column names of the |
positiveCondition |
String indicating one of the two sub-populations of samples to be contrasted (the positive population). It should match a column header of the |
negativeCondition |
String indicating one of the two sub-populations of samples be contrasted (the negative population). It should match a column header of the |
show_progress |
Boolean parameter determining if a progress bar should be visualized during the execution of the analysis (default) or not. |
display |
Boolean parameter determining if result figures should be displayed and saved. |
correctionMethod |
A string indicating which method should be used to correct pathway enrichment p-values for multiple hypothesis testing, in the
two individual SLAPenrich analyses (therefore |
path_probability |
A string specifying which model should be used to compute the sample-wise pathway alteration probabilities, in the
two individual SLAPenrich analyses (therefore |
NSAMPLES |
The minimal number of samples of |
NGENES |
The minimal number of genes of a given pathway P that must be mutated in at least one sample of the |
accExLength |
Boolean parameter determining whether the sample-wise pathway alteration probability model in the
two individual SLAPenrich analyses (therefore |
BACKGROUNDpopulation |
A string vector containing the official HUGO symbols of the gene background population used to compute the sample-wise pathway alteration probabilities in the two individual SLAPenrich analyses (therefore |
GeneLenghts |
A named vector containing the genome-wide total exonic block lengths to be used in the two individual SLAPenrich analyses (therefore |
PATH_COLLECTION |
A list containing the pathway collection to be tested on the |
SLAPE.FDRth |
The false discovery rate threshold to be considered for selecting significant pathways in at least one of the two individual SLAPenrich analyses (therefore |
PATH |
String specifiying the path of the directory where the pdf file containing results shoud be created. |
This function first performs two individual SLAPenrichment analyses using the SLAPE.analyse
function, on the user-defined two sub-populations of samples, yielding two lists of results. The pathways that are significantly enriched in at least one of the two result lists (according to a user defined false discovery rate (FDR) threshold) are then selected and, for each of them, a differential enrichment score is computed as:
<e2><88><86>_(A,B) (P)= -log_10 FDR_A(P)+log_10 FDR_B(P),
where A and B are the two contrasted sub-populations (respectively, positive and negative) and FDR_A and FDR_B are the two SLAPenrichment FDRs obtained in the two corresponding individual analyses.
Results are visualised at the level of the inputted alterations across the two contrasted population, on the domain of the differentially enriched pathways as well has heatmaps and barplots of the differential enrichment scores. Visualisations are saved into a set of pdf files.
Francesco Iorio - iorio@ebi.ac.uk
Storey JD, Tibshirani R. Statistical significance for genomewide studies. Proceedings of the National Academy of Sciences of the United States of America. 2003;100:9440<e2><80><93>5.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 | #Loading the Genomic Event data object derived from variants annotations
#identified in 188 Lung Adenocarcinoma patients (Ding et al, 2008)
data(LUAD_CaseStudy_ugs)
#Loading KEGG pathway gene-set collection data object
data(SLAPE.PATHCOM_HUMAN_nr_i_hu_2016)
#Loading genome-wide total exonic block lengths
data(SLAPE.all_genes_exonic_content_block_lengths_ensemble)
#Loading clinical infos for 188 Lung Adenocarcinoma patients
#(Ding et al, 2008)
data(LUAD_CaseStudy_clinicalInfos)
#Performing differential SLAPenrichment analysis comparing
#Smokers Vs. Non Smokers. Pdf files with result figures are saved in
#the current working directory
RES<-
SLAPE.diff_SLAPE_analysis(EM = LUAD_CaseStudy_ugs,contrastMatrix = LUAD_CaseStudy_clinicalInfos,
BACKGROUNDpopulation = rownames(LUAD_CaseStudy_ugs),
SLAPE.FDRth = 5,display = TRUE,
positiveCondition = 'SS_CurrentSmoker',
negativeCondition = 'SS_Never',
PATH_COLLECTION = PATHCOM_HUMAN,
GeneLenghts = GECOBLenghts,
PATH = './')
#Showing the top-10 most differentially enriched pathways in the Smokers population
RES[1:10,]
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.