| tpacForCancer | R Documentation |
Executes the TPAC (tissue-adjusted pathway analysis for cancer) method (tpacForCollection) on cancer gene expression data
using normal tissue expression data from the Human Protein Atlas (HPA) that is included in the package as hpa.data.
This HPA normal tissue data was specially processed by the HPA group as FPKM values using a pipeline similar to that employed by GDC for the TCGA data. For consistency with this HPA normal tissue data, the provided cancer.gene.expr data must be specified as FPKM+1 values. Please see the vignette for an example of calling this function using appropriately normalized TCGA gene expression data.
tpacForCancer(cancer.gene.expr, cancer.type, gene.set.collection,
min.set.size=1, max.set.size)
cancer.gene.expr |
An n x p matrix of gene expression values for n tumors of the specified tumor type and p genes. The data should be normalized as FPKM+1 values, row names should be sample ID, and column names should be Ensembl gene IDs. |
cancer.type |
Cancer type of the expression data. Must be one of the supported cancer types as per |
gene.set.collection |
List of m gene sets for which scores are computed. Each element in the list corresponds to a gene set and the list element is a vector of Ensembl IDs for genes in the set. Gene set names should be specified as list names. |
min.set.size |
See description of |
max.set.size |
See description of |
A list containing two elements:
S.pos: n x m matrix of TPAC scores computed using the positive squared adjusted Mahalanobis distances.
S.neg: n x m matrix of TPAC scores computed using the negative squared adjusted Mahalanobis distances.
S: n x m matrix of TPAC scorescomputed using the sum of the positive and negative squared adjusted Mahalanobis distances.
tpac, hpa.data, tpacForCollection, getSupportedCancerTypes, createGeneSetCollection
# Simulate Gaussian expression data for 10 genes and 10 samples
# (Note: cancer expression should be FPKM+1 for real applications)
cancer.gene.expr=matrix(rnorm(200), nrow=20)
# Create arbitrary Ensembl IDs
gene.ids = c("ENSG00000000003","ENSG00000000005","ENSG00000000419",
"ENSG00000000457","ENSG00000000460","ENSG00000000938",
"ENSG00000000971","ENSG00000001036","ENSG00000001084",
"ENSG00000001167")
colnames(cancer.gene.expr) = gene.ids
# Define a collection with two disjoint sets that span the 10 genes
collection=list(set1=gene.ids[1:5], set2=gene.ids[6:10])
# Execute TPAC on both sets
tpacForCancer(cancer.gene.expr=cancer.gene.expr, cancer.type="glioma",
gene.set.collection=collection)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.