panGeneSets: Test gene networks for a set of cases versus a set of...

Description Usage Arguments Details Value Examples

View source: R/panGeneSets.R

Description

Test differential gene expression in gene networks for a set of cases versus a set of controls, with gene networks defined by reactome directed graph for cancer genes, and genomic events determined from variant, cna, or gcount.

Usage

1
2
3
4
5
6
panGeneSets(caseids, controlids, variant = NULL, cna = NULL,
           gcount, tumorpct = 0.5, tailPct = 0.1, tailEnd = "both",
           eventOnly = FALSE,  gene.adj = NULL, drug.adj = NULL,
           gageCompare = ifelse(length(caseids) > 1, "as.group", "unpaired"))
drivDNA(ids, variant = NULL, cna = NULL, tumorpct = 0.5, gene.adj=NULL)
outRNA(ids, gcount, tailPct = 0.1, tailEnd = "both")

Arguments

caseids

identifiers of subjects case subjects that will match to variant column PatientID, and column name of cna and gcount.

controlids

identifiers of subjects in row names of variant and column names of gcount who are controls

ids

identifiers of subjects in row names of variant and column names of gcount and cna matrices

variant

data.frame for somatic and/or germline variants. Must contain columns CHROM, POS, Gene.Symbol, SampleType (Germline, Tumor) and PatientID that must match the caseid and controlid

cna

data.frame with log2 of copy number aberrations. Required columns CHROM, START, STOP, Gene.Symbol, and columns named to match caseid and controlid

gcount

Normalized gene expression counts for patients (columns) at gene symbols (rows). Gene symbols are the row names and should match dataset gcinfoPan gene symbols.

tumorpct

Vector of approximate tumor percentage for each patient samples given in the order they appear in cases and controls. If a single value is passed, it is used for all patients. Copy number mutation events are called based on their approximate tumor percentage, transformed to the log2-ratio copy number call.

tailPct

percentile used to define the most extreme RNA expression events as cancer events per patient; only used when eventOnly=TRUE. Values 0 < tailPct < .5 allowed.

tailEnd

perform differential expression tests that are one-sided ("upper" or "lower"), or two-sided ("both"). Also used with tailPct to include both, upper, or lower expression events to be used as cancer events.

gene.adj

Adjacency matrix of genes connected via a graph of nodes and edges. Panoply contains reactome.adj, which is directed acyclic graph of Reactome nodes (genes) and edges (relationships).

drug.adj

Adjacency matrix (values of 1 or 0) for drug-gene interactions with the drug in the row directly impacting the gene in the column. The framework is equipped for coding for other interaction types of the drug and genes.

gageCompare

Character string telling the gage package, which performs the gene network tests, to perform tests of 1-vs-M ("unpaired") or N-vs-M ("as.group")

eventOnly

Logical, test only gene networks that have a cancer event (variant, cna, and rna-expression outside tailPct)).

Details

If eventOnly=FALSE, then test all gene networks for differential expression between cases and controls. Otherwise, cancer events are determined per gene from variant (any variant in the gene for the case(s)), cna (copy gain or loss for case(s) given tumorpct), and gcount (outlying expression from tailPct and tailEnd settings) and only those gene networks with a cancer event are tested.

Value

A data.frame with the following columns

Examples

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
data(clinPanTNBC)
data(genelistPan)
data(cnaPanTNBC)
data(gcPanTNBC)
data(gcinfoPan)
data(variantPanTNBC)
data(dgidbPan)
data(dgiSets)
data(reactome)

patient <- "TCGA-B6-A0IK"
match.index <- which(clinPanTNBC$Vital.Status=="Alive" & clinPanTNBC$PatientID!=patient & clinPanTNBC$age_at_diagnosis>50 & clinPanTNBC$N.stage == "N1" & clinPanTNBC$ndays > 1000)
ptmatch <- clinPanTNBC[match.index,"PatientID"]
drivGenes <- panGeneSets(caseid=patient, controlid=ptmatch, eventOnly=TRUE, variant=variantPanTNBC,
   cna=cnaPanTNBC, gcount=gcPanTNBC, tumorpct=0.3, tailEnd="upper", tailPct=0.1)
drivGenes[1:10,1:6]

sinnweja/panoply documentation built on Aug. 9, 2019, 9:56 a.m.