gsvaParam-class | R Documentation |
gsvaParam
classMethod-specific parameters for the GSVA method.
Objects of class gsvaParam
contain the parameters for running
the GSVA
method.
gsvaParam(
exprData,
geneSets,
assay = NA_character_,
annotation = NA_character_,
minSize = 1,
maxSize = Inf,
kcdf = c("Gaussian", "Poisson", "none"),
tau = 1,
maxDiff = TRUE,
absRanking = FALSE
)
exprData |
The expression data. Must be one of the classes
supported by |
geneSets |
The gene sets. Must be one of the classes supported by
|
assay |
The name of the assay to use in case |
annotation |
The name of a Bioconductor annotation package for the gene
identifiers occurring in the row names of the expression data matrix. This
can be used to map gene identifiers occurring in the gene sets if those are
provided in a |
minSize |
Minimum size of the resulting gene sets after gene identifier mapping. By default, the minimum size is 1. |
maxSize |
Maximum size of the resulting gene sets after gene identifier
mapping. By default, the maximum size is |
kcdf |
Character vector of length 1 denoting the kernel to use during
the non-parametric estimation of the cumulative distribution function of
expression levels across samples. By default, |
tau |
Numeric vector of length 1. The exponent defining the weight of
the tail in the random walk performed by the |
maxDiff |
Logical vector of length 1 which offers two approaches to calculate the enrichment statistic (ES) from the KS random walk statistic.
|
absRanking |
Logical vector of length 1 used only when |
In addition to the two common parameter slots inherited from
[GsvaMethodParam]
, this class has slots for the two method-specific
parameters of the GSVA method described below.
In addition to an expression data set and a collection of
gene sets, GSVA
takes four method-specific parameters as described below.
A new gsvaParam
object.
kcdf
Character vector of length 1 denoting the kernel to use during
the non-parametric estimation of the cumulative distribution function of
expression levels across samples. kcdf="Gaussian"
is suitable when input
expression values are continuous, such as microarray fluorescent units in
logarithmic scale, RNA-seq log-CPMs, log-RPKMs or log-TPMs. When input
expression values are integer counts, such as those derived from RNA-seq
experiments, then this argument should be set to kcdf="Poisson"
.
tau
Numeric vector of length 1. The exponent defining the weight of the tail in the random walk performed by the GSVA (Hänzelmann et al., 2013) method.
maxDiff
Logical vector of length 1 which offers two approaches to calculate the enrichment statistic (ES) from the KS random walk statistic.
FALSE
: ES is calculated as the maximum distance of the random walk from 0.
TRUE
: ES is calculated as the magnitude difference between
the largest positive and negative random walk deviations.
absRanking
Logical vector of length 1 used only when mx.diff=TRUE
.
When abs.ranking=FALSE
a modified Kuiper statistic is used to calculate
enrichment scores, taking the magnitude difference between the largest
positive and negative random walk deviations. When abs.ranking=TRUE
the
original Kuiper statistic that sums the largest positive and negative
random walk deviations, is used. In this latter case, gene sets with genes
enriched on either extreme (high or low) will be regarded as ’highly’
activated.
Hänzelmann, S., Castelo, R. and Guinney, J. GSVA: Gene set variation analysis for microarray and RNA-Seq data. BMC Bioinformatics, 14:7, 2013. DOI
GsvaExprData
,
GsvaGeneSets
,
GsvaMethodParam
,
plageParam
,
zscoreParam
,
ssgseaParam
library(GSVA)
library(GSVAdata)
data(leukemia)
data(c2BroadSets)
## for simplicity, use only a subset of the sample data
ses <- leukemia_eset[1:1000, ]
gsc <- c2BroadSets[1:100]
gp1 <- gsvaParam(ses, gsc)
gp1
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.