decon | R Documentation |
Estimate surrogate variables in a data matrix from cell type- or tissue-
specific gene sets. The surrogate variables are determined by the
deconComponents
method.
decon(object, model = NULL, geneSets, doPerm = TRUE, nPerm = 249, pvalueCutoff = 0.01, nComp = 1, trim = FALSE, seed = NULL, ...)
object |
An object of class |
model |
A formula or an n x k design matrix specifying the model of
interest see |
geneSets |
An object of class |
doPerm |
A boolean value of length 1, specifying if permutation testing for significance of gene sets should be performed. See below for details on the permutation testing and its interpretation |
nPerm |
A numeric value of length 1, specifying how many permutations should be performed |
pvalueCutoff |
A numeric value of length 1, generally between 1/nPerm and 1 specifying at what significance level should gene sets be considered informative in the dataset. |
nComp |
A numeric value of length 1, specifying how many components to test for in each gene set. For well-formed gene sets, this should be set to 1, indicating that the gene set should have one major set of correlated genes. If more components that nComp are found to be significant, a warning is thrown, as the gene set is likely not well specified. |
trim |
Logical. Should gene sets be trimmed before summarization. If TRUE, then only genes with an average pairwise correlation coefficient > 0.1 are included in the decon algorithm. Otherwise, the whole gene set is used. |
seed |
A seed to set for random number generator used in the permutation. Setting this will allow for reproducible p-values to be generated for the gene sets. |
... |
Currently not used, but may be used in the future. |
decon
attempts to identify gene sets that are significantly
informative in the residuals matrix for a given linear model. This is done
through a gene-wise permutation strategy. For each permutation, the first
nComp
eigenvalues are compared to the eigenvalues of the original
gene set expression matrix. An empirical p-value is calculated by finding
how many random eigenvalues are greater than the observed eigenvalues.
For gene sets that are considered significant (permutation p-value less
than the supplied alpha), an eigengene for the gene set is calculated. For
each significant eigenvalue, an eigengene is calculated by the method
described for deconComponents
. The first eigengene typically
represents the relative amount of that cell or tissue type in the mixed
sample.
Eigengenes beyond the first can be somewhat difficult to interpret, and by default are not looked for. However, looking for significance of the second (or third) eigenvalue can be informative about the relative consistency within a gene set. Ideally, the majority of the variance of the gene set would be explained by the first eigenvector. If there is a large amount of variance explained by the second eigenvector, this suggests that your gene set is identifying two separate expression patterns in the data set of interest.
An object of class DeconResults
with the following slots:
pvalueCutoff |
A single numeric value giving the significance cutoff at which gene sets are considered informative |
pvalues |
A numeric vector with an entry for each gene set that has a
p-value less than |
eigengenes |
A numeric matrix with one column for each significant gene set and one row for each sample in the expression data provided |
nComp |
A numeric vector of length 1, giving the number of significant components |
J.A. Hackney
svd
deconComponents
## Not run: library(GEOquery) library(hgu133plus2.db) deconGSC <- DeconGeneSetCollection() GSE11058 <- getGEO("GSE11058")[[1]] exprs(GSE11058) <- log2(exprs(GSE11058)) annotation(GSE11058) <- "hgu133plus2" deconU133GSC <- mapIdentifiers(deconGSC, AnnotationIdentifier(), revmap(hgu133plus2ENTREZID)) deconResults <- decon(GSE11058, ~1, deconU133GSC) ## End(Not run)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.