gss: Gene set statistics

GeneSetStatisticsR Documentation

Gene set statistics

Description

Functions to calculate a gene set statistic, as used in the gss parameter of gsAnalysis. A gene set statistic summarizes a single gene set.

Usage

gss.mean(x, geneSetIndices)

gss.sum(x, geneSetIndices)

gss.wilcoxonRankTest(x, geneSetIndices)

gss.maxmean(x, geneSetIndices)

gss.median(x, geneSetIndices)

gss.enrichmentScore(x, geneSetIndices, p = 1)

gss.fisherExactTest(x, geneSetIndices)

gss.gsz(x, geneSetIndices, w1 = 0.2, w2 = 0.5, preVar = 0, varConstant = 10)

Arguments

x

A vector comprising one numeric value for each gene in the data set. This vector is usually obtained from the previous step, the gene-level statistic (see gls) or the transformed gene-level statistic (see transformation).

geneSetIndices

A vector containing the indices of the genes in the gene set with respect to the full gene set (i.e., the indices of the rows containing the measurements for these genes in dat).

p

Factor for gss.enrichmentScore that specifies the way hits are weighted. For p = 0, the enrichment score is a Kolmogorov-Smirnov statistic. For p = 1 (the default), hits are weighted by their correlation.

w1

Weight for the median of the variance estimates for a gene set of size varConstant. Should be between 0 and 1. Default is w1 = 0.2.

w2

Weight for the median of the variance estimates for a gene set across the whole gene list. Should be between 0 and 1. Default is w2 = 0.5.

preVar

Parameter for incorporating the uncertainty of the observations. This is omitted by default (preVar = 0).

varConstant

Reference gene set size used for variance estimates. Default is varConstant = 10.

Details

Standard functions for the calculation of gene set statistics (to be used in an analysis pipeline defined by gsAnalysis):

  • gss.mean: Calculates the mean of the (transformed) gene-level statistic values for the genes in the set.

  • gss.sum: Calculates the sum of the (transformed) gene-level statistic values for the genes in the set.

  • gss.wilcoxonRankTest: Calculates a Wilcoxon test comparing the (transformed) gene-level statistic values for the genes in the set versus those of the genes not in the set.

  • gss.maxmean: Calculates the maximum of the means of positive and negative statistic values, weighted by the overall proportion of positive/negative values (e.g. for correlation scores where the sign denotes the direction). Described in Efron and Tibshirani.

  • gss.median: Calculates the median of the (transformed) gene-level statistic values for the genes in the set.

  • gss.enrichmentScore: Calculates the enrichment score of the (transformed) gene-level statistic values for the genes in the set, as described in Subramanian et al.

  • gss.fisherExactTest: Performs Fisher's exact test to check gene sets for overrepresentation in the differential genes. This should be used used in combination with the transformation transformation.adjustAndBinarize.

  • gss.gsz: Calculates the Gene Set Z-score of the (transformed) gene-level statistic values for the genes in the set. Described in Toronen et al.

Value

Each method returns a single numeric value, the gene set statistic for the supplied gene set.

References

Efron, B., Tibshirani, R. (2007) On testing the significance of sets of genes. Annals of Applied Statistics, 1, 107-129.

Subramanian, A., Tamayo, P., Mootha, V. K., Mukherjee, S., Ebert, B. L., Gillette, M. A., Paulovich, A., Pomeroy, S. L., Golub, T. R., Lander, E. S., Mesirov, J. P. (2005) Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proceedings of the National Academy of Science of the United States of America, 102, 15545–15550.

Toronen, P., Ojala, P. J., Marttinen, P., Holm L. (2009) Robust extraction of functional signals from gene set analysis using a generalized threshold free scoring function. BMC Bioinformatics, 10(1), 307.

See Also

geneSetAnalysis, gsAnalysis, gls, transformation


GiANT documentation built on Sept. 11, 2024, 9:16 p.m.