Gene Set Collection Indexes from the MSigDB Database

Description

It prepares the MSigDB gene set collections to be used for the EGSEA analysis.

Usage

1
2
buildMSigDBIdx(entrezIDs, geneSets = "all", species = "Homo sapiens",
  min.size = 1)

Arguments

entrezIDs

character, a vector that stores the Entrez Gene IDs tagged in your dataset. The order of the Entrez Gene IDs should match those of the count/expression matrix row names.

geneSets

character, a vector determines which gene set collections should be used from the MSigDB. It can take values from this list: "all", "h", "c1", "c2", "c3", "c4", "c5", "c6","c7". "c1" is human specific. If "all", all available gene set collections are loaded.

species

character, determine the organism of selected gene sets: "human", "mouse" or "rat".

min.size

integer, the minium number of genes required in a testing gene set

Details

It indexes the MSigDB gene sets and loads gene set annotation.

Value

indexed gene set annotation that can be used with other functions in the package. Each annotation is a list of seven elements: original stores the original gene sets, idx stores the indexed gene sets, anno that stores detailed annotation for each gene set, label a unique id that identifies the collection of gene sets, featureIDs stores the entrezIDs used in building the annotation, species stores that organism name of gene sets and name stores the collection name to be used in the EGSEA report.

Examples

1
2
3
4
5
6
library(EGSEAdata)
data(il13.data)
v = il13.data$voom
gs.annots = buildMSigDBIdx(entrezIDs=rownames(v$E), geneSets=c("h", 
"c2"), species="human")
names(gs.annots)