WebGestaltR | R Documentation |
Main function for enrichment analysis
WebGestaltR(
enrichMethod = "ORA",
organism = "hsapiens",
enrichDatabase = NULL,
enrichDatabaseFile = NULL,
enrichDatabaseType = NULL,
enrichDatabaseDescriptionFile = NULL,
interestGeneFile = NULL,
interestGene = NULL,
interestGeneType = NULL,
collapseMethod = "mean",
referenceGeneFile = NULL,
referenceGene = NULL,
referenceGeneType = NULL,
referenceSet = NULL,
minNum = 10,
maxNum = 500,
sigMethod = "fdr",
fdrMethod = "BH",
fdrThr = 0.05,
topThr = 10,
reportNum = 20,
perNum = 1000,
gseaP = 1,
isOutput = TRUE,
outputDirectory = getwd(),
projectName = NULL,
dagColor = "continuous",
saveRawGseaResult = FALSE,
gseaPlotFormat = c("png", "svg"),
setCoverNum = 10,
networkConstructionMethod = NULL,
neighborNum = 10,
highlightType = "Seeds",
highlightSeedNum = 10,
nThreads = 1,
cache = NULL,
hostName = "https://www.webgestalt.org/",
...
)
WebGestaltRBatch(
interestGeneFolder = NULL,
enrichMethod = "ORA",
isParallel = FALSE,
nThreads = 3,
...
)
enrichMethod |
Enrichment methods: |
organism |
Currently, WebGestaltR supports 12 organisms. Users can use the function
|
enrichDatabase |
The functional categories for the enrichment analysis. Users can use
the function |
enrichDatabaseFile |
Users can provide one or more GMT files as the functional
category for enrichment analysis. The extension of the file should be |
enrichDatabaseType |
The ID type of the genes in the |
enrichDatabaseDescriptionFile |
Users can also provide description files for the custom
|
interestGeneFile |
If |
interestGene |
Users can also use an R object as the input. If |
interestGeneType |
The ID type of the interesting gene list. The supported ID types of
WebGestaltR for the selected organism can be found by the function |
collapseMethod |
The method to collapse duplicate IDs with scores. |
referenceGeneFile |
For the ORA method, the users need to upload the reference gene
list. The extension of the |
referenceGene |
For the ORA method, users can also use an R object as the reference
gene list. |
referenceGeneType |
The ID type of the reference gene list. The supported ID types
of WebGestaltR for the selected organism can be found by the function |
referenceSet |
Users can directly select the reference set from existing platforms in
WebGestaltR and do not need to provide the reference set through |
minNum |
WebGestaltR will exclude the categories with the number of annotated genes
less than |
maxNum |
WebGestaltR will exclude the categories with the number of annotated genes
larger than |
sigMethod |
Two methods of significance are available in WebGestaltR: |
fdrMethod |
For the ORA method, WebGestaltR supports five FDR methods: |
fdrThr |
The significant threshold for the |
topThr |
The threshold for the |
reportNum |
The number of enriched categories visualized in the final report. The default
is |
perNum |
The number of permutations for the GSEA method. The default is |
gseaP |
The exponential scaling factor of the phenotype score. The default is |
isOutput |
If |
outputDirectory |
The output directory for the results. |
projectName |
The name of the project. If |
dagColor |
If |
saveRawGseaResult |
Whether the raw result from GSEA is saved as a RDS file, which can be
used for plotting. Defaults to
|
gseaPlotFormat |
The graphic format of GSEA enrichment plots. Either |
setCoverNum |
The number of expected gene sets after set cover to reduce redundancy.
It could get fewer sets if the coverage reaches 100%. The default is |
networkConstructionMethod |
Netowrk construction method for NTA. Either
|
neighborNum |
The number of neighbors to include in NTA Network Expansion method. |
highlightType |
The type of nodes to highlight in the NTA Network Expansion method,
either |
highlightSeedNum |
The number of top input seeds to highlight in NTA Network Retrieval & Prioritizaiton method. |
nThreads |
The number of cores to use for GSEA and set cover, and in batch function. |
cache |
A directory to save data cache for reuse. Defaults to |
hostName |
The server URL for accessing data. Mostly for development purposes. |
... |
In batch function, passes parameters to WebGestaltR function. Also handles backward compatibility for some parameters in old versions. |
interestGeneFolder |
Run WebGestaltR for gene list files in the folder. |
isParallel |
If jobs are run parallelly in the batch. |
WebGestaltR function can perform three enrichment analyses: ORA (Over-Representation Analysis) and GSEA (Gene Set Enrichment Analysis).and NTA (Network Topology Analysis). Based on the user-uploaded gene list or gene list with scores, WebGestaltR function will first map the gene list to the entrez gene ids and then summarize the gene list based on the GO (Gene Ontology) Slim. After performing the enrichment analysis, WebGestaltR function also returns a user-friendly HTML report containing GO Slim summary and the enrichment analysis result. If functional categories have DAG (directed acyclic graph) structure or genes in the functional categories have network structure, those relationship can also be visualized in the report.
The WebGestaltR function returns a data frame containing the enrichment analysis
result and also outputs an user-friendly HTML report if isOutput
is TRUE
.
The columns in the data frame depend on the enrichMethod
and they are the following:
ID of the gene set.
Description of the gene set if available.
Link to the data source.
The number of genes in the set after filtering by minNum
and maxNum
.
The number of mapped input genes that are annotated in the gene set.
Expected number of input genes that are annotated in the gene set.
Enrichment ratio, overlap / expect.
Enrichment score, the maximum running sum of scores for the ranked list.
Normalized enrichment score, normalized against the average enrichment score of all permutations.
Number of genes/phosphosites in the leading edge.
P-value from hypergeometric test for ORA. For GSEA, please refer to its original publication or online at https://software.broadinstitute.org/gsea/doc/GSEAUserGuideTEXT.htm.
Corrected P-value for mulilple testing with fdrMethod
for ORA.
The gene/phosphosite IDs of overlap
for ORA (entrez gene IDs or
phosphosite sequence).
Genes/phosphosites in the leading edge in entrez gene ID or phosphosite sequence.
The gene/phosphosite IDs of overlap
for ORA or leadingEdgeId
for GSEA in User input IDs.
Path of the GSEA enrichment plot.
Name of the source database if multiple enrichment databases are given.
In NTA, like geneSet
, the enriched GO terms of genes in the
returned subnetwork.
In NTA, the gene IDs in the subnetwork with 0/1 annotations indicating if it is from user input.
The WebGestaltRBatch function returns a list of enrichment results.
## Not run:
####### ORA example #########
geneFile <- system.file("extdata", "interestingGenes.txt", package="WebGestaltR")
refFile <- system.file("extdata", "referenceGenes.txt", package="WebGestaltR")
outputDirectory <- getwd()
enrichResult <- WebGestaltR(enrichMethod="ORA", organism="hsapiens",
enrichDatabase="pathway_KEGG", interestGeneFile=geneFile,
interestGeneType="genesymbol", referenceGeneFile=refFile,
referenceGeneType="genesymbol", isOutput=TRUE,
outputDirectory=outputDirectory, projectName=NULL)
####### GSEA example #########
rankFile <- system.file("extdata", "GeneRankList.rnk", package="WebGestaltR")
outputDirectory <- getwd()
enrichResult <- WebGestaltR(enrichMethod="GSEA", organism="hsapiens",
enrichDatabase="pathway_KEGG", interestGeneFile=rankFile,
interestGeneType="genesymbol", sigMethod="top", topThr=10, minNum=5,
outputDirectory=outputDirectory)
####### NTA example #########
enrichResult <- WebGestaltR(enrichMethod="NTA", organism="hsapiens",
enrichDatabase="network_PPI_BIOGRID", interestGeneFile=geneFile,
interestGeneType="genesymbol", sigMethod="top", topThr=10,
outputDirectory=getwd(), highlightSeedNum=10,
networkConstructionMethod="Network_Retrieval_Prioritization")
## End(Not run)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.