Description Usage Arguments Value Examples
View source: R/HTSanalyzeR2Pipe.R
This function performs a complete analyses of common phenotype data.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29  | HTSanalyzeR2Pipe(
  data4enrich,
  hits = character(),
  doGSOA = FALSE,
  doGSEA = TRUE,
  listOfGeneSetCollections,
  species = "Hs",
  initialIDs = "SYMBOL",
  keepMultipleMappings = TRUE,
  duplicateRemoverMethod = "max",
  orderAbsValue = FALSE,
  pValueCutoff = 0.05,
  pAdjustMethod = "BH",
  nPermutations = 1000,
  cores = 1,
  minGeneSetSize = 15,
  exponent = 1,
  verbose = TRUE,
  GSEA.by = "HTSanalyzeR2",
  keggGSCs = NULL,
  goGSCs = NULL,
  msigdbGSCs = NULL,
  doNWA = FALSE,
  nwaPvalues = NULL,
  interactionMatrix = NULL,
  reportDir = "HTSanalyzerReport",
  nwAnalysisGenetic = FALSE,
  nwAnalysisFdr = 0.001
)
 | 
data4enrich | 
 A numeric or integer vector of phenotypes named by gene identifiers.  | 
hits | 
 A character vector of the gene identifiers (used as hits in the hypergeometric tests). It's needed if you want to do GSOA (gene set overrepresentation analysis).  | 
doGSOA | 
 A logic value specifying whether to do hypergeometric test or not, default is FALSE.  | 
doGSEA | 
 A logic value specifying whether to do gene set enrichment analysis or not, default is TRUE.  | 
listOfGeneSetCollections | 
 A list of gene set collections (a 'gene set collection' is a list of gene sets).  | 
species | 
 A single character value specifying the species for which the data should be read.  | 
initialIDs | 
 A single character value specifying the type of initial identifiers for input geneList  | 
keepMultipleMappings | 
 A single logical value. If TRUE, the function keeps the entries with multiple mappings (first mapping is kept). If FALSE, the entries with multiple mappings will be discarded.  | 
duplicateRemoverMethod | 
 A single character value specifying the method to remove the duplicates. See duplicateRemover for details.  | 
orderAbsValue | 
 A single logical value indicating whether the values should be converted to absolute values and then ordered (if TRUE), or ordered as they are (if FALSE).  | 
pValueCutoff | 
 A single numeric value specifying the cutoff for p-values considered significant in gene set collection analysis.  | 
pAdjustMethod | 
 A single character value specifying the p-value adjustment method to be used (see 'p.adjust' for details) in gene set collection analysis.  | 
nPermutations | 
 A single integer or numeric value specifying the number of permutations for deriving p-values in GSEA.  | 
cores | 
 A single integer or numeric value specifying the number of cores to be used for GSEA.  | 
minGeneSetSize | 
 A single integer or numeric value specifying the minimum number of elements shared by a gene set and the input total genes. Gene sets with fewer than this number are removed from both hypergeometric analysis and GSEA.  | 
exponent | 
 A single integer or numeric value used in weighting phenotypes in GSEA.  | 
verbose | 
 A single logical value specifying to display detailed messages (when verbose=TRUE) or not (when verbose=FALSE)  | 
GSEA.by | 
 A single character value to choose which algorithm to do GSEA. Valid value
could either be "HTSanalyzeR2"(default) or "fgsea". If performed by "fgsea", the result explanation
please refer to   | 
keggGSCs | 
 A character vector of names of all KEGG gene set collections.  | 
goGSCs | 
 A character vector of names of all GO gene set collections.  | 
msigdbGSCs | 
 A character vector of names of all MSigDB gene set collections.  | 
doNWA | 
 A logic value specifying whether to do subnetwork analysis or not, default is FALSE.  | 
nwaPvalues | 
 A single numeric value specifying the false discovery for the scoring of nodes in NWA analysis (see BioNet::scoreNodes and Dittrich et al., 2008 for details)  | 
interactionMatrix | 
 An interaction matrix including columns 'InteractionType', 'InteractorA' and 'InteractorB'. If this matrix is available, the interactome can be directly built based on it.  | 
reportDir | 
 A single character value specifying the directory to store reports. For default the enrichment analysis reports will be stored in the directory called "HTSanalyzerReport".  | 
nwAnalysisGenetic | 
 A single logical value. If TRUE, genetic interactions will be kept; otherwise, they will be removed from the data set.  | 
nwAnalysisFdr | 
 A single numeric value specifying the false discovery for the scoring of nodes (see BioNet::scoreNodes and Dittrich et al., 2008 for details)  | 
This pipeline function will finally return a list of GSCA object and NWA object.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37  | ## Not run: 
library(GO.db)
library(org.Hs.eg.db)
library(KEGGREST)
data(d7)
## define data4enrich
data4enrich <- as.vector(d7$neg.lfc)
names(data4enrich) <- d7$id
## select hits if you also want to do GSOA, otherwise ignore it
hits <-  names(data4enrich[which(abs(data4enrich) > 2)])
## set up a list of gene set collections
GO_MF <- GOGeneSets(species="Hs", ontologies=c("MF"))
PW_KEGG <- KeggGeneSets(species="Hs")
ListGSC <- list(GO_MF=GO_MF, PW_KEGG=PW_KEGG)
## start analysis
rslt <- HTSanalyzeR2Pipe(data4enrich = data4enrich,
                         hits = hits,
                         doGSOA = TRUE,
                         doGSEA = TRUE,
                         listOfGeneSetCollections = ListGSC,
                         species = "Hs",
                         initialIDs = "SYMBOL",
                         pValueCutoff = 0.05,
                         nPermutations = 1000,
                         cores = 2,
                         minGeneSetSize = 100,
                         keggGSCs=c("PW_KEGG"),
                         goGSCs = c("GO_MF"),
                         doNWA = FALSE)
report(rslt$gsca)
## End(Not run)
 | 
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.