clipper: clipper
In ToPASeq: Topology-based pathway analysis of RNA-seq data

Description Usage Arguments Value Author(s) References Examples

clipper is a method for topological gene set analysis. It implements a two-step empirical approach based on the exploitation of graph decomposition into a junction tree to reconstruct the most relevant signal path. In the first step clipper selects significant pathways according to statistical tests on the means and the concentration matrices of the graphs derived from pathway topologies. Then, it "clips" the whole pathway identifying the signal paths having the greatest association with a specific phenotype.

clipper(x, group, pathways, type, which = "proteins", edgeType = NULL,
  preparePaths = TRUE, norm.method = NULL, test.method = NULL,
  method = "mean", testCliques = FALSE, nperm = 1000,
  alphaV = 0.05, both.directions = TRUE, maxNodes = 150,
  minEdges = 0, commonTh = 2, filterSPIA = FALSE,
  convertTo = "none", convertBy = NULL)

`x`	An `ExpressionSet` object or a gene expression data matrix or count matrix, rows refer to genes, columns to samples
`group`	Name or number of the phenoData column or a character vector or factor that contains required class assigments
`pathways`	A list of pathways in a form from `graphite` package or created by `preparePathways()`
`type`	Type of the input data, `"MA"` for microarray and `"RNASeq"` for RNA-Seq
`which`	Character, which type of nodes is preserved in a pathway. Possible values are `"proteins"`,`"metabolites"`,`"mixed"`
`edgeType`	Character, which type of edges is preserved in a pathway. If `NULL`, all edges are kept.
`preparePaths`	Logical, by default the pathways are transformed with `preparePathways()`. Use `FALSE`, if you have done this transformation separately
`norm.method`	Character, the method to normalize RNAseq data. If `NULL` then vst-normalization is performed. Possible values are: `"edgeR", "vst", "rLog", "none"`
`test.method`	Character, the method for differentiall expression analysis of RNAseq data. If `NULL` then `"voomlimma"` is used. Possible values are: `"DESeq2", "voomlimma", "vstlimma", "edgeR"`. This analysis is needed only for the visualization.
`method`	Character, `"mean"` or `"var"`, the kind of test to perform on the cliques
`testCliques`	Logical, if `TRUE` then the test is applied also on the cliques of the each pathway. It is a very time consuming calculation, especially for many or big pathways
`nperm`	Number of permutations, if `0` then asymptotic distribution is used. May not be valid when shrinked estimator is used.
`alphaV`	Numeric, the threshold for variance test. The calculation of mean test depends on the result of variance test.
`both.directions, maxNodes, minEdges, commonTh, filterSPIA, convertTo, convertBy`	Arguments for the `preparePathways()`

A list:

`res`	A list. First slot is a data frame containing p-values and q-values of mean and variance tests on pathways. The second slot is a list containing data.frames of the most affected paths in each pathway. The columns of the data frames contain: 1 - Index of the starting clique 2 - Index of the ending clique 3 - Index of the clique where the maximum value is reached 4 - length of the path 5 - maximum score of the path 6 - average score along the path 7 - percentage of path activation 8 - impact of the path on the entire pathway 9 - clique involved and significant 10 - clique forming the path 11 - genes forming the significant cliques 12 - genes forming the path
`topo.sig`	if `testCliques=TRUE`, a list where each slot contains the pvalues and a list of cliques in one pathway. `NULL` otherwise
`degtest`	A data.frame of gene-level differential expression statistics

Ivana Ihnatova

Martini P, Sales G, Massa MS, Chiogna M, Romualdi C. Along signal paths: an empirical gene set approach exploiting pathway topology. Nucleic Acids Res. 2013 Jan 7;41(1):e19. doi: 10.1093/nar/gks866. Epub 2012 Sep 21. PubMed PMID: 23002139; PubMed Central PMCID: PMC3592432.

## Not run: 
if (require(breastCancerVDX)) {
data("vdx")
pathways<-pathways("hsapiens","biocarta")[1:3]
MAdata<-Biobase::exprs(vdx)[,1:10]
rownames(MAdata)<-Biobase::fData(vdx)[,"Gene.symbol"]
MAdata<-MAdata[!duplicated(rownames(MAdata)),]

clipper(MAdata, Biobase::pData(vdx)[,"er"][1:10], pathways, type="MA", convertTo="SYMBOL", nperm=10)
}

## End(Not run)