Description Usage Arguments Value Note Author(s) See Also
View source: R/plot_gsea.leadingedge.HCL.R
This function compares genesets in terms of the overlap of the leading edge
genes
within each geneset. This information is presented as a hierarchical
clustering plot,
where distance is measured proportional to the number of genes in common
(see jacquard
).
The user can either pass in a leading edge list (ie a list where each
element is
a vector of gene symbols), or (as will be more common),
an entire leading edge list (eg 1434 elements for c2_all), in conjunction
with a GSEA
top table (tt), coupled with a threshold, such as the top 50, or those with
FDR < 0.25
via the N, FDR, P, or FWER arguments.
1 2 3 4 5 6 | plot_gsea.leadingedge.HCL(x, N = NULL, NES = NULL, FDR = NULL, P = NULL,
FWER = NULL, direction = c("either", "up", "down"),
main = "Leading edge similarities", xlab = "", yaxis = c("similarity",
"distance"), max.distance = 1, rename.genesets = FALSE,
label.clusters = FALSE, cluster.threshold = 0.99,
hclust.method = "complete", ...)
|
x |
a GSEA object. see |
N |
if not |
NES |
if not |
FDR |
if not |
P |
if not |
FWER |
if not |
direction |
include just the genesets that are “up”, “down”, or “either” means top 50 genesets that are either up or down |
main |
additional arguments passed to plot. |
xlab |
additional arguments passed to plot. |
yaxis |
either “similarity” or “distance”. Two closely related genesets have eg 90% similarity, and a distance of 0.1. |
max.distance |
(experimental) attempt to filter out genesets that are dissimilar to all other genesets. This is a distance threshold, not a similarity threshold, so values of 0.9 are a good place to start. See note. |
rename.genesets |
should the genesets be renamed to include the rank/fdr/direction |
label.clusters |
if TRUE, then draw red boxes around the clusters. see cluster.threshold. |
cluster.threshold |
the distance threshold if label.clusters=TRUE. 0.99 is a good default in practice. |
hclust.method |
the hierarchical clustering method. see |
... |
additional arguments passed to plot. |
invisibly returns the hclust object.
max.distance
The intuitive outcome of setting max.distance
(eg 0.1)
is that no genesets with a distance of >0.1 (ie a similarity of <90%)
should remain in the HCL. This is not quite the case. We filter out genesets
that have distance to all other genesets > 0.9 in the 'distance' object
(see gsea.leadingedge.distance
) which is slightly different to filtering
out nodes in the resulting HCL (which is made upon the distance object).
Net effect:
- You can get singletons in the filtered HCL because all of
the things that the geneset was connected to have been filtered out.
- You can get genesets in a tree of significant genesets, which is itself
below the distance threshold due to a similar explanation to above.
Mark Cowley, 2009-04-06
plot_gsea.leadingedge.HCL
plclust.gsea
gsea.leadingedge.distance
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.