Description Usage Arguments Details Value Author(s) See Also Examples
View source: R/genomedistances.R
Computes the Jaccard distances between all pairs of genomes.
1 | distJaccard(pan.matrix)
|
pan.matrix |
A pan-matrix, see |
The Jaccard index between two sets is defined as the size of the intersection of the sets divided by the size of the union. The Jaccard distance is simply 1 minus the Jaccard index.
The Jaccard distance between two genomes describes their degree of overlap with respect to gene
cluster content. If the Jaccard distance is 0.0, the two genomes contain identical gene clusters.
If it is 1.0 the two genomes are non-overlapping. The difference between a genomic fluidity (see
fluidity
) and a Jaccard distance is small, they both measure overlap between genomes,
but fluidity is computed for the population by averaging over many pairs, while Jaccard distances are
computed for every pair. Note that only presence/absence of gene clusters are considered, not multiple
occurrences.
The input pan.matrix is typically constructed by panMatrix
.
A dist
object (see dist
) containing all pairwise Jaccard distances
between genomes.
Lars Snipen and Kristian Hovde Liland.
1 2 3 4 5 6 7 8 | # Loading a pan-matrix in this package
data(xmpl.panmat)
# Jaccard distances
Jdist <- distJaccard(xmpl.panmat)
# Making a dendrogram based on the distances,
# see example for distManhattan
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.