binaryDistance: Binary Distance Definitions Across Methods
In Mercator: Clustering and Visualizing Distance Matrices

binaryDistance

R Documentation

Binary Distance Definitions Across Methods

Description

The binaryDistance function defines various similarity or distance measures between binary vectors, which represent the first step in the algorithm underlying the Mercator visualizations.

Usage

binaryDistance(X, metric)

Arguments

`X`	An object of class `matrix`.
`metric`	An object of class `character` limited to the names of 10 selected distance metrics: `jaccard`, `sokalMichener`, `hamming`, `russellRao`, `pearson`, `goodmanKruskal`, `manhattan`, `canberra`, `binary`, or `euclid`.

Details

Similarity or difference between binary vectors can be calculated using a variety of distance measures. In the main reference (below), Choi and colleagues reviewed 76 different measures of similarity of distance between binary vectors. They also produced a hierarchical clustering of these measures, based on the correlation between their distance values on multiple simulated data sets. For metrics that are highly similar, we chose a single representative.

Cluster 1, represented by the jaccard distance, contains Dice & Sorenson, Ochiai, Kulcyznski, Bray & Curtis, Baroni-Urbani & Buser, and Jaccard.

Cluster 2, represented by the sokalMichener distance, contains Sokal & Sneath, Gilbert & Wells, Gower & Legendre, Pearson & Heron, Hamming, and Sokal & Michener. Also within this cluster are 4 distances represented independently within this function: hamming, manhattan, canberra, and euclidean distances

Cluster 3, represented by the russellRao distance, contains Driver & Kroeber, Forbes, Fossum, and Russell & Rao.

The remaining metrics are more isolated, without strong clustering. We considered a few examples, including the Pearson distance (pearson) and the Goodman & Kruskal distance (goodmanKruskal). The binary distance is also included.

Value

Returns an object of class dist corresponding to the distance metric provided.

Note

Although the distance metrics provided in the binaryDistance function are explicitly offered for use on matrices of binary vectors, some metrics may return useful distances when applied to non-binary matrices.

Author(s)

Kevin R. Coombes <krc@silicovore.com>, Caitlin E. Coombes

References

Choi SS, Cha SH, Tappert CC, A Survey of Binary Similarity and Distance Measures. Systemics, Cybernetics, and Informatics. 2010; 8(1):43-48.

Examples

my.matrix <- matrix(rbinom(50*100, 1, 0.15), ncol=50)
my.dist <- binaryDistance(my.matrix, "jaccard")

Mercator documentation built on May 16, 2025, 3:01 a.m.

Mercator index

Mercator for Continuous Data" UMAP and SOM from Distance Matrices" Using the Mercator Package"

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

Mercator
Clustering and Visualizing Distance Matrices

binaryDistance: Binary Distance Definitions Across Methods
In Mercator: Clustering and Visualizing Distance Matrices

Binary Distance Definitions Across Methods

Description

Usage

Arguments

Details

Value

Note

Author(s)

References

See Also

Examples

Related to binaryDistance in Mercator...

R Package Documentation

Browse R Packages

We want your feedback!

Mercator Clustering and Visualizing Distance Matrices

binaryDistance: Binary Distance Definitions Across Methods In Mercator: Clustering and Visualizing Distance Matrices

Binary Distance Definitions Across Methods

Description

Usage

Arguments

Details

Value

Note

Author(s)

References

See Also

Examples

Related to binaryDistance in Mercator...

R Package Documentation

Browse R Packages

We want your feedback!

Mercator
Clustering and Visualizing Distance Matrices

binaryDistance: Binary Distance Definitions Across Methods
In Mercator: Clustering and Visualizing Distance Matrices