hdist | R Documentation |
hdist
returns the Hamming distance between the labeled graphs g1
and g2
in set dat
for dichotomous data, or else the absolute (manhattan) distance. If normalize
is true, this distance is divided by its dichotomous theoretical maximum (conditional on |V(G)|).
hdist(dat, dat2=NULL, g1=NULL, g2=NULL, normalize=FALSE,
diag=FALSE, mode="digraph")
dat |
a stack of input graphs. |
dat2 |
a second graph stack (optional). |
g1 |
a vector indicating which graphs to compare (by default, all elements of |
g2 |
a vector indicating against which the graphs of |
normalize |
divide by the number of available dyads? |
diag |
boolean indicating whether or not the diagonal should be treated as valid data. Set this true if and only if the data can contain loops. |
mode |
string indicating the type of graph being evaluated. "digraph" indicates that edges should be interpreted as directed; "graph" indicates that edges are undirected. |
The Hamming distance between two labeled graphs G_1
and G_2
is equal to |\{e : (e \in E(G_1), e \not\in E(G_2)) \wedge (e \not\in E(G_1), e \in E(G_2))\}|
. In more prosaic terms, this may be thought of as the number of addition/deletion operations required to turn the edge set of G_1
into that of G_2
. The Hamming distance is a highly general measure of structural similarity, and forms a metric on the space of graphs (simple or directed). Users should be reminded, however, that the Hamming distance is extremely sensitive to nodal labeling, and should not be employed directly when nodes are interchangeable. The structural distance (Butts and Carley (2001)), implemented in structdist
, provides a natural generalization of the Hamming distance to the more general case of unlabeled graphs.
Null hypothesis testing for Hamming distances is available via cugtest
, and qaptest
; graphs which minimize the Hamming distances to all members of a graph set can be found by centralgraph
. For an alternative means of comparing the similarity of graphs, consider gcor
.
A matrix of Hamming distances
For non-dichotomous data, the distance which is returned is simply the sum of the absolute edge-wise differences.
Carter T. Butts buttsc@uci.edu
Banks, D., and Carley, K.M. (1994). “Metric Inference for Social Networks.” Journal of Classification, 11(1), 121-49.
Butts, C.T. and Carley, K.M. (2005). “Some Simple Algorithms for Structural Comparison.” Computational and Mathematical Organization Theory, 11(4), 291-305.
Butts, C.T., and Carley, K.M. (2001). “Multivariate Methods for Interstructural Analysis.” CASOS Working Paper, Carnegie Mellon University.
Hamming, R.W. (1950). “Error Detecting and Error Correcting Codes.” Bell System Technical Journal, 29, 147-160.
sdmat
, structdist
#Get some random graphs
g<-rgraph(5,5,tprob=runif(5,0,1))
#Find the Hamming distances
hdist(g)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.