Discrepancy is the number of mismatches between a packed (binary) matrix and the maximally packed matrix (with same row sums)
A matrix (or something that can be transformed into a matrix when
Discrepancy is a way to measure the nestedness of a matrix. In a comparative study, Ulrich & Gotelli (2007) showed discrepancy to outperform all other measures and hence recommend its use (together with a fixed-columns, fixed-rows null model, such as implemented in
commsimulator in vegan, see example).
This function follows the logic laid out by Brualdi & Sanderson (1999), although, admittedly, I find their mathematical description highly confusing. Another implementation is given by the function
nesteddisc in vegan. The reason to write a new function is simple: I wasn't aware of
nesteddisc! (I was sitting on a train and I wanted to use this measure later on, so I put it into a function consulting only the orignal paper. When looking for the swap algorithm to create null models, which I somehow knew to exist in vegan, I stumbled across
nesteddisc. If you are interested in the swap algorithm and come across this help page, let me re-direct you to
oecosimu in vegan.)
Now that this function exists, too, I found it to differ in output from
nesteddisc. Jari Oksanen was quick to point out, that our two implementations differ in the way they handle ties in column totals. This function is, I think, closer to the results given in Brualdi & Sanderson. Jari also went on to implement different strategies to deal with ties, so my guess is that his version may be (slightly) superior to this one. Having said that, values don't differ much between the two implementations.
So what does it do: The matrix is sorted by marginal totals, yielding a matrix A. Then, all 1s in A are “pushed” to the left to maximally compact the matrix, yielding P. Discrepancy is now simply the number of disagreements between A and P, divided by two (to correct for the fact that every “wrong” 1 will necessarily generate a “wrong” 0).
Returns the number of mismatches, i.e. the discrepancy of the matrix from perfecct nestedness.
Discrepancy is well-defined only for matrices that can be sorted uniquely. For matrices with ties no foolproof way to handle them has been proposed. For small matrices, or large matrices with many ties, this will lead to different discrepancy values. See also how
nesteddisc in vegan handles this issue! (Thanks to Jari Oksanen for pointing this out!)
Carsten F. Dormann
Brualdi, R.A. and Sanderson, J.G. (1999) Nested species subsets, gaps, and discrepancy. Oecologia 119, 256–264
Ulrich, W. and Gotelli, N.J. (2007) Disentangling community patterns of nestedness and species co-occurrence. Oikos 116, 2053–2061
nestednodf in vegan for the best nestedness algorithm in existence today (for both binary and weighted networks!);
nestedness for the most commonly used method to calculate nestedness,
wine for a new, unevaluated but very fast way to calculate nestedness;
nestedtemp (another implementation of the same method used in our
nestedn0 (calculating the number of missing species, which has been shown to be a poor measure of nestedness) in vegan
1 2 3 4 5 6 7 8 9 10 11 12 13
## Not run: #nulls <- replicate(100, discrepancy(commsimulator(Safariland, method="quasiswap"))) nulls <- simulate(vegan::nullmodel(Safariland, method="quasiswap"), nsim = 100) null.res <- apply(nulls, 3, discrepancy) hist(null.res) obs <- discrepancy(Safariland) abline(v=obs, lwd=3, col="grey") c("p value"=min(sum(null.res>obs), sum(null.res<obs))/length(null.res)) # calculate Brualdi & Sanderson's Na-value (i.e. the z-score): c("N_a"=(unname(obs)-mean(null.res))/sd(null.res)) ## End(Not run)