HD | R Documentation |
Hellinger distance (HD) and a related utility measure (HDutility) described in the reference below.
The utility measure is scaled so that it is bounded between 0 and 1.
HD(f, g)
HDutility(f, g)
f | Vector of original counts |
g | Vector of perturbed counts |
HD is defined as "sqrt(sum((sqrt(f) - sqrt(g))^2)/2)" and HDutility is defined as "1 - HD(f, g)/sqrt(sum(f))".
Hellinger distance or related utility measure
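The two definitions quoted above can be written out directly in base R. The following is only a minimal sketch under stated assumptions: the names hd_sketch and hd_utility_sketch are hypothetical and are not the package's own code, and the input checks are an added assumption (non-negative counts of equal length).

```r
# Hypothetical re-implementation of the quoted formulas (not the package source).
hd_sketch <- function(f, g) {
  # Assumed input checks: equal length, non-negative counts
  stopifnot(length(f) == length(g), all(f >= 0), all(g >= 0))
  sqrt(sum((sqrt(f) - sqrt(g))^2) / 2)
}

hd_utility_sketch <- function(f, g) {
  # 1 minus the distance, scaled by the square root of the original total
  1 - hd_sketch(f, g) / sqrt(sum(f))
}

f <- 1:6
g <- c(0, 3, 3, 3, 6, 6)
print(c(HD = hd_sketch(f, g), HDutility = hd_utility_sketch(f, g)))

# Identical vectors: distance 0, so the utility is 1
print(hd_utility_sketch(f, f))

# Same total but no overlapping cells: the utility is (approximately) 0
print(hd_utility_sketch(c(5, 0), c(0, 5)))
```

The last two calls illustrate the ends of the 0 to 1 range: unperturbed counts give a utility of 1, while perturbed counts with the same total placed entirely in different cells give a utility of about 0.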
Shlomo, N., Antal, L., & Elliot, M. (2015). Measuring Disclosure Risk and Data Utility for Flexible Table Generators, Journal of Official Statistics, 31(2), 305-324. doi: 10.1515/jos-2015-0019
f <- 1:6
g <- c(0, 3, 3, 3, 6, 6)
print(c(
  HD = HD(f, g),
  HDutility = HDutility(f, g),
  maxdiff = max(abs(g - f)),
  meanAbsDiff = mean(abs(g - f)),
  rootMeanSquare = sqrt(mean((g - f)^2))
))