HD | R Documentation |
Hellinger distance (HD
) and a related utility measure (HDutility
)
described in the reference below.
The utility measure is made to be bounded between 0 and 1.
HD(f, g)
HDutility(f, g)
f |
Vector of original counts |
g |
Vector of perturbed counts |
HD is defined as "sqrt(sum((sqrt(f) - sqrt(g))^2)/2)
" and
HDutility is defined as "1 - HD(f, g)/sqrt(sum(f))
".
Hellinger distance or related utility measure
Shlomo, N., Antal, L., & Elliot, M. (2015). Measuring Disclosure Risk and Data Utility for Flexible Table Generators, Journal of Official Statistics, 31(2), 305-324. \Sexpr[results=rd]{tools:::Rd_expr_doi("10.1515/jos-2015-0019")}
f <- 1:6
g <- c(0, 3, 3, 3, 6, 6)
print(c(
HD = HD(f, g),
HDutility = HDutility(f, g),
maxdiff = max(abs(g - f)),
meanAbsDiff = mean(abs(g - f)),
rootMeanSquare = sqrt(mean((g - f)^2))
))
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.