The goal of kldest is to estimate Kullback-Leibler (KL) divergence $D_{KL}(P||Q)$ between two probability distributions $P$ and $Q$ based on a sample $x_1, \dots, x_n$ from $P$, and either a sample $y_1, \dots, y_m$ from $Q$ or the density function $q$ of $Q$.
The distributions $P$ and $Q$ may be uni- or multivariate, and they may be discrete, continuous or mixed discrete/continuous.
Different estimation algorithms are provided for continuous distributions, based on either nearest neighbour density estimation or kernel density estimation. Confidence intervals for KL divergence can also be computed, either via subsampling (preferred) or via bootstrapping.
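As a reminder of the quantity being estimated: for discrete distributions, $D_{KL}(P||Q) = \sum_i p_i \log(p_i/q_i)$. A minimal base-R illustration of this definition:

p <- c(0.5, 0.5)
q <- c(0.9, 0.1)
sum(p * log(p / q))   # KL divergence of P from Q, by definition
#> [1] 0.5108256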
You can install kldest from CRAN:
install.packages("kldest")
Alternatively, you can install the development version of kldest from GitHub with:
# install.packages("devtools")
devtools::install_github("niklhart/kldest")
KL divergence estimation based on nearest neighbour density estimates is the most flexible approach.
library(kldest)
# Set a seed for reproducibility
set.seed(0)
Analytical KL divergence:
kld_gaussian(mu1 = 0, sigma1 = 1, mu2 = 1, sigma2 = 2^2)
#> [1] 0.4431472
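This value can be checked against the closed-form expression for univariate Gaussians, $D_{KL} = \frac{1}{2}\left(\log\frac{\sigma_2^2}{\sigma_1^2} + \frac{\sigma_1^2 + (\mu_1-\mu_2)^2}{\sigma_2^2} - 1\right)$; note that sigma1 and sigma2 are variances here, as the call sigma2 = 2^2 above suggests:

(log(4/1) + (1 + 1)/4 - 1)/2
#> [1] 0.4431472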
Estimate based on two samples from these Gaussians:
X <- rnorm(100)
Y <- rnorm(100, mean = 1, sd = 2)
kld_est_nn(X, Y)
#> [1] 0.2169136
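For one-dimensional samples such as these, kernel density based estimation is available as well. A minimal sketch, assuming the one-dimensional estimator kld_est_kde1 (output omitted, since the value depends on the seed and on bandwidth selection):

kld_est_kde1(X, Y)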
Estimate based on a sample from the first Gaussian and the density of the second:
q <- function(x) dnorm(x, mean = 1, sd = 2)
kld_est_nn(X, q = q)
#> [1] 0.6374628
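With a larger sample from $P$, the one-sample estimate typically moves closer to the analytical value of about 0.443. A sketch (output omitted, since it depends on the random draw):

kld_est_nn(rnorm(10000), q = q)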
Uncertainty quantification via subsampling:
kld_ci_subsampling(X, q = q)
#> $est
#> [1] 0.6374628
#>
#> $ci
#> 2.5% 97.5%
#> 0.2601375 0.9008446
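Bootstrapping is the alternative mentioned above; subsampling is preferred for nearest neighbour based estimators, since bootstrap resamples contain duplicate data points. A sketch for the two-sample case, assuming the bootstrap interface kld_ci_bootstrap mirrors the subsampling one (check the package reference; output omitted since it varies with the resampling draws):

# assumed interface, analogous to kld_ci_subsampling(X, Y)
kld_ci_bootstrap(X, Y)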
Analytical KL divergence between an uncorrelated and a correlated Gaussian:
kld_gaussian(mu1 = rep(0,2), sigma1 = diag(2),
             mu2 = rep(0,2), sigma2 = matrix(c(1,1,1,2), nrow = 2))
#> [1] 0.5
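This, too, agrees with the closed form for zero-mean Gaussians, $D_{KL} = \frac{1}{2}\left(\mathrm{tr}(\Sigma_2^{-1}\Sigma_1) - d + \log\frac{\det \Sigma_2}{\det \Sigma_1}\right)$:

Sigma2 <- matrix(c(1,1,1,2), nrow = 2)
# Sigma1 = diag(2), so tr(solve(Sigma2) %*% Sigma1) = sum(diag(solve(Sigma2)))
(sum(diag(solve(Sigma2))) - 2 + log(det(Sigma2)))/2
#> [1] 0.5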
Estimate based on two samples from these Gaussians:
X1 <- rnorm(100)
X2 <- rnorm(100)
Y1 <- rnorm(100)
Y2 <- Y1 + rnorm(100)
X <- cbind(X1,X2)
Y <- cbind(Y1,Y2)
kld_est_nn(X, Y)
#> [1] 0.3358918
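Subsampling confidence intervals also apply in the two-sample, multivariate setting. A sketch (output omitted, since it varies with the random subsample draws):

kld_ci_subsampling(X, Y)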