# KgCor: Kernel Gini Distance Correlation Statistics In GiniDistance: A New Gini Correlation Between Quantitative and Qualitative Variables

## Description

Computes Kernel Gini distance correlation statistics, in which Xs are quantitative, Y are categorical, sigma is kernel standard deviation, alpha is an exponent on the Euclidean distance and returns the kernel Gini mean difference.

## Usage

 1  KgCor(x, y, sigma) 

## Arguments

 x data y label of data or univariate response variable sigma kernel standard deviation

## Details

Kgcor compute kernel Gini distance correlation statistics for data. It is a self-contained R function dealing with both univariate and multivariate data. The sample size (number of rows) of the data must agree with the length of the label vector, and samples must not contain missing values. Arguments x, y are treated as data and labels.

Gini distance correlation are generalized to RKHS, \mathcal{H}_κ, as

\mathrm{gCor}_κ(X,Y) = \frac{∑_{k=1}^{K} p_k ≤ft[ 2 {E}d_κ(X_k,X) - {E}d_κ(X_k,{X_k}') - {E}d_κ(X,X')\right]}{{E}d_κ(X,X')}.\label{gCork}

In this case, we use the default Gaussian distance function

d_κ(x,x') = √{1-e^{-\frac{|x-x'|_q^2}{σ^2}}},

induced by a weighted Gaussian kernel, κ(x,x') = \frac{1}{2}e^{-\frac{|x-x'|_q^2}{σ^2}}.

## Value

KgCor returns the sample Kernel Gini distance correlation between x and y.

## References

Zhang, S., Dang, X., Nguyen, D. and Chen, Y. (2019). Estimating feature - label dependence using Gini distance statistics. IEEE Transactions on Pattern Analysis and Machine Intelligence (submitted), https://arXiv.org/pdf/1906.02171.pdf

## See Also

gCov gCor dCor

## Examples

 1 2 3  x<-iris[,1:4] y<-unclass(iris[,5]) KgCor(x, y, sigma=1) 

GiniDistance documentation built on June 28, 2019, 5:03 p.m.