multivariance-package: multivariance: Measuring Multivariate Dependence Using...

Description Details Multivariance Functions to use and interpret multivariance Dependence structures Examples References

Description

The multivariance package provides basic functions to calculate distance multivariance and related quantities. To test independence use multivariance.test, it provides an interface (via its arguments) to all the tests based on distance (m-/total-)multivariance. The package offers also several other functions related to distance multivariance, e.g. a detection and visualization of dependence structures dependence.structure. See below for details on the full content of the package.

Details

Distance multivariance is a multivariate dependence measure, which can be used to detect dependencies between an arbitrary number of random vectors each of which can have a distinct dimension. The necessary functions are implemented in this package, and examples are given. For the theoretic background we refer to the papers [1,2,3,4,5,6]. Paper [3] includes a summary of the first two. It is the recommended starting point for users with an applied interest. Paper [4] is concerned with new (faster) p-value estimates for the independence tests, [5] introduces the copula versions of distance multivariance, [6] discusses the quantification of dependence using distance multicorrelations.

The (current) code is speed improved in comparison to the former releases. Certainly there is still room for improvement and development. Questions, comments and remarks are welcome: bjoern.boettcher@tu-dresden.de

For infos on the latest changes and/or updates to the package use news(package="multivariance").

To cite this package use the standard citation for R packages, i.e., the output of citation("multivariance").

Multivariance

multivariance computes the distance multivariance

total.multivariance computes the total distance multivariance

m.multivariance computes the m-multivariance (introduced in [3])

It might be convenient to compute these simultaneously using multivariances.all.

copula.multivariance computes the copula versions of the above (introduced in [5])

multicorrelation computes the multicorrelations (discussed specifically in [6])

Functions to use and interpret multivariance

rejection.level computes a (conservative) rejection level for a given significance level. This can be used for a conservative interpretation of distance multivariance. The counterpart is multivariance.pvalue, which computes a conservative p-value for a given distance multivariance. Both methods are distribution-free.

resample.rejection.level and resample.pvalue are the distribution dependent versions of the above. They are approximately sharp, but computational more expensive. Any resampling is done by resample.multivariance.

Using the methods developed in [4] approximate p-value estimates are provided by pearson.pvalue. This method is much faster than the resampling method.

multivariance.test provides the corresponding tests of independence. The former provides output as common for tests in R.

cdm and cdms compute the doubly centered distance matrix and matrices, respectively. These can be used to speed up repeated computations of distance multivariance.

In [4] various methods to estimate the moments of the test statistic under H0 were developed, these are (implicitly) implemented in this package only for the moments used in pearson.pvalue. Further and explicit functions can be added upon request. Please feel free to contact the author.

emp.transf computes the Monte Carlo empirical transform of the data. This data yields the copula version of distance multivariance. Hereto note, that values become randomized due to the "Monte Carlo empirical transform", i.e., the copula versions yield in a finite sample setting not identical values for repeated runs.

For planing of large projects or studies it might be convenient to estimate the computation time of multivariance via multivariance.timing.

Dependence structures

dependence.structure performs the dependence structure detection algorithm as described in [3].

find.cluster is the basic building block of dependence.structure. It is recommended to use dependence.structure.

Examples

coins and tetrahedron generate samples of pairwise independent random variables, with dependence of higher order.

dep_struct_iterated_13_100, dep_struct_ring_15_100, dep_struct_several_26_100 and dep_struct_star_9_100 are example data sets for the dependence structure detection. These might also serve as benchmark examples.

anscombe.extended provides an extension of Anscombe's Quartett. It illustrates that a large value of Pearson's correlation can occur for very different dependencies and that this is not a small-sample problem. These dependencies are at least partly differentiated by values of distance multicorrelation.

References

[1] B. Böttcher, M. Keller-Ressel, R.L. Schilling, Detecting independence of random vectors: generalized distance covariance and Gaussian covariance. Modern Stochastics: Theory and Applications, Vol. 5, No. 3(2018) 353-383. https://www.vmsta.org/journal/VMSTA/article/127/info

[2] B. Böttcher, M. Keller-Ressel, R.L. Schilling, Distance multivariance: New dependence measures for random vectors. The Annals of Statistics, Vol. 47, No. 5 (2019) 2757-2789. doi: 10.1214/18-AOS1764

[3] B. Böttcher, Dependence and Dependence Structures: Estimation and Visualization using the Unifying Concept of Distance Multivariance. Open Statistics, Vol. 1, No. 1 (2020) 1-46. doi: 10.1515/stat-2020-0001

[4] G. Berschneider, B. Böttcher, On complex Gaussian random fields, Gaussian quadratic forms and sample distance multivariance. Preprint. https://arxiv.org/abs/1808.07280

[5] B. Böttcher, Copula versions of distance multivariance and dHSIC via the distributional transform – a general approach to construct invariant dependence measures. Statistics, (2020) 1-18. doi: 10.1080/02331888.2020.1748029

[6] B. Böttcher, Notes on the interpretation of dependence measures – Pearson's correlation, distance correlation, distance multicorrelations and their copula versions. Preprint. https://arxiv.org/abs/2004.07649


multivariance documentation built on Oct. 6, 2021, 5:08 p.m.