Calculates a robust and asymptotically fully efficient correlation matrix, see Gervini (2003) for details.
1 |
data |
matrix with the observations in rows. |
boundq |
quantile bound for comparison of theoretical and empirical distribution function, see details. |
startestimator |
start estimator to be used: |
This implementation calculates the (asymptotically) fully efficient scatter estimator proposed by Gervini (2003). Based on a an initial scatter estimator which can be determined by startestimator
, residuals and their cumulative distribution function are calculated. This empirical distribution is compared with the theoretical one (a chi square distribution). Only if the empirical distribution function lays under the theoretic one, from the boundq
quantile onwards, observations are identified as outliers, the number depends on the distance between the distribution functions. Based on all observations which are not marked as outliers, the usual correlation is returned.
This procedure has asymptotically the same efficiency as the usual empirical correlation and retains the breakdown point of the initial scale estimator.
Numeric correlation matrix.
Alexander Dürre
Gervini, D. (2003): A robust and efficient adaptive reweighted estimator of multivariate location and scatter, Journal of multivariate analysis, vol 84, 116–144, doi: 10.1016/S0047-259X(02)00018-0.
1 2 3 4 5 |
Questions? Problems? Suggestions? Tweet to @rdrrHQ or email at ian@mutexlabs.com.
All documentation is copyright its authors; we didn't write any of that.