zscore | R Documentation |
This function calculates classical or robust z-score (standardization) for a numeric vector.
zscore(x, cutoff = 3, robust = FALSE, drop.na = FALSE)
x |
A numeric vector. |
cutoff |
A numeric value indicating the threshold above which data points are identified and flagged as potential outliers. By default, |
robust |
A logical value indicating whether to calculate classical or robust z-score. If |
drop.na |
A logical value indicating whether to remove missing values ( |
Z-scores are useful for comparing data points from different distributions because they are dimensionless and standardized. A positive z-score indicates that the data point is above the mean (or the median in the robust approach), while a negative z-score indicates that the data point is below the mean (or the median). One common rule to detect outliers using z-scores is the "three-sigma rule", in which data points with an absolute z-score greater than 3 (|z| > 3) can be considered potential outliers (default), as they fall outside the range that covers 99.7% of the data points in a normal distribution. (Note that a cutoff of |z| > 2.5 is also often used).
A tibble with two columns:
data
: The original numeric values.
score
: The calculated z-scores.
flag
: TRUE
if the corresponding data point is flagged as a potential outlier, and FALSE
otherwise.
Christian L. Goueguel
Rousseeuw, P. J., and Croux, C. (1993). Alternatives to the median absolute deviation. Journal of the American Statistical Association, 88(424), 1273-1283.
Rousseeuw, P. J., and Hubert, M. (2011). Robust statistics for outlier detection. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 1(1), 73-79.
Donoho, D., (1982). Breakdown properties of multivariate location estimators. Ph.D. Qualifying paper, Dept. Statistics, Harvard University, Boston.
Stahel, W., (1981). Robuste Schätzungen: infinitesimale Optimalität und Schätzungen vonKovarianzmatrizen. PhD thesis, ETH Zürich.
x <- c(1:5, 100)
# Non-robust approach
zscore(x)
# Robust approach
zscore(x, robust = TRUE)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.