Probabilistic set distance
U | A matrix from which to detect outliers (rows), e.g. PC scores.
kNN | Number of nearest neighbors to use. Default is …
robMaha | Whether to use a robust Mahalanobis distance instead of the ordinary Euclidean distance. Default is …
ncores | Number of cores to use. Default is …
Kriegel, Hans-Peter, et al. "LoOP: local outlier probabilities." Proceedings of the 18th ACM conference on Information and knowledge management. ACM, 2009.
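To make the quantity in the cited LoOP paper concrete, here is a minimal base-R sketch of the probabilistic set distance of each point to its k nearest neighbors, pdist(o, S) = lambda * sqrt(mean of squared distances from o to the points in S), together with the resulting PLOF ratio. The function name `pdist_knn_sketch`, its arguments, and its return fields are hypothetical and chosen for illustration only; this is not the implementation of `prob_dist()`, it uses plain Euclidean distance, and it makes no claim to reproduce the `dist.self` and `dist.nn` values exactly.

```r
# Hypothetical helper (NOT part of bigutilsr): probabilistic set distance of
# each row of U to its k nearest neighbors, following the LoOP definitions.
# Assumes k < nrow(U) and no duplicated rows (to avoid zero distances).
pdist_knn_sketch <- function(U, k = 5, lambda = 1) {
  D <- as.matrix(dist(U))   # full pairwise Euclidean distances, O(n^2) memory
  diag(D) <- Inf            # a point is never its own neighbor
  n <- nrow(D)

  # n x k matrix of nearest-neighbor indices (rbind keeps the shape for k = 1)
  nn_idx <- do.call(rbind, lapply(seq_len(n), function(i) {
    order(D[i, ])[seq_len(k)]
  }))

  # pdist(o, S(o)) = lambda * sqrt(mean of squared distances to the neighbors)
  pd <- vapply(seq_len(n), function(i) {
    lambda * sqrt(mean(D[i, nn_idx[i, ]]^2))
  }, numeric(1))

  # Expected pdist over each point's own neighborhood
  pd_nn <- vapply(seq_len(n), function(i) mean(pd[nn_idx[i, ]]), numeric(1))

  # PLOF as defined in the LoOP paper: ratio minus one
  list(pdist = pd, pdist_nn = pd_nn, plof = pd / pd_nn - 1)
}

# Usage on toy data: 100 Gaussian points plus one point placed far away
set.seed(1)
U_toy <- rbind(matrix(rnorm(200), ncol = 2), c(20, 20))
res <- pdist_knn_sketch(U_toy, k = 5)
head(order(res$plof, decreasing = TRUE))  # rows ranked by PLOF, largest first
```

The full distance matrix keeps the sketch short but scales quadratically with the number of rows; for large matrices a dedicated nearest-neighbor search would be the more practical choice.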
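# Provides prob_dist() and tukey_mc_up(); svds() is assumed to be available
# after loading bigutilsr (otherwise call RSpectra::svds())
library(bigutilsr)

# Test data shipped with the package
X <- readRDS(system.file("testdata", "three-pops.rds", package = "bigutilsr"))
# Truncated SVD of the scaled matrix; the left singular vectors are used as scores
svd <- svds(scale(X), k = 10)
U <- svd$u

test <- prob_dist(U)
# Ratio of each point's own distance to that of its neighbors
plof <- test$dist.self / test$dist.nn
# Variant that takes the square root of the neighbor distance
plof_ish <- test$dist.self / sqrt(test$dist.nn)

# Flag rows above the threshold from tukey_mc_up(); flagged rows get col = 2
is_outlier <- plof_ish > tukey_mc_up(plof_ish)
plot(U[, 1:2], col = is_outlier + 1, pch = 20)
plot(U[, 3:4], col = is_outlier + 1, pch = 20)
plot(U[, 5:6], col = is_outlier + 1, pch = 20)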