pcout: PCOut Method for Outlier Identification in High Dimensions
In mvoutlier: Multivariate Outlier Detection Based on Robust Methods

Description Usage Arguments Details Value Author(s) References See Also Examples

Fast algorithm for identifying multivariate outliers in high-dimensional and/or large datasets, using the algorithm of Filzmoser, Maronna, and Werner (CSDA, 2007).

1 2	pcout(x, makeplot = FALSE, explvar = 0.99, crit.M1 = 1/3, crit.c1 = 2.5, crit.M2 = 1/4, crit.c2 = 0.99, cs = 0.25, outbound = 0.25, ...)

`x`	a numeric matrix or data frame which provides the data for outlier detection
`makeplot`	a logical value indicating whether a diagnostic plot should be generated (default to FALSE)
`explvar`	a numeric value between 0 and 1 indicating how much variance should be covered by the robust PCs (default to 0.99)
`crit.M1`	a numeric value between 0 and 1 indicating the quantile to be used as lower boundary for location outlier detection (default to 1/3)
`crit.c1`	a positive numeric value used for determining the upper boundary for location outlier detection (default to 2.5)
`crit.M2`	a numeric value between 0 and 1 indicating the quantile to be used as lower boundary for scatter outlier detection (default to 1/4)
`crit.c2`	a numeric value between 0 and 1 indicating the quantile to be used as upper boundary for scatter outlier detection (default to 0.99)
`cs`	a numeric value indicating the scaling constant for combined location and scatter weights (default to 0.25)
`outbound`	a numeric value between 0 and 1 indicating the outlier boundary for defining values as final outliers (default to 0.25)
`...`	additional plot parameters, see help(par)

Based on the robustly sphered data, semi-robust principal components are computed which are needed for determining distances for each observation. Separate weights for location and scatter outliers are computed based on these distances. The combined weights are used for outlier identification.

`wfinal01`	0/1 vector with final weights for each observation; weight 0 indicates potential multivariate outliers.
`wfinal`	numeric vector with final weights for each observation; small values indicate potential multivariate outliers.
`wloc`	numeric vector with weights for each observation; small values indicate potential location outliers.
`wscat`	numeric vector with weights for each observation; small values indicate potential scatter outliers.
`x.dist1`	numeric vector with distances for location outlier detection.
`x.dist2`	numeric vector with distances for scatter outlier detection.
`M1`	upper boundary for assigning weight 1 in location outlier detection.
`const1`	lower boundary for assigning weight 0 in location outlier detection.
`M2`	upper boundary for assigning weight 1 in scatter outlier detection.
`const2`	lower boundary for assigning weight 0 in scatter outlier detection.

Peter Filzmoser <P.Filzmoser@tuwien.ac.at> http://cstat.tuwien.ac.at/filz/

P. Filzmoser, R. Maronna, M. Werner. Outlier identification in high dimensions, Computational Statistics and Data Analysis, 52, 1694-1711, 2008.

sign1, sign2

# geochemical data from northern Europe
data(bsstop)
x=bsstop[,5:14]
# identify multivariate outliers
x.out=pcout(x,makeplot=FALSE)
# visualize multivariate outliers in the map
op <- par(mfrow=c(1,2))
data(bss.background)
pbb(asp=1)
points(bsstop$XCOO,bsstop$YCOO,pch=16,col=x.out$wfinal01+2)
title("Outlier detection based on pcout")
legend("topleft",legend=c("potential outliers","regular observations"),pch=16,col=c(2,3))

# compare with outlier detection based on MCD:
x.mcd <- robustbase::covMcd(x)
pbb(asp=1)
points(bsstop$XCOO,bsstop$YCOO,pch=16,col=x.mcd$mcd.wt+2)
title("Outlier detection based on MCD")
legend("topleft",legend=c("potential outliers","regular observations"),pch=16,col=c(2,3))
par(op)

Loading required package: sgeostat
sROC 0.1-2 loaded

mvoutlier documentation built on July 30, 2021, 9:09 a.m.

mvoutlier index

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

mvoutlier
Multivariate Outlier Detection Based on Robust Methods

pcout: PCOut Method for Outlier Identification in High Dimensions
In mvoutlier: Multivariate Outlier Detection Based on Robust Methods

Description

Usage

Arguments

Details

Value

Author(s)

References

See Also

Examples

Example output

Related to pcout in mvoutlier...

R Package Documentation

Browse R Packages

We want your feedback!

mvoutlier Multivariate Outlier Detection Based on Robust Methods

pcout: PCOut Method for Outlier Identification in High Dimensions In mvoutlier: Multivariate Outlier Detection Based on Robust Methods

Description

Usage

Arguments

Details

Value

Author(s)

References

See Also

Examples

Example output

Related to pcout in mvoutlier...

R Package Documentation

Browse R Packages

We want your feedback!

mvoutlier
Multivariate Outlier Detection Based on Robust Methods

pcout: PCOut Method for Outlier Identification in High Dimensions
In mvoutlier: Multivariate Outlier Detection Based on Robust Methods