mark_outliers: Find univariate outliers in a sample of data
In ryan-a-schneider/legaldmlab: Legal Decision Making Lab Helper Functions for Common Tasks

mark_outliers

R Documentation

Find univariate outliers in a sample of data

Description

Search through the indicated column in a data set and mark all outliers with a new variable, coded 1 (outlier) or 0 (not an outlier). The formula used is from Rex Kline's Principles and Practice of Structural Equation Modeling, Fourth Edition (see details below).

Usage

mark_outliers(df, col, newCol_name)

Arguments

`df`	The supplied data set
`col`	The column to examine for outliers.

Details

"There is no single definition of 'extreme', but one heuristic is that scores more than three standard deviations beyond the mean may be outliers...but this method is susceptible to distortion by the very outliers that it is supposed to detect; that is, it is not robust...A more robust decision rule for detecting univariate outliers is:

\frac{|X|-Mdn}{1.483*MAD}>2.24

where *Mdn designates the sample median–which is more robust against outliers than the mean–and MAD is the Median Absolute Deviation (MAD) of all scores from the sample median. The quantity MAD does not estimate the population standard deviation, but the product of MAD and the scale factor 1.43 is an unbiased estimator of \sigma in a normal distribution. The value of the ratio in this equation is the distance between a score and the median expressed in robust standard deviation units. The constant 2.24 in this equation is the square root of the approximate 97.5th percentile in a central Chi-square distribution with a single degree of freedom. A potential outlier thus has a score on the ratio in this equation that exceeds 2.24." (Kline, 2016, p. 72).

ryan-a-schneider/legaldmlab documentation built on July 2, 2023, 5:02 a.m.

ryan-a-schneider/legaldmlab index

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

ryan-a-schneider/legaldmlab
Legal Decision Making Lab Helper Functions for Common Tasks

mark_outliers: Find univariate outliers in a sample of data
In ryan-a-schneider/legaldmlab: Legal Decision Making Lab Helper Functions for Common Tasks

Find univariate outliers in a sample of data

Description

Usage

Arguments

Details

Related to mark_outliers in ryan-a-schneider/legaldmlab...

R Package Documentation

Browse R Packages

We want your feedback!

ryan-a-schneider/legaldmlab Legal Decision Making Lab Helper Functions for Common Tasks

mark_outliers: Find univariate outliers in a sample of data In ryan-a-schneider/legaldmlab: Legal Decision Making Lab Helper Functions for Common Tasks

Find univariate outliers in a sample of data

Description

Usage

Arguments

Details

Related to mark_outliers in ryan-a-schneider/legaldmlab...

R Package Documentation

Browse R Packages

We want your feedback!

ryan-a-schneider/legaldmlab
Legal Decision Making Lab Helper Functions for Common Tasks

mark_outliers: Find univariate outliers in a sample of data
In ryan-a-schneider/legaldmlab: Legal Decision Making Lab Helper Functions for Common Tasks