View source: R/linreg-observations.R
identify_outliers | R Documentation |
A data point flagged as an outlier means that is has an extreme value in its response (y) variable. If this is the case, the data point(s) is/are influential, meaning that it has an outsized influence on a regression.
identify_outliers(object, id = NULL, .cutoff = 3)
## S3 method for class 'lm'
identify_outliers(object, id = NULL, .cutoff = 3)
object |
A model object (such as a fitted |
id |
(Optional) A vector of values, the same length as the number of observations, used as an identifier for each data point. If left as NULL, the row number will be added as the ID column. |
.cutoff |
(Optional) Used to determine which standard residuals are indicative of an outlier. The default is the rule-of-thumb 3 (see details). |
Outliers are defined as those data points that have a standardized residual value greater than some cutoff value. A traditional rule-of-thumb is for that cutoff value to be three.
A tibble.
Kutner, M., Nachtsheim, C., Neter, J. and Li, W. (2005). Applied Linear Statistical Models. ISBN: 0-07-238688-6. McGraw-Hill/Irwin.
library(tidytest)
#> `lm` Method
mod_lm_fit <- lm(mpg ~ disp + wt + hp, data = mtcars)
identify_outliers(mod_lm_fit)
identify_outliers(mod_lm_fit, id = rownames(mtcars))
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.