fast_outlier_id: Analyzes the values of a given column list in a given...
In UBC-MDS/redahelper: Makes your EDA analysis easier!

Description Usage Arguments Value Examples

View source: R/fast_outliers.R

Analyzes the values of a given column list in a given dataframe, identifies outliers using either the Z-Score algorithm or interquantile range algorithm. The return is a dataframe containing the following columns: column name, list containing the outlier's index position, percentaje of total counts considered outliers. Modifies an existing dataframe, with missing values imputed based on the chosen method.

fast_outlier_id(
  data,
  cols = "All",
  method = "z-score",
  threshold_low_freq = 0.05
)

`data`	dataframe - Dataframe to be analyzed
`cols`	list - List containing the columns to be analyzed.
`method`	string - string indicating which method to be used to identify outliers (methods available are: "Z score" or "Interquantile Range")
`threshold_low_freq`	double - Indicates the threshold for evaluating outliers in categorical columns.