cleanFeatures: Automated data cleaning
In XanderHorn/autoML: Automated machine learning


Cleans features in a dataset for machine learning purposes, using the edaFrame generated by exploreData. Cleaning involves imputing missing values, clipping outliers, and creating tracking features.
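The three cleaning steps named above can be sketched for a single numeric feature in base R. This is only an illustration, not the package's implementation: `clean_numeric_sketch` is a hypothetical helper, the Tukey 1.5 * IQR fences are an assumed outlier rule (the real method is whatever was chosen in exploreData), and imputing before clipping is an assumed ordering.

```r
# Hypothetical helper, not part of autoML: shows tracking, imputation and
# clipping on one numeric vector using base R only.
clean_numeric_sketch <- function(v) {
  # Tracking feature: 1 where the original value was missing, 0 otherwise
  was_missing <- as.integer(is.na(v))

  # Median imputation, with the median taken over the observed values
  v[is.na(v)] <- median(v, na.rm = TRUE)

  # Assumed outlier rule: Tukey fences at 1.5 * IQR beyond the quartiles
  q <- quantile(v, probs = c(0.25, 0.75), names = FALSE)
  fence <- 1.5 * (q[2] - q[1])
  lower <- q[1] - fence
  upper <- q[2] + fence

  # Tracking feature: 1 where the value lies outside the fences
  was_outlier <- as.integer(v < lower | v > upper)

  # Clip values to the fences
  v <- pmin(pmax(v, lower), upper)

  data.frame(value = v, was_missing = was_missing, was_outlier = was_outlier)
}

income <- c(10, 12, 11, NA, 13, 12, 500)
cleaned <- clean_numeric_sketch(income)
print(cleaned)
```

Keeping the two binary columns alongside the cleaned value lets a model still see that a row was originally missing or extreme, which is the purpose the description gives for tracking features.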


cleanFeatures(x, feats, edaFrame, trackingFeats = TRUE,
  clipOutliers = TRUE, imputeMissing = TRUE, progress = FALSE,
  autoCode = TRUE)

`x`	[data.frame \| Required] Data.frame containing numeric features to transform
`feats`	[character vector \| Required] Character vector of features to clean
`edaFrame`	[data.frame \| Required] Data.frame object returned by exploreData function
`trackingFeats`	[logical \| Optional] Should tracking features be created? Tracking features are binary flags that record the state of the data before cleaning changes are applied, which is useful for tree-based models
`clipOutliers`	[logical \| Optional] Should outliers be clipped using the method specified in the exploreData function?
`imputeMissing`	[logical \| Optional] Should missing values be imputed, using the median for numeric features and the mode for categorical features?
`progress`	[logical \| Optional] Display progress
`autoCode`	[logical \| Optional] Should code be generated when running the function
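As a rough illustration of the mode imputation that `imputeMissing` describes for categorical features, the sketch below fills missing values with the most frequent level and keeps a tracking flag. `impute_mode_sketch` is a hypothetical helper written for this example, and its tie-breaking (the first level in alphabetical order wins) is an assumption rather than documented package behavior.

```r
# Hypothetical helper, not part of autoML: mode imputation for one
# character vector, with a tracking flag for the imputed positions.
impute_mode_sketch <- function(v) {
  # Tracking feature: 1 where the original value was missing
  was_missing <- as.integer(is.na(v))

  # table() ignores NA by default and orders levels alphabetically
  counts <- table(v)

  # which.max() returns the first maximum, so ties go to the first level
  mode_value <- names(counts)[which.max(counts)]

  v[is.na(v)] <- mode_value
  data.frame(value = v, was_missing = was_missing, stringsAsFactors = FALSE)
}

colour <- c("red", "blue", NA, "red", "green", NA)
imputed <- impute_mode_sketch(colour)
print(imputed)
```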