preprocess: Preprocess Data
In egenn/rtemis: Machine Learning and Visualization

preprocess

R Documentation

Preprocess Data

Description

Preprocess data for analysis and visualization.

Usage

## S7 generic
preprocess(x, parameters, ...)

Arguments

`x`	data.frame or similar: Data to be preprocessed.
`parameters`	PreprocessorParameters or Preprocessor: Use a PreprocessorParameters object, created with setup_Preprocessor, when preprocessing training set data. Use a Preprocessor object when preprocessing validation and test set data.
`...`	Used to pass `dat_validation` and `dat_test` to the method for Preprocessor.

Details

Two methods are provided. The method for preprocessing training set data accepts a PreprocessorParameters object. The method for preprocessing validation and test set data accepts a Preprocessor object.
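
The split between the two methods follows the usual pattern of learning preprocessing values from training data and reusing them on held-out data. The base R sketch below illustrates that pattern only. It does not call rtemis and is not the package's implementation; `fit_scaler` and `apply_scaler` are hypothetical helpers introduced for illustration.

```r
# Hypothetical helpers, not part of rtemis.
# Learn centering and scaling values from the training set only.
fit_scaler <- function(train) {
  num <- vapply(train, is.numeric, logical(1))
  list(
    columns = names(train)[num],
    centers = vapply(train[num], mean, numeric(1), na.rm = TRUE),
    scales  = vapply(train[num], sd, numeric(1), na.rm = TRUE)
  )
}

# Apply previously learned values to any data set with the same columns.
apply_scaler <- function(fitted, newdata) {
  for (col in fitted$columns) {
    newdata[[col]] <- (newdata[[col]] - fitted$centers[[col]]) / fitted$scales[[col]]
  }
  newdata
}

dat_train <- data.frame(a = c(1, 2, 3, 4), b = c(10, 20, 30, 40))
dat_test  <- data.frame(a = c(5, 6), b = c(50, 60))

fitted       <- fit_scaler(dat_train)          # learned from training data
train_scaled <- apply_scaler(fitted, dat_train)
test_scaled  <- apply_scaler(fitted, dat_test) # reuses training centers and scales
```

Because the test set is transformed with the training set's centers and scales, no information from the test set leaks into the preprocessing step.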

Order of operations:

1. keep complete cases only
2. remove constants
3. remove duplicates
4. remove cases by missingness threshold
5. remove features by missingness threshold
6. integer to factor
7. integer to numeric
8. logical to factor
9. logical to numeric
10. numeric to factor
11. cut numeric to n bins
12. cut numeric to n quantiles
13. numeric with fewer than N unique values to factor
14. character to factor
15. factor NA to named level
16. add missingness column
17. impute
18. scale and/or center
19. one-hot encoding
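
To make the ordering concrete, the sketch below applies a few of the listed operations, in the same relative order, using base R only. It is an illustration under simplifying assumptions (mean imputation, a single numeric column) and does not reproduce how rtemis implements these steps; the data and column names are made up.

```r
# Toy data: one constant column, one duplicated row, one missing value.
dat <- data.frame(
  id_const = c(1, 1, 1, 1, 1),
  x = c(2, 4, NA, 8, 2),
  g = c("a", "b", "a", "b", "a"),
  stringsAsFactors = FALSE
)

# Remove constants: drop columns with a single unique value.
is_constant <- vapply(dat, function(col) length(unique(col)) <= 1, logical(1))
dat <- dat[, !is_constant, drop = FALSE]

# Remove duplicates: drop repeated rows.
dat <- dat[!duplicated(dat), , drop = FALSE]

# Character to factor.
is_chr <- vapply(dat, is.character, logical(1))
dat[is_chr] <- lapply(dat[is_chr], factor)

# Impute: replace missing numeric values with the column mean (an assumption).
dat$x[is.na(dat$x)] <- mean(dat$x, na.rm = TRUE)

# Scale and center.
dat$x <- as.numeric(scale(dat$x))
```

The order affects the result: removing the duplicated row before imputing changes the mean used to fill the missing value, and imputing before scaling ensures the scaled column contains no missing values.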

Value

Preprocessor object.

Author(s)

EDG

egenn/rtemis documentation built on June 14, 2025, 11:54 p.m.
