preprocessing: Conduct basic preprocessing processes

View source: R/preprocessing.R

preprocessingR Documentation

Conduct basic preprocessing processes

Description

Conduct basic preprocessing processes

Usage

preprocessing(
  data,
  y = NULL,
  time = NULL,
  status = NULL,
  type,
  verbose = FALSE
)

Arguments

data

A data source, that is one the of major R formats: data.table, data.frame, matrix, and so on.

y

A string that indicates a target column name for regression or classification. Either y, or pair: time, status can be used. By default NULL.

time

A string that indicates a time column name for survival analysis task. Either y, or pair: time, status can be used. By default NULL.

status

A string that indicates a status column name for survival analysis task. Either y, or pair: time, status can be used. By default NULL.

type

A string that determines if Machine Learning task is the 'binary_clf', 'regression', 'survival', or 'multiclass' task.

verbose

A logical value, if set to TRUE, provides all information about the process, if FALSE gives none.

Value

A list containing three objects:

  • `data` A preprocessed dataset,

  • `rm_colnames` The names of removed columns,

  • `bin_labels` The text labels before target binarization.


ModelOriented/forester documentation built on June 6, 2024, 7:29 a.m.