preproc2: preproc2

View source: R/util.R

preproc2R Documentation

preproc2

Description

Preprocesses a training and a test dataset

Usage

preproc2(
  X1,
  X2,
  remove.nan = TRUE,
  remove.const = TRUE,
  to.scale = TRUE,
  verbose = T,
  cluster = 0
)

Arguments

X1:

training dataset

X2:

test dataset

remove.nan:

TRUE/FALSE. It removes NaN

remove.const:

TRUE/FALSE. It removes constant columns.

to.scale:

TRUE/FALSE. It normalizes columns

cluster:

Number of feature clusters. If =0 it does nothing.

Details

preproc2

It performs hierarchical clustering to reduce the number of features

Value

A list with:

  • X1: preprocessed training dataset

  • X2: preprocessed training dataset

  • remcol: indices of remaining columns

  • groups: indices of clusters components

Author(s)

Gianluca Bontempi gbonte@ulb.ac.be

References

Handbook Statistical foundations of machine learning


gbonte/gbcode documentation built on Feb. 27, 2024, 7:38 a.m.