pipe_remove_high_correlation_features: Removes highly correlated features whilst keeping as many as...

Description Usage Arguments Value

View source: R/correlation_removal.R

Description

Removes highly correlated features whilst keeping as many as possible, using heuristics.

Usage

1
2
pipe_remove_high_correlation_features(train,
  exclude_columns = character(0), threshold = 0.8)

Arguments

train

The train dataset, as a data.frame or data.table. Data.tables may be changed by reference.

exclude_columns

Columns that should not be considered for removal.

threshold

If the correlation is between two columns is larger than this, both will be considered for removal

Value

A list containing the transformed train dataset and a trained pipe.


jeroenvdhoven/datapiper documentation built on July 14, 2019, 9:34 p.m.