thinning: Thinning the Original Data

Description Usage Arguments Value

View source: R/thinning.R

Description

Delete features in the original data with a majority of data missing.

Usage

1
thinning(data, col_preserve = 0.5, col_pct = 0.7)

Arguments

data

Original dataset

col_preserve

The maximum percentage of number of columns can be deleted due to massive missingness.

col_pct

Criterion for column deletion due to massive missingness. If missing percentage is larger than this threshold, delete the corresponding column.

Value

Returns a list of two elements. "data.new" Returns data after deleting non-informative columns. "misscol" Returns olumns deleted by the procedure.


SOCR/DataSifter documentation built on Dec. 11, 2021, 2:55 p.m.