filter_dataset: A function to filter dataset

Description Usage Arguments

View source: R/filter_dataset.R

Description

This function allows to filter the features. It first filter features with NA, after it removes features with too many zeros and the ones with high correlation with others.

Usage

1
filter_dataset(X, cor_th = 0.95, zero_th = 0.8)

Arguments

X

is the dataset matrix with samples on rows and features on columns.

cor_th

is the maximum accepted correlation between couple of features. Default value is 0.95

zero_th

is the maximum percentage of zeros accepted in a feature. Default value is 0.8


angy89/hyQSAR documentation built on Sept. 24, 2019, 7:31 a.m.