sfs: Sequential Forward Selection

Description Usage Arguments Details Value Author(s) References Examples

Description

Applies the Sequential Forward Selection algorithm for Feature Selection.

Usage

1
2
sfs(data, method = c("lda", "knn", "rpart"), kvec = 5,
 repet = 10)

Arguments

data

Dataset to be used for feature selection

method

Classifier to be used, currently only the lda, knn and rpart classifiers are supported

kvec

Number of neighbors to use for the knn classification

repet

Number of times to repeat the selection.

Details

The best subset of features, T, is initialized as the empty set and at each step the feature that gives the highest correct classification rate along with the features already in T, is added to set. The "best subset" of features is constructed based on the frequency with which each attribute is selected in the number of repetitions given. Due to the time complexity of the algorithm its use is not recommended for datasets with a large number of attributes(say more than 1000).

Value

bestsubset

subset of features that have been determined to be relevant.

Author(s)

Edgar Acuna

References

Acuna, E , (2003) A comparison of filters and wrappers for feature selection in supervised classification. Proceedings of the Interface 2003 Computing Science and Statistics. Vol 34.

Examples

1
2
3
#---- Sequential forward selection using the knn classifier----
data(iris)
sfs(iris,method="lda",repet=3)


Search within the dprep package
Search all R packages, documentation and source code

Questions? Problems? Suggestions? or email at ian@mutexlabs.com.

Please suggest features or report bugs with the GitHub issue tracker.

All documentation is copyright its authors; we didn't write any of that.