subsamp: Subsample Winner Algorithm for Variable Selection in Linear Regression with a Large Number of Variables

The subsample winner algorithm (SWA) for regression on large-p data (X, Y) selects the important variables (or features) among the p features X for explaining the response Y. The SWA first applies a base procedure, here a linear regression, to each of the subsamples randomly drawn from the p variables, and then computes a score for every feature, i.e., each of the p variables, according to how that feature performed in the subsample analyses. It then obtains the 'semifinalists' among the features based on the resulting scores and determines the 'finalists', i.e., the important features, from the 'semifinalists'. See Fan, Sun and Qiao (2017).
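
The three stages described above (subsample regressions, scoring, semifinalists to finalists) can be sketched in base R. This is a rough illustration under stated assumptions, not the subsamp package's implementation or API: the function name swa_sketch, the arguments s (subsample size), m (number of subsamples), q (number of semifinalists) and alpha, the scoring rule (mean absolute t-statistic) and the finalist rule (significance in a final regression) are all hypothetical choices made for this example. It also assumes more observations than s and q, and no exactly collinear columns.

```r
# Illustrative sketch only; NOT the subsamp package's code.
# All names and the scoring/finalist rules are assumptions for this example.

# Fit y on the chosen columns of X and return the coefficient table
# without the intercept row.
fit_subset <- function(X, y, idx) {
  d <- as.data.frame(X[, idx, drop = FALSE])
  d$y <- y
  coef(summary(lm(y ~ ., data = d)))[-1, , drop = FALSE]
}

swa_sketch <- function(X, y, s = 5, m = 200, q = 8, alpha = 0.05) {
  p <- ncol(X)
  score_sum <- numeric(p)  # accumulated |t| per feature
  n_drawn   <- integer(p)  # number of subsamples containing each feature

  for (b in seq_len(m)) {
    idx <- sample.int(p, s)                    # random subsample of s features
    tval <- abs(fit_subset(X, y, idx)[, "t value"])
    score_sum[idx] <- score_sum[idx] + tval
    n_drawn[idx]   <- n_drawn[idx] + 1L
  }

  # Assumed scoring rule: mean |t| over the subsamples a feature appeared in.
  score <- ifelse(n_drawn > 0, score_sum / pmax(n_drawn, 1L), 0)

  # Semifinalists: the q highest-scoring features.
  semifinalists <- order(score, decreasing = TRUE)[seq_len(q)]

  # Finalists (assumed rule): semifinalists significant at level alpha
  # in one regression on all semifinalists.
  pval <- fit_subset(X, y, semifinalists)[, "Pr(>|t|)"]
  finalists <- sort(semifinalists[pval < alpha])

  list(score = score, semifinalists = semifinalists, finalists = finalists)
}

# Simulated data: only features 1 and 2 carry signal.
set.seed(1)
n <- 100
p <- 50
X <- matrix(rnorm(n * p), n, p)
y <- 3 * X[, 1] - 2 * X[, 2] + rnorm(n)

res <- swa_sketch(X, y)
print(res$finalists)
```

In this simulated setting the two signal features should appear among the finalists, although an occasional noise feature can pass the final significance filter as well.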

Package details

Author: Yiying Fan [aut, cre], Jiayang Sun [aut], Xingye Qiao [aut]
Maintainer: Yiying Fan
License: GPL-2 | GPL-3
Package repository: CRAN
Installation: Install the latest version of this package by entering the following in R:
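
Assuming the package is still available from CRAN under the name subsamp, the standard call would be:

```r
# Standard CRAN installation; assumes a working internet connection
# and that the package has not been archived.
install.packages("subsamp")
```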

subsamp documentation built on May 1, 2019, 6:47 p.m.