Lasso (least absolute shrinkage and selection operator) estimation is performed and evaluated for different choices of the tuning parameter. To address the uncertainty of tuning parameter selection, a weighted average of these estimators is calculated. The weight vector is chosen such that a k-fold cross-validation criterion is minimized.
X |
A dataframe or matrix containing the data to be analyzed. |
ycol |
An integer or string specifying the column of the outcome variable. The outcome for |
kfold |
An integer specifying the |
B.var |
An integer specifying the number of bootstrap replications to be used to estimate the standard error of the Lasso estimator. |
calc.variance |
A logical value specifying whether the standard error of the estimates should be estimated at all (by means of bootstrapping). See also details below. |
factor.variables |
A (vector of) string(s) specifying which variables should be treated as factors, i.e. recoded into dummy variables. Factor variables will automatically be recoded if not specified here. |
glm.family |
A character vector specifying one of the following families: |
tries |
An integer for the number of tries in case |
standardize |
A logical value specifying whether the covariate data should be standardized. |
random |
A logical value specifying whether creation of datasets for cross validation should be random or not. |
pd |
A logical value specifying whether messages should be printed or not. |
... |
Other arguments to be passed, i.e. to |
Note that the candidate tuning parameters are selected automatically (by cv.glmnet). The bootstrap standard error for the Lasso does not assume a fixed tuning parameter, i.e. tuning parameter selection is done separately in each bootstrap sample. Lasso averaging uses the standard errors associated with each tuning parameter, but also takes the variance between the different weighted estimates into account. The importance measure based on the averaging weights can be interpreted as the importance of variables with respect to their predictive ability.
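The weighted average and the cross-validation criterion described above can be written out as follows. This is a sketch only: the notation (L candidate tuning parameters, weights w, cross-validated predictions written with a tilde) is introduced here for illustration, and the squared-error form of the criterion is an assumption that matches a Gaussian outcome; the criterion actually used may differ for other families.

```latex
% Lasso averaging estimator: weighted average over L candidate tuning parameters
\hat{\beta}_{\mathrm{LAE}} = \sum_{i=1}^{L} w_i \, \hat{\beta}(\lambda_i),
\qquad w_i \ge 0, \quad \sum_{i=1}^{L} w_i = 1

% Weights chosen to minimize a k-fold cross-validation criterion
% (squared-error loss assumed here for illustration)
\hat{w} = \arg\min_{w} \; \frac{1}{n} \sum_{j=1}^{n}
\Bigl( y_j - \sum_{i=1}^{L} w_i \, \tilde{\mu}_j(\lambda_i) \Bigr)^{2}
```

Here \tilde{\mu}_j(\lambda_i) denotes the prediction for observation j from the Lasso fit with tuning parameter \lambda_i, estimated on the folds that do not contain observation j. Putting all weight on a single \lambda_i recovers ordinary Lasso selection by cross-validation.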
Returns an object of class ‘lae’:
coefficients |
A matrix of coefficients and standard errors for Lasso averaging, Lasso selection, and OLS estimation. |
variable.importance |
A matrix containing the relative importance of each variable based on model averaging weights. |
sae.weights |
A vector containing the weights used for Lasso averaging. |
sel.weights |
A vector indicating the complexity parameter that was chosen for Lasso estimation based on k-fold cross validation. |
complexity.parameter |
A vector of the actual complexity parameter values used as candidate values for Lasso Averaging Estimation. |
setup |
A list of length two containing the data matrix and model family. |
Michael Schomaker
Schomaker, M. (2012) Shrinkage Averaging Estimation, Statistical Papers, 53:1015-1034
plot.lae for visualizing the estimation results.