The package carries out variable selection in binary Heckman selection model and use bootstrap validation technique to estimate optimisim in the metrics for predictive accuracy. AUROC (Area under the receiver operating curve), AUCROC (Area under the Receiver Operating Characteristic curve), AUPRC (Area under the precision-recall curve), BS (Brier Score), ECE (Expected Calibration Error) and MCE (Maximum Calibration Error) are implemented. Lasso and Adaptive Lasso are implemented for variable selection. Normal error and AMH (Ali-Mikhail-Haq) copula errors are implemented. We implemented the bootstrap approach for models developed via variable selection using P-values in the functions "HeckPval" and "bootValidate_Pval".
We also implemented Probit Lasso regression in our "ProbitLasso"" function. It is similar to the GLMNET package as they both implemented the coordinate descent algorithm. The main addition is that the model can be validated using bootstrap validation method via the function "boot_ProbitLasso".
Functions implemented (use help (e.g. ?HeckSelect) to read more about the functions)
Function for binary outcome with sample selection and variable selection. Adaptive Lasso and Lasso are implemented. Normal error and AMH copula (with probit marginals) based approach is implemented at the moment.
Bootstrap internal validation technique to correct for overoptimism in predictions - "mboot" is the number of bootstrap samples. The function takes the object created by HeckSelect and use non-parametric bootstrap method to compute optimism corrected predictive accuracy measures.
This function is based on the use of P-value to select variables in Binary Heckman selection model. Default P-value = 0.05.
Bootstrap internal validation technique to correct for overoptimism in predictions - the alpha value is inherited from the object HeckPval. Note that this is different from the "bootValidate" as this is based on dropping variables whose values are greater than the alpha value from the model. If no variable selection is required, please set alpha =1 in HeckPval object.
The package also contain functions for regularized probit regression. The results are similar to GLMNET package as they both implemented the coordinate descent algorithm.
This is probit regression. The missing data is delected to fit the model to complete data
Bootstrap internal validation technique to correct for overoptimism in predictions - mboot is the number of bootstrap samples. The function takes the object created by ProbitLasso.
library(HeckmanSelect)
data()
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.