Description Usage Arguments Value
Fit a GLM Ensemble model via [stats]glm
in parallel. Training datasets
are chosen to have equal number of each class; and, a single dataset is used to determine
prediction error (and ensemble weight) for each element in the ensemble. Each GLM does variable
selection via the [stats]step
function with non-verbose output.
1 2 3 4 |
df |
A |
dep_var |
A character string denoting the dependent variable in |
cols |
A vector of column indices corresponding the the variables you wish to regress on. This allows for variable (de)selection prior to model building. Defaults to using all columns. |
n |
An integer denoting the number of ensembles to build; defaults to |
level |
level of interest. If |
major_class_wt |
Controls the number of major class cases selected in each
partition as a multiple of the number of minority class observations. Defaults to |
seed |
An integer. Seed for reproducibility; defaults to |
test_pct |
A number in (0,1) specifying the size of the test dataset as a percentage. |
direction |
A character vector for the step process. |
family |
Used to specify the details of the glm methods. See |
leave_cores |
An integer for number of cores to leave unused. |
A list of with a matrix of coefficients from each ensemble element, the element weights, and the weighted coefficient estimates.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.