glm_sab2 | R Documentation |
Program for fitting a GLM equipped with the yet-unpublished version 2 of the
'sensible adaptive bayes' prior. Version 2 refers to \beta^o + \bm P
\beta^a \sim N(\omega m_\alpha, \eta \omega^2 {\bm S}_\alpha / \phi)
(contrast to
Version 1, the original SAB from Boonstra and Barbaro, which is implemented
in glm_sab()
and which uses \beta^o + \bm P \beta^a \sim
N(m_\alpha, \eta {\bm S}_\alpha / \phi)
)
glm_sab2(
y,
x_standardized,
family = "binomial",
alpha_prior_mean,
alpha_prior_cov,
aug_projection,
phi_dist = "trunc_norm",
phi_mean = 1,
phi_sd = 0.25,
eta_param = Inf,
omega_mean = 0,
omega_sd = 0.15,
omega_sq_in_variance = TRUE,
beta_orig_scale,
beta_aug_scale,
local_dof = 1,
global_dof = 1,
slab_dof = Inf,
slab_scale = 15,
mu_sd = 5,
only_prior = F,
mc_warmup = 1000,
mc_iter_after_warmup = 1000,
mc_chains = 1,
mc_thin = 1,
mc_stepsize = 0.1,
mc_adapt_delta = 0.9,
mc_max_treedepth = 15,
return_as_CmdStanMCMC = FALSE,
eigendecomp_hist_var = NULL,
scale_to_variance225 = NULL,
seed = sample.int(.Machine$integer.max, 1),
slab_precision = NULL
)
y |
(vector) outcomes corresponding to the type of glm desired. This should match whatever datatype is expected by the stan program. |
x_standardized |
(matrix) matrix of numeric values with number of rows equal to the length of y and number of columns equal to p+q. It is assumed without verification that each column is standardized to whatever scale the prior expects - in Boonstra and Barbaro, all predictors are marginally generated to have mean zero and unit variance, so no standardization is conducted. In practice, all data should be standardized to have a common scale before model fitting. If regression coefficients on the natural scale are desired, they can be easily obtained through unstandardizing. |
family |
(character) Similar to argument in |
alpha_prior_mean |
(vector) p-length vector giving the mean of alpha from the historical analysis, corresponds to m_alpha in Boonstra and Barbaro |
alpha_prior_cov |
(matrix) pxp positive definite matrix giving the variance of alpha from the historical analysis, corresponds to S_alpha in Boonstra and Barbaro |
aug_projection |
(matrix) pxq matrix that approximately projects the regression coefficients of the augmented predictors onto the space of the regression coefficients for the original predictors.This is the matrix P in the notation of Boonstra and Barbaro. It can be calculated using the function 'create_projection' |
phi_dist |
(character) the name of the distribution to use as a prior on phi. This must be either 'trunc_norm' or 'beta'. |
phi_mean |
see |
phi_sd |
(real) prior mean and standard deviation of phi. At a minimum, phi_mean must be between 0 and 1 (inclusive) and phi_sd must be non-negative (you can choose phi_sd = 0, meaning that phi is identically equal to phi_mean). If 'phi_dist' is 'trunc_norm', then 'phi_mean' and 'phi_sd' are interpreted as the parameters of the untruncated normal distribution and so are not actually the parameters of the resulting distribution after truncating phi to the 0,1 interval. If 'phi_dist' is 'beta', then 'phi_mean' and 'phi_sd' are interpreted as the literal mean and standard deviation, from which the shape parameters are calculated. When 'phi_dist' is 'beta', not all choices of 'phi_mean' and 'phi_sd' are valid, e.g. the standard deviation of the beta distribution must be no greater than sqrt(phi_mean * (1 - phi_mean)). Also, the beta distribution is difficult to sample from if one or both of the shape parameters is much less than 1. An error will be thrown if an invalid parameterization is provided, and a warning will be thrown if a parameterization is provided that is likely to result in a "challenging" prior. |
eta_param |
(real) prior hyperparmeter for eta, which scales the
|
omega_mean |
see |
omega_sd |
(real) prior mean and standard deviation for omega, which is
apriori distributed as a log-normal random variable. The log-normal is
parametrized such that these are the mean and normal of the log of the
random variable, not the random variable itself. If the link function is
the identity function, i.e. |
omega_sq_in_variance |
(logical) should omega^2 additionally scale the
prior variance? If |
beta_orig_scale |
see |
beta_aug_scale |
(pos. real) constants indicating the prior scale of the
horseshoe. Both values correspond to 'c / sigma' in the notation of
Boonstra and Barbaro, because that paper never considers
beta_orig_scale!=beta_aug_scale. Use the function
|
local_dof |
(pos. integer) number indicating the degrees of freedom for lambda_j. Boonstra and Barbaro always used local_dof = 1. Choose a negative value to tell the function that there are no local hyperparameters. |
global_dof |
(pos. integer) number indicating the degrees of freedom for tau. Boonstra and Barbaro always used global_dof = 1. Choose a negative value to tell the function that there is no global hyperparameter. |
slab_dof |
see |
slab_scale |
(pos. real) these control the slab-part of the regularized
horseshoe. Specifically, in the notation of Boonstra and Barbaro,
d^2~InverseGamma( |
mu_sd |
(pos. real) the prior standard deviation for the intercept parameter mu |
only_prior |
(logical) should all data be ignored, sampling only from the prior? |
mc_warmup |
number of MCMC warm-up iterations |
mc_iter_after_warmup |
number of MCMC iterations after warm-up |
mc_chains |
number of MCMC chains |
mc_thin |
every nth draw to keep |
mc_stepsize |
positive stepsize |
mc_adapt_delta |
between 0 and 1 |
mc_max_treedepth |
max tree depth |
return_as_CmdStanMCMC |
(logical) should the function return the CmdStanMCMC object asis or should a summary of CmdStanMCMC be returned as a regular list |
eigendecomp_hist_var |
R object of class 'eigen' containing a pxp matrix of eigenvectors in each row (equivalent to v_0 in Boonstra and Barbaro) and a p-length vector of eigenvalues. This is by default equal to eigen(alpha_prior_cov) |
scale_to_variance225 |
a vector assumed to be such that, when multiplied by the diagonal elements of alpha_prior_cov, the result is a vector of elements each equal to 225. This is explicitly calculated if it is not provided |
seed |
seed for the underlying STAN model to allow for reproducibility |
slab_precision |
(pos. real) the slab-part of the regularized horseshoe,
this is equivalent to (1/d)^2 in the notation of Boonstra and Barbaro. If
specified, it is assumed that you want a fixed slab component and will take
precedence over any provided values of |
list
object containing the draws and other information.
data(current)
alpha_prior_cov = matrix(data = c(0.02936, -0.02078, 0.00216, -0.00637,
-0.02078, 0.03192, -0.01029, 0.00500,
0.00216, -0.01029, 0.01991, -0.00428,
-0.00637, 0.00500, -0.00428, 0.01650),
byrow = FALSE, nrow = 4);
scale_to_variance225 = diag(alpha_prior_cov) / 225;
eigendecomp_hist_var = eigen(alpha_prior_cov);
aug_projection1 = matrix(data = c(0.0608, -0.02628, -0.0488, 0.0484, 0.449, -0.0201,
0.5695, -0.00855, 0.3877, 0.0729, 0.193, 0.4229,
0.1816, 0.37240, 0.1107, 0.1081, -0.114, 0.3704,
0.1209, 0.03683, -0.1517, 0.2178, 0.344, -0.1427),
byrow = TRUE, nrow = 4);
foo = glm_sab2(y = current$y_curr,
x_standardized = current[,2:11],
family = "binomial",
alpha_prior_mean = c(1.462, -1.660, 0.769, -0.756),
alpha_prior_cov = alpha_prior_cov,
aug_projection = aug_projection1,
phi_dist = "trunc_norm",
phi_mean = 1,
phi_sd = 0.25,
eta_param = Inf,
omega_mean = 0,
omega_sd = 0.15,
omega_sq_in_variance = TRUE,
beta_orig_scale = 0.0223,
beta_aug_scale = 0.0223,
local_dof = 1,
global_dof = 1,
mu_sd = 5,
only_prior = 0,
mc_warmup = 200,
mc_iter_after_warmup = 200,
mc_chains = 2,
mc_thin = 1,
mc_stepsize = 0.1,
mc_adapt_delta = 0.999,
mc_max_treedepth = 15,
eigendecomp_hist_var = eigendecomp_hist_var,
scale_to_variance225 = scale_to_variance225);
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.