glm.leap: Posterior of latent exchangeability prior (LEAP)
In hdbayes: Bayesian Analysis of Generalized Linear Models with Historical Data

glm.leap

R Documentation

Posterior of latent exchangeability prior (LEAP)

Description

Sample from the posterior distribution of a GLM using the latent exchangeability prior (LEAP) by Alt et al. (2023).

Usage

glm.leap(
  formula,
  family,
  data.list,
  K = 2,
  prob.conc = NULL,
  offset.list = NULL,
  beta.mean = NULL,
  beta.sd = NULL,
  disp.mean = NULL,
  disp.sd = NULL,
  iter_warmup = 1000,
  iter_sampling = 1000,
  chains = 4,
  ...
)

Arguments

`formula`	a two-sided formula giving the relationship between the response variable and covariates.
`family`	an object of class `family`. See `?stats::family`.
`data.list`	a list of `data.frame`s. The first element in the list is the current data, and the rest are the historical data sets. For LEAP implementation, all historical data sets will be stacked into one historical data set.
`K`	the desired number of classes to identify. Defaults to 2.
`prob.conc`	a scalar or a vector of length `K` giving the concentration parameters for Dirichlet prior. If length == 2, a `Beta(prob.conc[1], prob.conc[2])` prior is used. If a scalar is provided, `prob.conc` will be a vector of repeated elements of the given scalar. Defaults to a vector of 1s.
`offset.list`	a list of matrices giving the offset for current data followed by historical data. For each matrix, the number of rows corresponds to observations and columns correspond to classes. Defaults to a list of matrices of 0s. Note that the first element of `offset.list` (corresponding to the offset for current data) should be a matrix of repeated columns if `offset.list` is not NULL.
`beta.mean`	a scalar or a `⁠p x K⁠` matrix of mean parameters for initial prior on regression coefficients, where `p` is the number of regression coefficients (including intercept). If a scalar is provided, `beta.mean` will be a matrix of repeated elements of the given scalar. Defaults to a matrix of 0s.
`beta.sd`	a scalar or a `⁠p x K⁠` matrix of sd parameters for the initial prior on regression coefficients, where `p` is the number of regression coefficients (including intercept). If a scalar is provided, same as for `beta.mean`. Defaults to a matrix of 10s.
`disp.mean`	a scalar or a vector whose dimension is equal to the number of classes (`K`) giving the location parameters for the half-normal priors on the dispersion parameters. If a scalar is provided, `disp.mean` will be a vector of repeated elements of the given scalar. Defaults to a vector of 0s.
`disp.sd`	a scalar or a vector whose dimension is equal to the number of classes (`K`) giving the scale parameters for the half-normal priors on the dispersion parameters. If a scalar is provided, same as for `disp.mean`. Defaults to a vector of 10s.
`iter_warmup`	number of warmup iterations to run per chain. Defaults to 1000. See the argument `iter_warmup` in `sample()` method in cmdstanr package.
`iter_sampling`	number of post-warmup iterations to run per chain. Defaults to 1000. See the argument `iter_sampling` in `sample()` method in cmdstanr package.
`chains`	number of Markov chains to run. Defaults to 4. See the argument `chains` in `sample()` method in cmdstanr package.
`...`	arguments passed to `sample()` method in cmdstanr package (e.g., `seed`, `refresh`, `init`).

Details

The latent exchangeability prior (LEAP) discounts the historical data by identifying the most relevant individuals from the historical data. It is equivalent to a prior induced by the posterior of a finite mixture model for the historical data set.

Value

The function returns an object of class draws_df giving posterior samples, with an attribute called 'data' which includes the list of variables specified in the data block of the Stan program.

References

Alt, E. M., Chang, X., Jiang, X., Liu, Q., Mo, M., Xia, H. M., and Ibrahim, J. G. (2023). LEAP: The latent exchangeability prior for borrowing information from historical data. arXiv preprint.

Examples

data(actg019)
data(actg036)
# take subset for speed purposes
actg019 = actg019[1:100, ]
actg036 = actg036[1:50, ]
if (instantiate::stan_cmdstan_exists()) {
  glm.leap(
    formula = outcome ~ scale(age) + race + treatment + scale(cd4),
    family = binomial('logit'),
    data.list = list(actg019, actg036),
    K = 2,
    chains = 1, iter_warmup = 500, iter_sampling = 1000
  )
}

hdbayes documentation built on Sept. 11, 2024, 5:34 p.m.