igg: Inverse Gamma-Gamma Regression

Description Usage Arguments Details Value Author(s) References Examples

Description

This function provides a fully Bayesian approach for obtaining a sparse estimate of the p \times 1 vector, β in the univariate linear regression model,

y = X β + ε,

where ε \sim N_n (0, σ^2 I_n). This is achieved by placing the inverse gamma-gamma (IGG) prior on the coefficients of β. In the case where p > n, we utilize an efficient sampler from Bhattacharya et al. (2016) to reduce the computational cost of sampling from the full conditional density of β to O(n^2 p).

Usage

1
igg(X, y, a=NA, b=NA, sigma2=NA, max.steps=10000, burnin=5000)

Arguments

X

n \times p design matrix.

y

n \times 1 response vector.

a

The parameter for IG(a,1). If not specified (a=NA), defaults to 1/2+1/p. User may specify a value for a between 0 and 1.

b

The parameter for G(b,1). If not specified (b=NA), defaults to 1/p. User may specify a value for b between 0 and 1.

sigma2

The variance parameter. If the user does not specify this (sigma2=NA), the Gibbs sampler will estimate this using Jeffreys prior. If σ^2 is known or estimated separately (e.g. through empirical Bayes), the user may also specify it.

max.steps

The total number of iterations to run in the Gibbs sampler. Defaults to 10,000.

burnin

The number of burn-in iterations for the Gibbs sampler. Defaults to 5,000.

Details

The function performs sparse estimation of β in the standard linear regression model and variable selection from the p covariates. Variable selection is performed by assessing the 95 percent marginal posterior credible intervals. The lower and upper endpoints of the 95 percent credible intervals for each of the p covariates are also returned so that the user may assess uncertainty quantification. The full model is:

Y | (X, β ) \sim N_n(X β, σ^2 I_n),

β_i | ( λ_i, ξ_i, σ^2) \sim N(0, σ^2 λ_i ξ_i), i = 1, ..., p,

λ_i \sim IG(a,1), i = 1, ..., p,

ξ_i \sim G(b,1), i = 1, ..., p,

σ^2 \propto 1/σ^2.

If σ^2 is known or estimated separately, the Gibbs sampler does not sample from the full conditional for σ^2.

Value

The function returns a list containing the following components:

beta.hat

The posterior mean estimate of β.

beta.med

The posterior median estimate of β.

beta.intervals

The lower and upper endpoints of the 95 percent credible intervals for all p estimates in β.

igg.classifications

A p-dimensional binary vector with "1" if the covariate is selected and "0" if it is deemed irrelevant.

Author(s)

Ray Bai and Malay Ghosh

References

Bai, R. and Ghosh, M. (2018). "The Inverse Gamma-Gamma Prior for Optimal Posterior Contraction and Multiple Hypothesis Testing." Submitted, arXiv:1711.07635.

Bhattacharya, A., Chakraborty, A., and Mallick, B.K. (2016). "Fast Sampling with Gaussian Scale Mixture Priors in High-Dimensional Regression." Biometrika, 69(2): 447-457.

Examples

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
###################### 
# Load diabetes data #
######################
data(diabetes)
attach(diabetes)
X <- scale(diabetes$x)
y <- scale(diabetes$y)

################################
# Fit the IGG regression model #
################################
igg.model <- igg(X=X, y=y, max.steps=5000, burnin=2500)

##############################
# Posterior median estimates #
##############################
igg.model$beta.med

###########################################
# 95 percent posterior credible intervals #
###########################################
igg.model$beta.intervals

######################
# Variable selection #
######################
igg.model$igg.classifications

IGG documentation built on May 2, 2019, 2:04 a.m.