BGGM-package: BGGM: Bayesian Gaussian Graphical Models

Description References

Description

The R package BGGM provides tools for making Bayesian inference in Gaussian graphical models (GGM). The methods are organized around two general approaches for Bayesian inference: (1) estimation \insertCiteWilliams2019BGGM and (2) hypothesis testing \insertCiteWilliams2019_bfBGGM. The key distinction is that the former focuses on either the posterior or posterior predictive distribution, whereas the latter focuses on model comparison with the Bayes factor.

The methods in BGGM build upon existing algorithms that are well-known in the literature. The central contribution of BGGM is to extend those approaches:

  1. Bayesian estimation with the novel matrix-F prior distribution \insertCiteMulder2018BGGM.

    • Estimation estimate.

  2. Bayesian hypothesis testing with the novel matrix-F prior distribution \insertCiteMulder2018BGGM.

    • Exploratory hypothesis testing explore.

    • Confirmatory hypothesis testing confirm.

  3. Comparing GGMs \insertCitewilliams2020comparingBGGM

    • Partial correlation differences ggm_compare_estimate.

    • Posterior predictive check ggm_compare_ppc.

    • Exploratory hypothesis testing ggm_compare_explore.

    • Confirmatory hypothesis testing ggm_compare_confirm.

  4. Extending inference beyond the conditional (in)dependence structure

    • Predictability with Bayesian variance explained \insertCitegelman_r2_2019BGGM predictability.

    • Posterior uncertainty in the partial correlations estimate.

    • Custom Network Statistics roll_your_own.

Furthermore, the computationally intensive tasks are written in c++ via the R package Rcpp \insertCiteeddelbuettel2011rcppBGGM and the c++ library Armadillo \insertCitesanderson2016armadilloBGGM, there are plotting functions for each method, control variables can be included in the model, and there is support for missing values bggm_missing.

Supported Data Types:

Additional Features:

The primary focus of BGGM is Gaussian graphical modeling (the inverse covariance matrix). The residue is a suite of useful methods not explicitly for GGMs:

  1. Bivariate correlations for binary (tetrachoric), ordinal (polychoric), mixed (rank based), and continous (Pearson's) data zero_order_cors.

  2. Multivariate regression for binary (probit), ordinal (probit), mixed (rank likelihood), and continous data (estimate).

  3. Multiple regression for binary (probit), ordinal (probit), mixed (rank likelihood), and continous data (e.g., coef.estimate).

Note on Conditional (In)dependence Models for Latent Data:

All of the data types (besides continuous) model latent data. That is, unoboserved data that is assumed to be Gaussian distributed. For example, a tetrachoric correlation (binary data) is a special case of a polychoric correlation (ordinal data). Both relations are between "theorized normally distributed continuous latent variables" (Wikipedia). In both instances, the correpsonding partial correlation between observed variables is conditioned on the remaining variables in the latent space. This implies that interpration is similar to continuous data, but with respect to latent variables. We refer interested users to \insertCite@page 2364, section 2.2, in @webb2008bayesian;textualBGGM.

High Dimensional Data?

BGGM was builit specificially for social-behvarioal scientists. Of course, the methods can be used by all researchers. However, there is not support for high-dimensonal data (i.e., more variables than observations) that are common place in the genetics literature. These data are rare in the social-behavioral sciences. In the future, support for high-dimensional data may be added to BGGM.

References

\insertAllCited
BGGM documentation built on Aug. 20, 2021, 5:08 p.m.