bayz: Bayesian mixed models, shrinkage and interaction kernel...
In MarniTausen/BayzR: R package for bayz

bayz	R Documentation

Bayesian mixed models, shrinkage and interaction kernel regression

Description

The bayz function fits various mixed-linear, Bayesian shrinkage, (sparse) kernel-regression, (kernel) interaction and multi-trait models with complex covariance structures using an extended R-formula syntax.

Usage

bayz(
  model,
  VE = "",
  data = NULL,
  chain = c(0, 0, 0),
  method = "",
  verbose = 1,
  init = NULL
)

Arguments

`model`	An R formula describing the model to be fitted using an extended R formula syntax (see details).
`VE`	Model for the residual variance as formula or string (see details).
`data`	Data frame with variables (responses, explanatory variables) to build the model.
`chain`	Vector c(length, burn-in, skip) with total chain length to run, burn-in, and skip-interval for saving samples and collecting posterior statistics.
`method`	String to indicate analysis method: "Bayes" (full Bayesian, default), "BLUPMC" (BLUE/BLUP solutions with Monte Carlo to get SD/SE).
`verbose`	Integer to regulate printing to R console: 0 (quiet), 1 (some), >=2 (more), with verbose=1 as default.
`init`	An object of class "bayz", output from a previous bayz run, to supply initialisation values to start a new chain.

Details

The model formula in bayz has an extended syntax with all explanatory (right-hand-side) terms wrapped by a function to specify how to fit the variables in the model, for example Yield ~ fx(Year) + rn(Variety) to fit Yield with Year as a fixed class effect and Variety as a random class effect. The syntax allows to add options on each model term, such as variance-structures, fitting, prior, and output options. Common applications are to specify a relationship / similarity "kernel" as correlation-structure for random effects using rn(Variety, V=myGmat), extensions of this with interactions and a variance-structure specified as a kronecker product, and the fitting of regularized regression models on (multiple) large sets of covariates with different shrinkage options. The model-terms currently available are fx() (fixed effects of one or multiple interacting class variables), rn() (random effects of one or multiple interacting class variables with various options to model the covariance structure), rr() (random/ridge regression on a table of covariates with options for homogeneous or heterogeneous shrinkage), rg() (fixed regressions and nested fixed regressions) and a model can specify any number of such model terms.

Model-terms for class variables fx() and rn() allow to specify interactions of variables with a colon, such as fx(Year:Location). This has the same interpretation as in other R models; main effects, if desired, need to be explicitly added in the model. Nested regressions are specified in the common R syntax rg(TempSum|Year:Location) to specify regressions on TempSum within each Year-Location. For random effects, interacting variables can have interaction-variance structures written as a multiplication of similarity matrices / kernels (interpreted as kronecker product) like rn(Variety:Location, V=KG*KE), or can involve an estimated covariance structure specified as VCOV, for instance to specify a multitrait model with rn(Variety:Trait, V=KG*VCOV). Kernels and covariance structures can be made sparser and of reduced rank by setting cut-offs on the eigenvectors to use, by setting an in-model Bayesian variable-selection on eigenvectors, or by estimating a large covariance structure as the kronecker product of two or more smaller matrices. Estimated VCOV covariances structures by default run a Bayesian factor-analytic model estimating a limited number of eigenvectors.

The data argument supplies a data frame with response(s) and explanatory variables to build the model in the usual R-way, and bayz will also search the R environment if variables are not found in the supplied data. The data-frame typically does not include tables with large sets of predictors, and covariance structures / kernels to be used in variance models, and bayz will also search these objects in the R environment. These additional tables of data and matrices should have row-names to match a variable (sample id) in the data, and bayz will link on sample ids, allowing for sample ids in the data to be in different order, repeated, or missing.

The VE argument sets the model for the error (residual) variance structure, which can either be a string with one or a combination of pre-defined structures such as WGHT (weighted), GRP (to fit heterogenous variances by some grouping factor), VCOV (general var-covariance structures), or by a linear model. The latter is interpreted as fitting a log-linear model for the variances and allows to parsimoniously include multiple explanatory variables in the residual variance modelling. VE can be omitted, which will fit a default iid homogeneous variance.

For method="Bayes" (default), bayz runs the common Bayesian estimation of all model parameters by using a Monte Carlo Markov chain. For method="BLUPMC", updating of variance parameters is omitted, and variance estimates should be supplied by using the init argument with the output of a previous run with estimated variances.

The output of the bayz function is a list of class "bayz" that includes several data-frames and lists. For instance, model estimtes are in <output>$Estimates with members named after model parameters. Overviews of the output can be obtained using summary(), which also includes convergence diagnostics and Highest Posterior Density (HPD) intervals on selected parameters. Parameter estimates can also be extracted from the output with methods coef() (all fixed and random effect estimates), fixef() and ranef() (fixed, and random effects, respectively), vcomp() (all variance estimates). Additionally, vcomp() can compute "PVEs" (proportions of variance explained), including HPD intervals, when supplied with information on how to weigh each variance parameter in the total explained variance. Rbayz also includes predict(), contrast() and plot() methods.

Full details are available in the github site ljanss.github.io/Rbayz.

Value

A list of class "bayz" containing results from the fitted model. Several methods are available to summarize, extract, plot or compute contrasts, predictions, etc. on the output.

MarniTausen/BayzR documentation built on Dec. 8, 2024, 8:25 a.m.