Home

/

GitHub

/

DominikMueller64/bayesMultiGroupRR

/

bayesMultiGroupRR: Bayesian hierarchical multi-group random regression model for...

bayesMultiGroupRR: Bayesian hierarchical multi-group random regression model for...
In DominikMueller64/bayesMultiGroupRR: Fit a multi-group bayesian hierarchical random regression model for genomic prediction.

Description Usage Arguments Details Value See Also Examples

Bayesian hierarchical multi-group random regression model for genomic prediction.

1	bayesMultiGroupRR(data, ...)

data

A list. The data for fitting the model:

name	structure
group	index vector with group memberships
X	marker matrix with genotypes in rows
y	vector with phenotypic data

This function is a minimal wrapper around stan and it only supplies the model file argument and the data. All other argument must be taken from stan.

A stanfit-class object.

See stan, for which this function is a minimal wrapper.

# Stan model file
stanfile <- system.file('extdat', 'bayesMultiGroupRR.stan', package = 'bayesMultiGroupRR')

## Load packages and set options. ------------------------------------------------------------

library('rstan')
library('magrittr')
rstan_options(auto_write = TRUE)
options(mc.cores = parallel::detectCores())
rm(list = ls())

## copied from https://github.com/DominikMueller64/dmisc/blob/master/R/equal_split.R.
equal_split <- function(x, n, random = TRUE, beginning = FALSE) {
  len <- length(x)
  base_size <- len %/% n
  rem <- len %% n
  sizes <- rep(base_size, times = n)
  if (rem > 0L) {
    ix <- if (beginning) seq.int(from = 1L, to = rem) else sample.int(n, size = rem)
    sizes[ix] <- sizes[ix] + 1L
  }

  s <- seq_len(length(x))
  ret <- purrr::map(sizes, function(size) {
    if (random) {
      smp <- sample(x = s, size = size)
    } else {
      smp <- s[seq.int(from = 1L, to = size)]
    }
    s <<- setdiff(s, smp)
    x[smp]
  })
  if (rem > 0L)
    attr(ret, which = 'indices') <- ix
  ret
}

## Load data ---------------------------------------------------------------------------------

data('wheat', package = 'BGLR')
X <- 2 * wheat.X
rm(list = grep(pattern = 'wheat*', x = ls(), value = TRUE))

## Set parameters ----------------------------------------------------------------------------

h2 <- 0.9  ## heritability
n_loci <- min(ncol(X), 20L)  ## number of (known) loci
n_groups <- 5L  ## number of groups (sub-populations)
base <- 0.90
  ## The true correlation matrix of the locus effects.
(true_cor_eff <- base^abs(outer(seq_len(n_groups), seq_len(n_groups), FUN = `-`)))
  ## Sampling true locus effects.
true_eff <- mvtnorm::rmvnorm(n = n_loci, sigma = true_cor_eff) %>% split(., f = col(.))
  ## Empirical covariances.
(emp_cov_eff <- cor(do.call(what = cbind, true_eff)))

## Prepare data ------------------------------------------------------------------------------

  ## Subset data to the desired number of loci.
X <- X[, sort(sample.int(n = ncol(X), size = n_loci)), drop = FALSE]
  ## Split the genotypes into (randomly sampled!) groups.
X <- dmisc::equal_split(x = split(x = X, f = row(X)), n = n_groups, random = TRUE)
  ## Compute genetic values.
g <- purrr::map2(X, true_eff, function(x, e) purrr::map_dbl(x, ~.x %*% e))
  ## Vector indication group-membership of individual observations.
group_size <- X %>% purrr::map_dbl(length)
group <- rep(seq_along(group_size), times = group_size)
n <- sum(group_size)  ## Total number of observations.
  ## Flatten genotypes to a list of matrices.
X <- X %>% purrr::map(~do.call(what = 'rbind', args = .x))
  ## Standardize genotypes within groups. Remove first homozygous loci!
## X <- X %>% purrr::map(scale)
  ## Flatten genotypes to matrix.
X <- X %>% do.call(what = 'rbind')
  ## Flatten genetic values to vector
g <- purrr::flatten_dbl(g)
  ## Add some noise.
y <- g + rnorm(n = n, sd = sqrt(var(g) * (1 - h2) / h2))

dat <- list(n = n,
            n_groups = n_groups,
            n_loci = n_loci,
            group = group,
            y = y,
            X = X)
## Not run: 
## Fit model.
fit <- stan(file = 'stanfile.stan', data = dat, iter = 500L, warmup = 100L,
            chains = 4L, verbose = TRUE, include = FALSE)

## Inspect traceplots of chains.
rstan::traceplot(fit, 'mu') # overall mean
rstan::traceplot(fit, 'beta')  # group-specific means
rstan::traceplot(fit, 'alpha_mu')  # overall locus effects
## rstan::traceplot(fit, 'alpha') # group-specific locus effects (too much)
rstan::traceplot(fit, 'sigma_e') # residual standard deviation
rstan::traceplot(fit, 'sigma_alpha_mu') # overall locus effect standard deviation
rstan::traceplot(fit, 'sigma_alpha') # group-specific locus effect standard deviation
rstan::traceplot(fit, 'Omega') # correlations
rstan::traceplot(fit, 'h2') # heritability (locus - level!)
rstan::traceplot(fit, 'theta') # linear predictor (for genetic values)

## Compute posterior covariance matrix, correlation matrix and linear predictors (GEBVs).
tmp <- rstan::get_posterior_mean(fit, 'Sigma')[, 'mean-all chains']
(post_mean_cov_eff <- matrix(data = tmp, ncol = n_groups))
cov2cor(post_mean_cov_eff)
tmp <- rstan::get_posterior_mean(fit, 'Omega')[, 'mean-all chains']
(post_mean_cor_eff <- matrix(data = tmp, ncol = n_groups))
post_theta <- rstan::get_posterior_mean(fit, 'theta')[, 'mean-all chains']

## Compute the prediction accuracy on per-group basis.
purrr::map2(split(g, f = group), split(post_theta, f = group), ~cor(.x, .y))


## End(Not run)

DominikMueller64/bayesMultiGroupRR documentation built on May 6, 2019, 2:52 p.m.

DominikMueller64/bayesMultiGroupRR index

README.md

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

DominikMueller64/bayesMultiGroupRR
Fit a multi-group bayesian hierarchical random regression model for genomic prediction.

bayesMultiGroupRR: Bayesian hierarchical multi-group random regression model for...
In DominikMueller64/bayesMultiGroupRR: Fit a multi-group bayesian hierarchical random regression model for genomic prediction.

Description

Usage

Arguments

Details

Value

See Also

Examples

Related to bayesMultiGroupRR in DominikMueller64/bayesMultiGroupRR...

R Package Documentation

Browse R Packages

We want your feedback!

DominikMueller64/bayesMultiGroupRR Fit a multi-group bayesian hierarchical random regression model for genomic prediction.

bayesMultiGroupRR: Bayesian hierarchical multi-group random regression model for... In DominikMueller64/bayesMultiGroupRR: Fit a multi-group bayesian hierarchical random regression model for genomic prediction.

Description

Usage

Arguments

Details

Value

See Also

Examples

Related to bayesMultiGroupRR in DominikMueller64/bayesMultiGroupRR...

R Package Documentation

Browse R Packages

We want your feedback!

DominikMueller64/bayesMultiGroupRR
Fit a multi-group bayesian hierarchical random regression model for genomic prediction.

bayesMultiGroupRR: Bayesian hierarchical multi-group random regression model for...
In DominikMueller64/bayesMultiGroupRR: Fit a multi-group bayesian hierarchical random regression model for genomic prediction.