R/data.R

#' simulated data for demonstrating the features of BayesQVGEL
#'
#' Simulated gene expression data for demonstrating the features of BayesQVGEL.
#'
#' @docType data
#' @keywords datasets
#' @name data
#' @aliases data y e C g w k coeff
#' @format The data object consists of seven components: y, e, C, g, w ,k and coeff. coeff contains the true values of parameters used for generating Y.
#'
#' @details
#'
#' \strong{The data and model setting}
#'
#' Consider a longitudinal study on \eqn{n} subjects with \eqn{k} repeated measurement for each subject. Let \eqn{Y_{ij}} be the measurement for the \eqn{i}th subject at each time point \eqn{j}(\eqn{1\leq i \leq n, 1\leq j \leq k}) .We use a \eqn{m}-dimensional vector \eqn{G_{ij}} to denote the genetics factors, where \eqn{G_{ij} = (G_{ij1},...,G_{ijm})^{T}}. Also, we use \eqn{p}-dimensional vector \eqn{E_{ij}} to denote the environment factors, where \eqn{E_{ij} = (E_{ij1},...,E_{ijp})^{T}}. \eqn{X_{ij} = (1, T_{ij})^{T}}, where \eqn{T_{ij}^{T}} is a vector of time effects . \eqn{Z_{ij}} is a \eqn{h \times 1} covariate associated with random effects and \eqn{\alpha_{i}} is a \eqn{h\times 1} vector of random effects.  At the beginning,  the interaction effects is modeled as the product of genomics features and environment factors with 4 different levels. After representing the environment factors as three dummy variables, the identification of the gene by environment interaction needs to be performed as group level.  Combing the genetics factors, environment factors and their interactions that associated with the longitudinal phenotype, we have the following mixed-effects model:
#' \deqn{Y_{ij} = X_{ij}^{T}\gamma_{0}+E_{ij}^{T}\gamma_{1}+G_{ij}^{T}\gamma_{2}+(G_{ij}\bigotimes E_{ij})^{T}\gamma_{3}+Z_{ij}^{T}\alpha_{i}+\epsilon_{ij}.}
#' where \eqn{\gamma_{1}},\eqn{\gamma_{2}},\eqn{\gamma_{3}} are \eqn{p},\eqn{m} and \eqn{mp} dimensional vectors that represent the coefficients of the environment effects, the genetics effects and interactions effects, respectively. Accommodating the Kronecker product of the \eqn{m} - dimensional vector \eqn{G_{ij}} and the \eqn{p}-dimensional vector \eqn{E_{ij}}, the interactions between genetics and environment factors can be expressed as a \eqn{mp}-dimensional vector, denoted as the following form:
#' \deqn{G_{ij}\bigotimes E_{ij} = [E_{ij1}E_{ij1},E_{ij2}E_{ij2},...,E_{ij1}E_{ijp},E_{ij2}E_{ij1},...,E_{ijm}E_{ijp}]^{T}.}
#' When \eqn{Z_{ij}^{T} = (1,j)} and  \eqn{\alpha_{i} = (\alpha_{i1},\alpha_{i2})^{T}}, the model becomes random intercept and slope model. When \eqn{Z_{ij}^{T} = 1} and \eqn{\alpha_{i} = \alpha_{i1}}, the model becomes random intercept model.
#'
#' @examples
#' data(data)
#' dim(y)
#' dim(g)
#' dim(e)
#' dim(w)
#' print(k)
#' print(C)
#' print(coeff)
#'
#' @seealso \code{\link{BayesQVGEL}}
NULL

Try the BayesQVGEL package in your browser

Any scripts or data that you put into this service are public.

BayesQVGEL documentation built on April 27, 2023, 1:10 a.m.