MethyImpute-package: Imputing the missing covariates via methylation data

Description Details Author(s) Examples

Description

DNA methylation is a widely studied epigenetic mechanism and alterations in methylation patterns may be involved in the development of common diseases. Unlike inherited changes in genetic sequence, variation in site-specific methylation varies by tissue, developmental stage, disease status, and may be impacted by aging and exposure to environmental factors such as diet or smoking. These non-genetic factors are typically included in epigenome-wide association studies (EWAS) because they may be confounding factors to the association between methylation and disease. However, missing values in these variables can lead to reduced sample size and decrease the statistical power of EWAS. We propose a site selection and multiple imputation (MI) method to impute missing covariate values and to perform association tests in EWAS. Then, we compare this method to an alternative projection-based method. Through simulations, we show that the MI-based method is slightly conservative, but provides consistent estimates for effect size.

Details

The DESCRIPTION file: This package was not yet installed at build time.

Index: This package was not yet installed at build time.

Author(s)

Chong Wu

Maintainer: Chong Wu <wuxx0845@umn.edu>

Examples

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
#Do Not Run
## load the necessary packages
library("truncnorm")
library("mice")

## Set the initial conditions
N <- 500 ## Total sample size (subjects)
m <- 1000  ## Total CpG sites
missing.prop = 0.3 ## missing rate
impute.times = 10  ## The number of imputations
b4.value = 0.02    ## The effect size

seed = 1
data = simulatingData(seed,N,m,missing.prop, b4.value)

Y = data$reference.Y
pheno = data$pheno
pheno = pheno[,2:5]
reference.index = data$reference.num
missing.index = data$test.num
missing.cov.name = c("cell.1","cell.2","cell.3")
complete.cov.name = "x1"
m=30
maxit = 5
max.refernce.methy = 30
defaultMethod = c("norm", "logreg", "polyreg", "polr")
#imp = methy.mice(Y, pheno, missing.index, reference.index, missing.cov.name, complete.cov.name, #max.refernce.methy = 30, m = 30, maxit = 5, defaultMethod = c("norm", "logreg", "polyreg", "polr"))

ChongWu-Biostat/MethyImpute documentation built on May 6, 2019, 11:18 a.m.