projection.based: Projection Based Method

Description Usage Arguments Details Value Author(s) Examples

View source: R/projectionBased.R

Description

A extension to the Houseman 2012 method with adaptive reference sites.

Usage

1
projection.based(reference.CpG, reference.pheno, infer.CpG, missing.name, missing.n = 200)

Arguments

reference.CpG

Validation methylation (CpGs x subjects)

reference.pheno

Validation pheno

infer.CpG

Inference methylation (CpGs x subjects)

missing.name

The missing covariates name

missing.n

The number of CpG sites you want to use to impute the missing data

Details

We tailor the method proposed by Houseman et al.to a more general situation. Briefly, the Houseman algorithm identifies 100-300 CpG sites that discriminated cellular composition in sorted normal human cell populations (consisting of B cells, granulocytes, monocytes, NK, and CD4+ and CD8+ T cells). The method fits a linear model at each of these CpG sites using a reference dataset to estimate the coefficient for each cellular component. It then uses a matrix projection approach to map these estimated coefficients to the relative proportions of each cellular component in the samples without cell type composition information. This method can be applied to other phenotypes besides cell type composition, if a reference dataset is available with both DNA methylation data and phenotype of interest. In the case of missing data, we use the set of samples with complete information as the reference dataset, and apply the projection approach to impute the missing values.

Value

Return the imputed covariates matrix

Author(s)

Chong Wu

Examples

1
2
3
##---- Should be DIRECTLY executable !! ----
##-- ==>  Define data, use random,
##--	or do  help(data=index)  for the standard data sets.

ChongWu-Biostat/MethyImpute documentation built on May 6, 2019, 11:18 a.m.