colEMlrt: EM Likelihood Ratio Test

Description Usage Arguments Details Value Author(s) References See Also Examples

View source: R/colEMlrt.R

Description

Performs the Expectation Maximimization Likelihood Ratio Test (EMLRT) proposed by Weinberg (1999) for each SNP in a matrix in genotype format.

Usage

1
2
3
4
5
colEMlrt(mat.snp, model = c("general", "dominant", "recessive"), maternal = FALSE, 
     parentMissing = c("father", "mother", "either"), iter = 40, eps = 10^-16)

## S3 method for class 'colEMlrt'
print(x, top = 5, digits = 4, ...)

Arguments

mat.snp

a numeric matrix in which each column represents a SNP. Each column must be a numeric vector of length 3 * t representing a SNP genotyped at t trios. Each of the t blocks must consist of the genotypes of father, mother, and offspring (in this order). The genotypes must be coded by 0, 1, and 2. Missing values are allowed and need to be coded by NA. This matrix in genotype format might be generated from a ped-file by, e.g., employing ped2geno.

model

a character string specifying the genetic effect that should be considered in the Poisson regression. By default, the general model proposed by Weinberg (1999) is fitted. Alternatively, a dominant or recessive mode of inheritance might be considered by setting model = "dominant" or model = "recessive", respectively. Abbreviations such as "dom" or "rec" are also accepted.

maternal

logical specifying whether parameters for a maternal effects should be added to the Poisson regression model. If TRUE and model = "general", the most general model described by Weinberg (1999) is used.

parentMissing

a character string specifying whether the genotype of the father, the mother, or either the mother or the father is allowed to be missing. By default, only the genotype of the father is allowed to be missing in a trio. Thus, in this case, all complete trios and all trios in which data are available for the father and the offspring are used in the testing of the considered SNP. If parentMissing = "either", all trios in which the genotype of the offspring or the genotypes of both parents are missing for a particular SNP are ignored in the analysis of this SNP.

iter

a non-negative numeric value specifying the maximum number of iterations that should be used in the application of the expectation maximization algorithm.

eps

a non-negative small numeric value specifying the accuracy for the stopping criterion of the EM algorithm. If the sum of the squared differences of the estimated expected numbers of trios over the 15 possible genotype combinations in trios between two consecutive iterations of the EM algorithm is smaller than eps, the EM algorithm stops. Often, less than ten genotype combinations are required.

x

an object of class colEMlrt, i.e. the output of colEMlrt.

digits

number of digits that should be printed.

top

number of SNPs that should be printed. If top is less than or equal to zero or larger than the total number of SNPs in mat.snp, then the statistics for all SNPs are printed in the order as they were in the genotype matrix used as input into colEMlrt. Otherwise, the top interactions with the smallest p-values are printed.

...

ignored.

Details

While in functions such as colTDT all trios in which the genotype of one or more of the members of this trio is missing for a particular SNP are removed from the analysis of this SNP, the procedure proposed by Weinberg (1999) can handle missing genotypes by employing an expectation maximization (EM) algorithm to estimate the expected numbers of trios for the 15 different genotype combinations possible in a trio (when considering the genotypes of mothers and fathers individually) and a likelihood ratio test based on two nested Poisson regression models using the estimated expected numbers of trios as outcome.

Value

An object of class colEMlrt consisting of the following numeric vectors:

stat

the values of the test statistic of the likelihood ratio test for all SNPs in mat.snp,

pval

the corresponding p-values,

ll.full

the values of the maximized likelihoods of the full models,

ll,red

the values of the maximied likelihoods of the reduced models not containing the parameter(s) corresponding to the genetic model of interest.

Author(s)

Philipp Berger, philipp.berger@hhu.de

References

Weinberg, C.R. (1999). Allowing for Missing Parents in Genetic Studies of Case-Parent Triads. American Journal of Human Genetics, 64, 1186-1193.

See Also

colTDT, ped2geno

Examples

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
# Load the simulated data.
data(trio.data)

# The EM Likelihood Ratio Test can be applied
# to the SNPs in mat.test by
em.out <- colEMlrt(mat.test)

# If a dominant mode of inheritance is of interest,
# the corresponding EM Likelihood Ratio Test can be
# performed by 
emd.out <- colEMlrt(mat.test, model="dominant")

# By default, statistics for the top 5 SNPs are displayed.
# If another number of SNPs, say 10, should be displayed,
# then this can be done by
print(em.out, top = 10)

# The statistics for all SNPs (not ordered by their 
# significance) can be obtained by
print(em.out, top = 0)

trio documentation built on Nov. 8, 2020, 7:41 p.m.