mixmapTest: Implements the MixMAP algorithm using hypothesis testing...

Description Usage Arguments Details Value Author(s) References Examples

Description

This function implements the MixMAP algorithm, which performs gene-level tests of association using data from a previous GWAS or data from a meta-analysis of several GWAS. Conceptually, genes are detected as significant if the collection of p-values within a gene are determined to be collectively smaller than would be observed by chance.

Usage

1
2
mixmapTest(data.set, pval="pval", snp="snp", gene="gene",
  coord="coord", chr="chr", alpha = 0.05)

Arguments

data.set

A data.frame containing the input data. Each observation in this data set is a SNP. This file must contain, at least, the SNP name, p-value for a SNP, group name (usually a gene name) where SNP is located, SNP location coordinate, and chromosome number of the SNP.

pval

A character string with the name of the variable containing the p-values in the data.set. Default is "pval". The data contained in this variable must be numeric.

snp

A character string with the name of the variable containing the SNP name in the data.set. Default is "snp".

gene

A character string with the name of the variable containing the gene name in the data.set. Default is "gene".

coord

A character string with the name of the variable containing the location coordinate of the SNP in the data.set. Default is "coord". The data contained in this variable must be numeric.

chr

A character string with the name of the variable containing the chromosome number of the SNP location in the data.set. Default is "chr". The data contained in this variable must be numeric.

alpha

A numeric scalar indicating the level of significance the user chooses to use for detection. The default is 0.05.

Details

The user must provide a file that includes SNP name, SNP p-value, and a group name (assumed to be gene, but the user can use any grouping they choose), among other inputs.

Note about SNPs in genes: It is possible for individual SNPs to be located in two overlapping genes. In this case, the user can choose, when creating the input file, to list that SNP in both genes, only one of the genes, or simply ignore SNPS that are not in a unique gene. If the user chooses to list a SNP in more than one gene, the SNP must have multiple rows in the input file with a different gene in each row.

If the user only has SNP names and p-values, gene name must be appended to the file. The user can either use their own file to append gene name, base pair, and chromosome. The R package biomaRt located in bioconductor is a good source for linking SNPs to genes.

Value

An object of class MixMAP.

output

A data.frame with a row for each gene containing gene symbol, posterior estimates for all gene level effects, variance used in intervals, upper bound of one sided interval, the number of SNPs in each gene, the chromsome of the SNP, the location coordinate of the SNP, the gene-level p-value, the Bonferroni adjusted gene-level p-value, and the q-value based on Banjamini-Hochberg fasle discovery rate.

num.genes.detected

A vector containing the number of SNPs detected and the total number of genes

detected.genes

A data.frame with a row for each gene containing gene symbol, posterior estimates for all gene level effects, variance used in intervals, upper bound of one sided interval, the number of SNPs in each gene, the chromsome of the SNP, the location coordinate of the SNP, the gene-level p-value, the adjusted gene-level p-value, the name of the SNP with the smallest p-value, the minimum p-value in the gene, and a 5 number summary of the p-values within each gene

lmer.out

A mer object containing all of the model output information, including parameter estimates, from the lmer function.

Author(s)

Gregory J. Matthews

References

Foulkes, A.S., Matthews, G.J., Das, U., Ferguson, J., Reilly, M. (2013) “Mixed Modeling of Meta-Analysis P-Values (MixMAP) Suggests Multiple Novel Gene Loci for Low Density Lipoprotein Cholesterol". PLoS ONE 8(2): e54812.

Matthews, G.J. and Foulkes, A.S. (2015) “MixMAP: An R Package for Mixed Modeling of Meta-Analysis $p$ Values in Genetic Association Studies." Journal of Statistical Software. 66(3): 1-11.

Examples

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
library(MixMAP)	
#Load data
#This data has been prepared to be used as input to the MixMAP function
data(MixMAP_example)
#Run MixMAP
MixOut<-mixmapTest(MixMAP_example,pval="GC.Pvalue",snp="MarkerName",
  chr="Chr",coord="Coordinate",gene="Gene")
#Display first ten detected genes
summary(MixOut)
#MixManhattan Plot
plot(MixOut)

gjm112/MixMAP documentation built on May 17, 2019, 6:03 a.m.