load.summary: Loading multiple summary statistics from genome-wide...

Description Usage Arguments Value Author(s) References See Also Examples

Description

The function loads multiple meta-GWAS summary statistics, for subsequent multi-trait GWAS. Currently, the package only analyzes summary statistics from inverse-Gaussianized continuous traits.

Usage

1
2
3
load.summary(files, cor.pheno = NULL, indep.snps = NULL, est.var = FALSE,
  columnNames = c("snp", "a1", "a2", "freq", "beta", "se", "n"),
  fixedN = NULL)

Arguments

files

A vector of file names as strings. Each file name should contain summary statistics of one trait to be included in the multi-trait analysis. The columns of the summary statistics have to contain (uppercase or lowercase does not matter) 'snp' (marker ID), 'a1' (the first allele), 'a2' (the second allele), 'freq', (frequency of the first allele), 'beta' (effect size), 'se' (standard error), and 'n' (sample size).

cor.pheno

A #traits x #traits matrix of correlation matrix of the phenotypes, to be used to construct the multi-trait test statistic. If NULL, this matrix will be estimated from genome-wide summary statistics. If you have partially overlapping samples for different traits, shrinkage correlation matrix is recommended (see reference), so in that case, unless you know what you are doing, leave this argument as default, i.e. NULL.

indep.snps

A vector of strings containing the names of a set of independent SNPs. This is recommended to be generated by LD-pruning the genotype data in a certain cohort. Typically the number of SNPs should be more than 10,000 in order to obtain a good estimate of cor.pheno. If cor.pheno = NULL, this argument cannot be NULL.

est.var

A logical value. If FALSE, each phenotypic variance is assumed to be known as 1. If TRUE, each phenotypic variance will be estimated to adjust the summary statistics, so that the corresponding phenoypic variance is 1.

columnNames

A vector with names of columns containing necessary information in the input file; default values are c('snp','a1','a2','freq','beta','se','n'). The values are case-insensitive. Note: check your allele definitions for different traits are based on the same strand!

fixedN

sample size to assume across all analyses, when provided, this number will be used (instead of the ones specified in the input files)

Value

The function returns a list of class multi.summary, containing two elements: gwa (the cleaned data to be processed in multi-trait GWAS), cor.pheno (user input or estimated), and var.pheno (default or estimated).

Author(s)

Xia Shen, Yakov, Tsepilov, Yurii S. Aulchenko

References

Xia Shen, Yakov Tsepilov, ..., Yudi Pawitan, Chris S. Haley, Yurii S. Aulchenko (2017). Discovery, replication, and in silico functional investigation of 22 new pleiotropic anthropometric loci. Submitted.

See Also

MultiSummary

Examples

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21
## Not run: 
## download the six example files from:
## https://www.dropbox.com/sh/hhta45cewvvea2s/AADfj4OXlbroToZAwIii2Buha?dl=0
## the summary statistics from Randall et al. (2013) PLoS Genet
## for males only
## bmi: body mass index
## hip: hip circumference
## wc: waist circumference
## whr: waist-hip ratio

## load the prepared set of independent SNPs
indep.snps <- as.character(read.table('indep.snps')$V1)

## load summary statistics of the six traits
stats.male <- load.summary(files = list.files(pattern = '*.txt'), indep.snps = indep.snps)

## perform multi-trait meta-GWAS
result <- MultiSummary(stats.male)
head(result)

## End(Not run)

MultiABEL documentation built on June 20, 2017, 9:04 a.m.