This package implements assorted tools for genetic association analyses, which is viewed as being entirely an exercise in regressing a (possibly multivariate) phenotypic “response variable” onto one or more “explanatory variables” that include genetic variables.
Currently, this package does not provide computationally efficient functions for genetic association analyses at a genome wide scale (genome wide association studies; GWAS). These are already provided by other R packages and by standalone software such as PLINK. Rather, the focus of this package is to provide functions for analysing and manipulating phenotype data before conducting a GWAS (“pre-GWAS”), and on functions for analysing summary statistics resulting from a GWAS (“post-GWAS”). Many of the “post-GWAS” functions implement regression analyses using summary statistics, which are intended to closely approximate results that would be obtained by directly analysing the subject-specific genotype and phenotype data.
Functions for “pre-GWAS” analyses include functions useful for deriving response variables from phenotype data, especially response variables for pharmacogenetic analyses derived from clinical trial phenotype data; functions for power analyses; and functions for annotating and plotting results.
Functions for “post-GWAS” analyses currently support calculation of approximate Bayes factors; multi-SNP risk score analyses; multi-SNP conditional regression analyses; and multi-phenotype analyses.
Approximate Bayes factors can be calculated using
abf.Wakefield
, abf.normal
and
abf.t
.
For multi-SNP risk score analyses, the main functions for analysing
summary statistics are grs.summary
,
grs.plot
and grs.filter.Qrs
. The summary
statistics necessary for these analyses are single SNP association
statistics, which can be calculated using a wide variety of existing
tools for GWAS analysis and meta-analysis.
For multi-SNP conditional or multiple regression analyses, the main
functions for performing multiple regression using summary statistics
are combine.moments2
, est.moments2
,
lm.moments2
and stepup.moments2
. The
summary statistics necessary for these analyses can be calculated from
subject-specific genotype and phenotype data, using the function
make.moments2
.
Multi-phenotype analyses can be performed using
multipheno.T2
and multipheno.OBrien
.
In addition, there are “helper” functions for reading and
manipulating subject-specific genotype and phenotype data, and which
provide a convenient interface from R to genotype data exported from
PLINK, and imputed genotype data generated by MACH, minimac, or
IMPUTE. These provide a platform for calculating the necessary
summary statistics, and for performing “exact” analyses to
validate some of the approximate summary statistic based methods. The
main functions provided are read.snpdata.plink
,
read.snpdata.mach
, read.snpdata.minimac
,
and read.snpdata.impute
.
Toby Johnson Toby.x.Johnson@gsk.com
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.