#opts_chunk$set(eval = TRUE)

About

This pages shows particular examples to illustrate the 'solarius' package's behaviour with large datasets. In particular, these examples were generated wuth the GAIT (Genetic Analysis of Idiopathic Thrombophilia) dataset. The GAIT Project included 397 individuals from 21 extended Spanish families (mean pedigree size = 19) [1]. A genome-wide set of 307,984 SNPs was typed in all of the participants using the Infinium 317 k Beadchip on the Illumina platform (San Diego, CA, USA).

We selected 3 specific examples where we could compare the results obtained with the 'solarius' package with those previously obtained and published. In particular we applied a GWAS and a Linkage model to the BMI and the Factor XI phenotypes and the Trhombosis affection. For each of these traits, we show how the main functions of the a'solarius' package were applied and we confirm that they are in concordance with the previsouly published ones.

BMI

library(solarius)
cores <- 32
library(gait)


dat  <- gait1.phen()
mibddir <- gait1.mibddir()
gait1.snpfiles <- gait1.snpfiles()
pdat <- mutate(dat, tr_FXIc=dat$FXI_T*5.1,   ln_bmi = log(bmi),  tr_bmi = 6.1 * ln_bmi)
# traits previously transformed, only significant covariates
M1 <- solarPolygenic(tr_bmi ~ AGE, pdat, covtest = TRUE)
M1
A1 <- solarAssoc(tr_bmi ~ AGE, pdat, genocov.files = gait1.snpfiles$genocov.files, snplists.files = gait1.snpfiles$snplists.files, snpmap.files = gait1.snpfiles$snpmap.files, cores = cores)
summary(A1)
plot(A1)
plot(A1, "qq")
L1 <- solarMultipoint(formula = tr_bmi ~ AGE, data = dat, mibddir = mibddir, 
    chr = 1:22, interval = 5, cores = 22, verbose = 1)
summary(L1)
plot(L1)

FXI

M2 <- solarPolygenic(tr_FXI ~ AGE, pdat, covtest=T)
A2 <- solarAssoc(tr_FXI ~ AGE, pdat, genocov.files = gait1.snpfiles$genocov.files, snplists.files = gait1.snpfiles$snplists.files, snpmap.files = gait1.snpfiles$snpmap.files, cores = cores)
summary(A2)
plot(A2)
plot(A2, "qq")

We observe that our significant SNPs correspond to those reported in [2]. They are three loci: rs710446 and rs4253399 located in the structural F11 gene, and and rs4241824, located in the kininogen 1 (KNG1) gene. Both rs710446 and rs4241824 were reported in our previous GWAS published in [2].

L2 <- solarMultipoint(formula = tr_FXI ~ AGE, data = dat, mibdir = mibdir, chr = 1:22, interval = 5, cores = 22, verbose = 1)
summary(L2)
plot(L2)

Throm

M3 <- solarPolygenic(aff ~ AGE, pdat, covtest = TRUE)
M3

``` {r assoc3, echo=T, cache = TRUE} A3 <- solarAssoc(aff ~ AGE, pdat, genocov.files = gait1.snpfiles$genocov.files, snplists.files = gait1.snpfiles$snplists.files, snpmap.files = gait1.snpfiles$snpmap.files, cores = cores)

```r
plot(A3)
plot(A3, "qq")
L3<- solarMultipoint(formula = aff ~ AGE, data = dat, mibdir = mibdir, chr = 1:22, interval = 5, cores = 22, verbose = 1)
summary(L3)
plot(L3)

Refences

[1] JC Souto, L Almasy, M Borrell , F Blanco-Vaca, J Mateo, et al. Genetic susceptibility to thrombosis and its relationship to physiological risk factors: the GAIT study Genetic Analysis of Idiopathic Thrombophilia. American Journal of Human Genetics.2000; 67:1452–1459. [2] J.C. Souto, G.Pena, A. Ziyatdinov, A.Buil, S. López, J. Fontcuberta, J.M. Soria. A genomewide study of body mass index and its genetic correlation with thromboembolic risk. Thrombosis & Haemostasis. 2014;112(5):1036-43. [3] Maria Sabater-Lleal, Angel Martinez-Perez, Alfonso Buil, Lasse Folkersen, Juan Carlos Souto, Maria Bruzelius, Montserrat Borrell, Jacob Odeberg, Angela Silveira, Per Eriksson, Laura Almasy, Anders Hamsten, José Manuel Soria. A genome-wide association study identifies KNG1 as a genetic determinant of plasma factor XI Level and activated partial thromboplastin time.Arteriosclerosis, Thrombosis, and Vascular Biology. 2012; 32 (8): 2008-16.



ugcd/solarius documentation built on May 3, 2019, 2:22 p.m.