Description Usage Format Source
To simulate genotypes, we used the 1000 Genomes Project database. Variants within 500kbs of the BRCA1 gene, for which several known mutations are associated with a higher risk of developing breast, ovarian and prostate cancers, were selected. To avoid any multicollinearity, we pruned variants based on linkage disequilibrium r^2 > 0.7. Further, a total of 503 subjects with a European genetic ancestry are selected in order to avoid any population structure. One discrete and two continuous traits were simulated using a gaussian copula to model the joint dependence. Finally, we also simulated one discrete and one continuous covariate.
1 |
This data frame has 503 rows and the following 35 columns:
discrete trait simulated from a latent gaussian variable
continous trait simulated from a gaussian distribution
continous trait simulated from a Gamma distribution
intercept
discrete covariate
continous covariate
30 consecutive SNPs sampled from a random genomic region within 500kbs of BRCA1 gene found on chromosome 17
https://www.internationalgenome.org/data/
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.