The same pathways are present in all humans, but are these genes somehow tuned differently based on population ancestry?
This is a project to develop a computational pipeline that determines instances of population-driven SNP-SNP coevolution at the pathway level in an effort to better understand the evolution of human pathways. A wide number of software packages and statistical methods are used and are outlined below (NOTE: the following pipeline is currently carried out between the CEU and YRI population cohorts)
| METHOD | RESULT | | ------------- | ------------- | | PLINK | ~1.5 million SNPs genotyped in CEU and YRI (HapMap 3) | | GSEA | 56 pathways (19 non-redundant) enriched for population-driven positive selection (i.e., high FST genes) | | Linkage disequilibrium | 4 (CEU) / 10 (YRI) within-pathway and 16 (CEU) / 59 (YRI) between-pathway coevolution signals discoered (FDR ≤ 0.2)|
HYPOTHESIS: evolutionary maintenance of pathway-level SNP interactions that influence population fitness
Pipeline scripts found here
# within R environment (calls recodeFAM.R, popPCA.R, calcFST.R, SNP2gene.R, setupGSEArun.R, getPathStats.R, LDstatsWPM.R, and LDstatsBPM.R)
> source(runPipeline.R) # runs entire pipeline
recodeFAM.R
)popPCA.R
)calcFST.R
)calcFST.R
)head(markerFST) Marker CHI2 1 rs1000000 0.100100 2 rs10000003 0.122500 3 rs10000007 -0.273025 4 rs10000010 -0.005300 5 rs10000011 -0.179690 6 rs10000015 -0.177080 ```
SNP2gene.R
)head(snp2gene) V1 V2 V3 1 rs12562034 LOC100288069 0 2 rs7518545 LOC100288069 0 3 rs9651273 AGRN 0 4 rs61766341 AGRN 0 5 rs12134754 AGRN 0 6 rs11586034 AGRN 0 ```
> head(res)
Geneset
1 CELL FATE COMMITMENT%GOBP%GO:0045165
2 SRP-DEPENDENT COTRANSLATIONAL PROTEIN TARGETING TO MEMBRANE%GOBP%GO:0006614
3 EUKARYOTIC TRANSLATION TERMINATION%REACTOME DATABASE ID RELEASE 56%72764
4 EUKARYOTIC TRANSLATION ELONGATION%REACTOME DATABASE ID RELEASE 56%156842
5 VIRAL MRNA TRANSLATION%REACTOME%R-HSA-192823.1
6 TRANSMEMBRANE RECEPTOR PROTEIN SERINE/THREONINE KINASE SIGNALING PATHWAY%GOBP%GO:0007178
Size ES NES NominalP FDR FWER
1 109 0.406 4.667 0.0000 0.009 0.0506
2 83 0.130 4.629 0.0007 0.009 0.0556
3 77 0.114 4.592 0.0025 0.009 0.0623
4 78 0.107 5.024 0.0019 0.010 0.0184
5 75 0.126 4.901 0.0013 0.010 0.0260
6 134 0.372 4.676 0.0000 0.010 0.0492
getPathStats.R
)getPathStats.R
)head(dat) snp_1 chr_1 pos_1 snp_2 chr_2 pos_2 R.squared pathway 1 rs10119465 9 19376144 rs1693547 8 101714394 8.019353e-03 pathway_1 2 rs10119465 9 19376144 rs1881157 4 99712077 5.271225e-03 pathway_1 3 rs10119465 9 19376144 rs11684944 2 3633917 1.151087e-02 pathway_1 4 rs10119465 9 19376144 rs632401 7 73563425 3.049999e-03 pathway_1 5 rs10119465 9 19376144 rs6545480 2 55460751 2.641132e-02 pathway_1 6 rs10119465 9 19376144 rs11586570 1 93305371 1.475042e-05 pathway_1 set pop 1 Enriched CEU 2 Enriched CEU 3 Enriched CEU 4 Enriched CEU 5 Enriched CEU 6 Enriched CEU ```
Between-pathway results ```
head(dat) snp_1 chr_1 pos_1 gene_1 snp_2 chr_2 pos_2 R.squared 1 rs166125 5 65201373 ERBB2IP rs10762276 10 71052469 0.0084093756 2 rs12493155 3 138400332 PIK3CB rs10762276 10 71052469 0.0079737336 3 rs2131432 17 1297472 YWHAE rs10762276 10 71052469 0.0102617490 4 rs12076073 1 154944156 SHC1 rs10762276 10 71052469 0.0002102607 5 rs741334 12 112985328 PTPN11 rs10762276 10 71052469 0.0031810952 6 rs3756668 5 67596088 PIK3R1 rs10762276 10 71052469 0.0048012803 pathway_pair1 1 ALPHA6BETA4INTEGRIN_IOB_ALPHA6BETA4INTEGRIN 2 ALPHA6BETA4INTEGRIN_IOB_ALPHA6BETA4INTEGRIN 3 ALPHA6BETA4INTEGRIN_IOB_ALPHA6BETA4INTEGRIN 4 ALPHA6BETA4INTEGRIN_IOB_ALPHA6BETA4INTEGRIN 5 ALPHA6BETA4INTEGRIN_IOB_ALPHA6BETA4INTEGRIN 6 ALPHA6BETA4INTEGRIN_IOB_ALPHA6BETA4INTEGRIN pathway_pair2 ixn_num set pop 1 CARBOHYDRATE_TRANSPORT_GOBP_GO_0008643 interaction_1 Enriched CEU 2 CARBOHYDRATE_TRANSPORT_GOBP_GO_0008643 interaction_1 Enriched CEU 3 CARBOHYDRATE_TRANSPORT_GOBP_GO_0008643 interaction_1 Enriched CEU 4 CARBOHYDRATE_TRANSPORT_GOBP_GO_0008643 interaction_1 Enriched CEU 5 CARBOHYDRATE_TRANSPORT_GOBP_GO_0008643 interaction_1 Enriched CEU 6 CARBOHYDRATE_TRANSPORT_GOBP_GO_0008643 interaction_1 Enriched CEU ```
Pathway-level FDR statistic representing the significance of coevolution signal (determined via Kolmogorov-Smirnov statistic)
Within-pathway: LD r2 distribution per selection-enriched pathway against cumulative LD r2 distribution of unenriched pathways
head(pvals)
pathway
1 EMBRYONIC_MORPHOGENESIS
2 CHROMATIN_SILENCING
3 NONSENSE_MEDIATED_DECAY_NMD_INDEPENDENT_OF_THE_EXON_JUNCTION_COMPLEX_EJC
4 NONSENSE_MEDIATED_DECAY_NMD_ENHANCED_BY_THE_EXON_JUNCTION_COMPLEX_EJC
5 NONSENSE_MEDIATED_DECAY_NMD
6 CELL_CHEMOTAXIS
pvals_CEU bonf_CEU fdr_CEU pvals_YRI bonf_YRI fdr_YRI
1 0.00142 0.0793 0.0793 0.26100 1.000 0.3570
2 0.00367 0.2058 0.1029 0.80100 1.000 0.8790
3 0.01135 0.6357 0.1325 0.02480 1.000 0.0555
4 0.01183 0.6623 0.1325 0.00289 0.162 0.0192
5 0.01183 0.6623 0.1325 0.00289 0.162 0.0192
6 0.02210 1.0000 0.1920 0.00253 0.141 0.0192
Between-pathway: LD r2 distribution per selection-enriched pathway-pathway pair against cumulative LD r2 distribution of all unenriched pathway-pathway pairs
head(pvals)
interaction.pathway_pair1
1 ALPHA6BETA4INTEGRIN
2 EMBRYONIC_MORPHOGENESIS
3 ALPHA6BETA4INTEGRIN
4 ALPHA6BETA4INTEGRIN
5 CELLULAR_RESPONSE_TO_LIGHT
6 ALPHA6BETA4INTEGRIN
interaction.pathway_pair2
1 CELLULAR_RESPONSE_TO_LIGHT
2 IL4_MEDIATED_SIGNALING_EVENTS
3 PROTEASOMAL_PROTEIN_CATABOLIC_PROCESS
4 VIRAL_TRANSLATION_AND_PROTEIN_TARGETING
5 IMMUNE_CELL_REGULATION
6 CELL_CHEMOTAXIS
pvals_CEU bonf_CEU fdr_CEU pvals_YRI bonf_YRI fdr_YRI
1 0.000379 0.0649 0.0324 0.4730 1 0.691
2 0.000207 0.0355 0.0324 0.6850 1 0.831
3 0.002506 0.4285 0.1012 0.2540 1 0.447
4 0.002959 0.5060 0.1012 0.3830 1 0.583
5 0.002764 0.4727 0.1012 0.8360 1 0.929
6 0.013680 1.0000 0.1533 0.0268 1 0.112
SUMMARY * Proof-of-concept pipeline successfully discovers instances of coevolution among pathways influencing population fitness * Pathways demonstrate population-driven signals of positive selection (FST) between Europeans and Africans * LD maintains inter-chromosomal SNP-SNP interactions within and between pathways enriched for positive selection
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.