hapmap: Example dataset - Hapmap Phase III

Description Format


The example dataset is from the Hapmap Phase III project (https://www.ncbi.nlm.nih.gov/variation/news/NCBI_retiring_HapMap/). Our training sample consisted of unrelated individuals from two different populations: a) Utah residents with Northern and Western European ancestry (CEU), and b) Toscans in Italy (TSI). We present the eigenvalues and PC scores obtained from performing PCA on the SNPs on chromosome 7.


This example dataset is a list containing the following elements:


Sample eigenvalues of the training sample.


PC scores of the training sample. This has PC1 and PC2 scores for 198 observations.


We obtained the predicted scores by leaving one observation out at a time, applying PCA to the rest of the data and then predicting the PC score of the left out observation. This has PC1 and PC2 scores of 198 observations.


Number of observations in the training set = 198.


Number of SNPs on chromosome 7.

hdpca documentation built on Jan. 16, 2021, 5:33 p.m.

Related to hapmap in hdpca...