A data set containing family pedigrees, in which some families have duplicates. The data represent pedigrees that are ascertained based on family history, and include information on cancer statuses and ages, genetic testing results, etc. Each family has a FamID; duplicate families will have the same FamID. Each family has a unique RequestID. Hence, the goal of the algorithm is to detect the duplicates, or the families with the same FamID. Within a family, each individual has an ID, and the MotherID and FatherID represent the ID of the mother and father, respectively. A portion of the data set is provided below. The data is mostly in the format for running the Mendelian model PanelPRO (https://projects.iq.harvard.edu/bayesmendel/panelpro).
1 |
A data frame with 207,214 rows and 32 variables:
ID for the family member
1 for males, 0 for females
ID for reported mother
ID for reported father
1 for proband, 0 otherwise
Cancer status (BC = breast cancer, OC = ovarian cancer, COL = colorectal cancer, ENDO = endometrial cancer, PANC = pancreatic cancer, MELA = melanoma)
Cancer age (BC = breast cancer, OC = ovarian cancer, COL = colorectal cancer, ENDO = endometrial cancer, PANC = pancreatic cancer, MELA = melanoma)
Death status
BRCA1 testing results (1 = positive, 2 = negative, 0 = untested)
BRCA2 testing results (1 = positive, 2 = negative, 0 = untested)
MLH1 testing results (1 = positive, 2 = negative, 0 = untested)
MSH2 testing results (1 = positive, 2 = negative, 0 = untested)
MSH6 testing results (1 = positive, 2 = negative, 0 = untested)
CDKN2A testing results (1 = positive, 2 = negative, 0 = untested)
Twin marker (twins in the family will have the same positive integer)
True ID for the family
Reported ID for the family (true duplicates will have a different RequestID but the same FamID)
1 if the pedigree has a duplicate
Number of duplicate pedigrees of the family
Relationship to proband
Number of family members in the family
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.