pedigrees: Simulated family pedigrees


Description

A data set of simulated family pedigrees, some of which are duplicates of one another. The pedigrees are ascertained on the basis of family history and include cancer statuses and ages, genetic testing results, and related information. Each family has a FamID, and duplicate pedigrees of the same family share that FamID. Each pedigree has a unique RequestID. The goal of the algorithm is therefore to detect the duplicates, that is, the pedigrees that share a FamID. Within a family, each individual has an ID, and MotherID and FatherID give the IDs of that individual's mother and father, respectively. A portion of the data set is provided below. The data are mostly in the input format of the Mendelian model PanelPRO (https://projects.iq.harvard.edu/bayesmendel/panelpro).
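The relationship between FamID and RequestID can be illustrated with a minimal sketch in Python. It uses a few made-up rows rather than the packaged data, and the helper names `requests_by_family` and `duplicated_families` are hypothetical. It only reads the ground-truth labels; it is not the duplicate-detection algorithm itself, which would not have access to FamID.

```python
from collections import defaultdict

# Toy rows (hypothetical values), one per individual, restricted to the
# ID, FamID and RequestID columns described on this page.
rows = [
    {"RequestID": 101, "FamID": 1, "ID": 1},
    {"RequestID": 101, "FamID": 1, "ID": 2},
    {"RequestID": 102, "FamID": 2, "ID": 1},
    {"RequestID": 103, "FamID": 1, "ID": 1},  # second submission of family 1
    {"RequestID": 103, "FamID": 1, "ID": 2},
]


def requests_by_family(rows):
    """Map each FamID to the sorted list of distinct RequestIDs that carry it."""
    groups = defaultdict(set)
    for row in rows:
        groups[row["FamID"]].add(row["RequestID"])
    return {fam: sorted(reqs) for fam, reqs in groups.items()}


def duplicated_families(rows):
    """Keep only the families that appear under more than one RequestID."""
    return {
        fam: reqs
        for fam, reqs in requests_by_family(rows).items()
        if len(reqs) > 1
    }


print(duplicated_families(rows))  # {1: [101, 103]}
```

Grouping on FamID in this way is useful mainly for scoring: the pairs of RequestIDs it returns are the true duplicates that a detection method should recover.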

Usage


Format

A data frame with 207,214 rows and 32 variables:

ID

ID for the family member

Sex

1 for males, 0 for females

MotherID

ID for reported mother

FatherID

ID for reported father

isProband

1 for proband, 0 otherwise

isAff*

Cancer status, one column per cancer type, where * is the cancer abbreviation (BC = breast cancer, OC = ovarian cancer, COL = colorectal cancer, ENDO = endometrial cancer, PANC = pancreatic cancer, MELA = melanoma)

Age*

Cancer age, one column per cancer type, where * is the cancer abbreviation (BC = breast cancer, OC = ovarian cancer, COL = colorectal cancer, ENDO = endometrial cancer, PANC = pancreatic cancer, MELA = melanoma)

isDead

Death status

BRCA1

BRCA1 testing results (1 = positive, 2 = negative, 0 = untested)

BRCA2

BRCA2 testing results (1 = positive, 2 = negative, 0 = untested)

MLH1

MLH1 testing results (1 = positive, 2 = negative, 0 = untested)

MSH2

MSH2 testing results (1 = positive, 2 = negative, 0 = untested)

MSH6

MSH6 testing results (1 = positive, 2 = negative, 0 = untested)

CDKN2A

CDKN2A testing results (1 = positive, 2 = negative, 0 = untested)

Twins

Twin marker (twins in the family will have the same positive integer)

FamID

True ID for the family

RequestID

Reported ID for the family (true duplicates will have a different RequestID but the same FamID)

Duplicate

1 if the pedigree has a duplicate

nDuplicates

Number of duplicate pedigrees of the family

relationship

Relationship to proband

famSize

Number of members in the family
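The coding conventions listed above can be exercised on a small hand-made pedigree. The following Python sketch is illustrative only: the rows are invented, the helper names (`person`, `wildcard_columns`, `decode_tests`, `parent_ids`, `twin_sets`) are hypothetical, and three points are assumptions rather than facts stated on this page: that the `*` in isAff* and Age* is replaced by the cancer abbreviation, that a missing parent is stored as a missing value (`None` here), and that non-twins carry a Twins value of 0.

```python
from collections import defaultdict

CANCERS = ["BC", "OC", "COL", "ENDO", "PANC", "MELA"]
GENES = ["BRCA1", "BRCA2", "MLH1", "MSH2", "MSH6", "CDKN2A"]
TEST_RESULT = {1: "positive", 2: "negative", 0: "untested"}


def person(pid, sex, mother=None, father=None, twins=0, **tests):
    """Build one toy row; genes not mentioned default to 0 (untested)."""
    row = {"ID": pid, "Sex": sex, "MotherID": mother,
           "FatherID": father, "Twins": twins}
    row.update({gene: tests.get(gene, 0) for gene in GENES})
    return row


def wildcard_columns(prefix):
    """Expand 'isAff' or 'Age' into one column name per cancer (assumed naming)."""
    return [prefix + cancer for cancer in CANCERS]


def decode_tests(row):
    """Translate the 1 / 2 / 0 gene codes into readable labels."""
    return {gene: TEST_RESULT[row[gene]] for gene in GENES}


def parent_ids(pedigree, person_id):
    """Return (MotherID, FatherID), keeping only IDs present in the pedigree."""
    known = {row["ID"] for row in pedigree}
    row = next(r for r in pedigree if r["ID"] == person_id)
    return tuple(p if p in known else None
                 for p in (row["MotherID"], row["FatherID"]))


def twin_sets(pedigree):
    """Group IDs that share the same positive Twins marker."""
    groups = defaultdict(list)
    for row in pedigree:
        if row["Twins"] > 0:
            groups[row["Twins"]].append(row["ID"])
    return dict(groups)


# Mother (BRCA1 positive), father, and twin daughters, one tested negative.
pedigree = [
    person(1, 0, BRCA1=1),
    person(2, 1),
    person(3, 0, mother=1, father=2, twins=1, BRCA1=2),
    person(4, 0, mother=1, father=2, twins=1),
]

print(wildcard_columns("isAff"))
print(decode_tests(pedigree[0])["BRCA1"])  # positive
print(parent_ids(pedigree, 3))             # (1, 2)
print(twin_sets(pedigree))                 # {1: [3, 4]}
print(len(pedigree))                       # rows in this toy pedigree: 4
```

In this sketch the number of rows in the toy pedigree plays the role of famSize. Whether famSize in the packaged data always equals the row count per RequestID is not stated on this page and would need to be checked against the data.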


bayesmendel/snipR documentation built on Jan. 25, 2022, 12:33 a.m.