ibdEstimate | R Documentation |
Estimate the IBD coefficients (k0, k1, k2) or the condensed identity coefficients (d1, ..., d9) between a pair (or several pairs) of pedigree members, using maximum likelihood methods.
ibdEstimate( x, ids = typedMembers(x), param = c("kappa", "delta"), markers = NULL, start = NULL, tol = sqrt(.Machine$double.eps), beta = 0.5, sigma = 0.5, contourPlot = FALSE, levels = NULL, verbose = TRUE )
x |
A |
ids |
Either a vector with ID labels, or a data frame/matrix with two
columns, where each row contains the ID labels of two individuals. The
entries are coerced to characters, and must match uniquely against the ID
labels of |
param |
Either "kappa" (default) or "delta"; indicating which set of coefficients should be estimated. |
markers |
A vector with names or indices of markers attached to x, indicating which markers to include. If NULL (default), all markers are used. |
start |
A probability vector (i.e., with nonnegative entries and sum 1)
of length 3 (if |
tol, beta, sigma |
Control parameters for the optimisation routine; can usually be left untouched. |
contourPlot |
A logical. If TRUE, contours of the log-likelihood function are plotted overlaying the IBD triangle. |
levels |
(Only relevant if |
verbose |
A logical. |
It should be noted that this procedure estimates the realised identity coefficients of each pair, i.e., the actual fractions of the autosomes in each IBD state. These may deviate substantially from the theoretical pedigree coefficients.
Maximum likelihood estimation of relatedness coefficients originates with Thompson (1975). Optimisation of κ is done in the (k0, k2)-plane and restricted to the triangle defined by
k0 >= 0, k2 >= 0, k0 + k2 <= 1
. Optimisation of Δ is done in unit simplex of R^8, using the first 8 coefficients.
The implementation optimises the log-likelihood using a projected gradient descent algorithm, combined with a version of Armijo line search.
An object of class ibdEst
, which is basically a dataframe with
either 6 columns (if param = "kappa"
) or 12 columns (if param = "delta"
). The first three columns are id1
(label of first individual),
id2
(label of second individual) and N
(the number of markers with no
missing alleles). The remaining columns contain the coefficient estimates.
Magnus Dehli Vigeland
E. A. Thompson (1975). The estimation of pairwise relationships. Annals of Human Genetics 39.
E. A. Thompson (2000). Statistical Inference from Genetic Data on Pedigrees. NSF-CBMS Regional Conference Series in Probability and Statistics. Volume 6.
ibdBootstrap()
### Example 1: Siblings x = nuclearPed(2) # Simulate 100 markers x = markerSim(x, N = 100, alleles = 1:4, seed = 123, verbose = FALSE) # Estimate kappa (expectation: (0.25, 0.5, 0.25) ibdEstimate(x, ids = 3:4) # Plot contours of the log-likelihood function ibdEstimate(x, ids = 3:4, contourPlot = TRUE) ### Example 2: Full sib mating y = fullSibMating(1) # Simulate 200 SNP markers y = markerSim(y, N = 1000, alleles = 1:10, seed = 123, verbose = FALSE) # Estimate ibdEstimate(y, ids = 5:6, param = "delta")
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.