Description Usage Arguments Details Value Author(s)
This function fits a hierarchical model to sequence counts from a collection of siblings and returns genotyped information. The hierarchy comes from the fact that they share the same parents. If you also have parental sequencing data, then you can include this to improve estimates.
1 2 3 4 |
ocounts |
A vector of non-negative integers. The ith element is the number of reads of the reference allele in the ith child. |
osize |
A vector of positive integers. The ith element is the total number of reads for the ith child. |
ploidy |
A positive integer. The number of copies of the genome in the species. This is the assumed to be the same for all individuals. |
p1counts |
A vector of non-negative integers. The ith element
is the number of reads of the reference allele in the ith
sample of parent 1. If |
p1size |
A vector of positive integers. The ith element is the
total number of reads in the ith sample of parent 1. If
|
p2counts |
A vector of non-negative integers. The ith element
is the number of reads of the reference allele in the ith
sample of parent 2. If |
p2size |
A vector of positive integers. The ith element is the
total number of reads in the ith sample of parent 2. If
|
seq_error |
A non-negative numeric. This is the known sequencing error rate. The default is to estimate this using data that are all approximately the reference allele. |
integrate |
A logical. Should we integrate over our
uncertainty in the parental genotypes ( |
do_eb |
Should we do empirical Bayes ( |
overdispersion |
A logical. Should we fit a beta-binomial model
to account for overdispersion ( |
update_geno |
A logical. Update the parental genotypes? If |
update_pi |
A logical. Update the mixing proporiton? If |
update_outlier |
A logical. Update the outlier distribution? I |
update_rho |
A logical. Update the overdispersion parameter? |
If you have a lot of parental sequencing data, then it could
suffice to run updog
with update_geno
set to FALSE
, which
would save a lot of time.
Otherwise, you will probably want to borrow strength
between the offspring by setting update_geno
to TRUE
.
A list with some or all of the following elements:
opostprob
: A matrix of proportions whose (i, j)th
element is the posterior probability that child j has i - 1
copies of the reference allele. That is, the rows index the
genotype and the columns index the offspring.
p1postprob
: A vector of proportions whose ith element is
the posterior probability that parent 1 has i - 1 copies of the
reference allele. These are derived ONLY from parent 1's
sequence data and not jointly with all of the data.
p2postprob
: A vector of proportions whose ith element is
the posterior probability that parent 2 has i - 1 copies of the
reference allele. These are derived ONLY from parent 2's
sequence data and not jointly with all of the data.
pival
: The estimated proportion of observations that are not outliers.
rho
: The overdispersion parameter.
out_mu
: The outlier distribution mean.
out_rho
: The outlier distribution overdispersion parameter.
p1geno
: The estimated genopype of parent 1.
p2geno
: The estimated genotype of parent 2.
prob_ok
: A vector of proportions. The ith element is the posterior
probability that the ith element is not an outlier.
ogeno
: A vector of integers. The ith element is the estimated genotype
of the ith offspring.
alpha
: The outlier distributions's shape 1 parameter.
beta
: The outlier distributions's shape 2 parameter.
seq_error
: The sequencing error rate used during the updog iterates.
David Gerard
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.