Description Usage Arguments Value Author(s) References Examples
This Method uses Markov chain Monte Carlo to estimate time to the common ancestor for a beneficial allele.
1 2 3 4 5 |
vcf.file |
a vcf file that includes genotypes of individuals at genomic region of interest surrounding the putatively selected. A 1 Mb region is recommended (500kb flanking both sides of the seleced allele). This file should contain individuals with and without the selected allele. |
rec.file |
a text file containing a recombination map with 3 columns: (1) chromosome, (2) position i, and (3) recombination rate between position i and i+1. This may include the recombination map for the entire genome. if rec.file = NULL, a uniform recombination rate is used from a specified rec.rate parameter value. |
mut.rate |
the mutation rate per basepair per generation. |
rec.rate |
the recombination rate per basepair per generation. |
nsel |
the sample size for the number of chromosomes with the selected allele to use. |
nanc |
the sample size for the reference panel of chromosomes without the selected allele. |
chain.length |
the number of iterations for the MCMC to run. |
proposal.sd |
the standard deviation for the proposal distribution of the time to the common ancester (TMRCA). Where t is the current parameter value for the tmrca, proposals for a new value, t', are as follows: t' = t + rnorm(1,0,proposal.sd) |
nanc.post |
the number of values from the end of the chain to use for the posterior distribution of the ancestral haplotype. |
sample.ids |
a single column text file for individuals in the "selected population." if sample.ids = NULL, all individuals in the vcf are used. |
refsample.ids |
a single column text file for individuals in the "reference population." if refsample.ids = NULL, reference individuals are used from sample.ids. |
pos |
the position of the putatively selected allele. |
sel.allele |
specifies which allele is the selected allele. Takes value 0 or 1. |
bed.file |
uses a bed file to exclude sites that have not been sequenced in the VCF file. |
upper.t.limit |
an upper limit from which to draw a uniformly distributed initial value for t before starting the MCMC. |
output.file |
a character vector with a file path. If NULL, a default path will be used (VCF file name with "_mcmc_list.RDATA" extension). If NA, no file will be created. |
The function will return a list of two elements:
t.chain |
a four column matrix of the MCMC output for the TMRCA. The number of rows is equal to the number specified by the chain.length argument. The first column is the proposed TMRCA value, the second column is the emission probability of the current TMRCA and ancestral haplotype values. The third column is the Metropolis-Hastings ratio. The fourth column indicates whether the proposed value was accepted or not. |
anchap.chain |
a sample from the posterior distribution of the ancestral haplotype. The number of rows is specified by the nanc.post argument, and indicates the size of the posterior sample from the end of the chain. |
The output of the function can be assigned to an object independently of being saved in a file.
Joel Smith
Joel Smith, Graham Coop, Matthew Stephens and John Novembre (2018): Estimating Time to the Common Ancestor for a Beneficial Allele. Molecular Biology and Evolution. https://doi.org/10.1093/molbev/msy006
1 2 3 4 5 6 7 8 9 10 11 12 13 14 | ## Not run: run.startmrca(vcf.file = "examples/FIN_chr2_pos136608646.vcf.gz",
rec.file = "examples/decode_recmap_sexaveraged.txt",
sample.ids = "examples/sample_ids.txt",
refsample.ids = "examples/sample_ids.txt",
mut.rate = 1.6e-8,
nsel = 50,
nanc = 20,
chain.length = 20,
nanc.post = 10,
pos = 136608646,
sel.allele = 1)
## End(Not run)
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.