index.md

MALECOT

Version 0.1.1

Travis-CI Build Status AppVeyor Build Status Coverage Status Documentation

MALECOT is an R package for inferring population structure from genetic data. It is tailored to problems in malaria where each sample commonly contains multiple genotypes. The basic model assumes that there are K subpopulations represented in the data, and that we are interested in working out which samples belong to which subpopulations, similar to programs like STRUCTURE. The analysis works via Bayesian Markov chain Monte Carlo, alternately estimating each of three unknown parameters:

  1. Allele frequencies in all subpopulations
  2. Group allocation, i.e. which samples belong to which subpopulations
  3. Complexity of infection (COI) of each sample, i.e. how many genotypes are represented in each sample

The first two parameters are estimated by programs like STRUCTURE, but the third parameter is not, making STRUCTURE not ideally suited to malaria data where complex infections are common. There are also more complex priors available in MALECOT, and the ability to estimate to estimate K via a technique called generalised thermodynamic integration (GTI).

To get started, take a look at the installation instructions, followed by a basic tutorial on running the program.



bobverity/MALECOT documentation built on May 13, 2019, 4:01 a.m.