In venelin/toyepidemic: Simulating epidemics with within-host strain mutation and between-host SIR dynamics

# make results reproducible
set.seed(5)

library(toyepidemic)

# set this to FALSE to disable cache and run all MCMC-fits.
useCachedResults <- file.exists("epid1.RData") & TRUE

Designing the epidemic

Define the pathogen genotypes

We set the numbers of different alleles for each quantitative trait locus (QTL) in the pathogen genotype. These are specified in the form of a integer vector with elements bigger or equal to 2. The length of this vector corresponds to the number of QTLs.

# numbers of alleles for each quantitative trait locus
numsAllelesAtSites <- c(3, 2)

# Define a matrix of the possible genotype encodings (as allele contents).
genotypes <- generateGenotypes(numsAllelesAtSites)
print(genotypes)

# Define the probability of the first infecting strain (the first pathogen strain starting the epidemic)
pg.init <- rep(0, nrow(genotypes))
pg.init[1] <- 1 # we specify that 1 is the first strain with probability 1.

Define the number and the frequencies of host-types

# number of host-types
n <- 6

# probability of each host type in a susceptible compartment 
pe <- runif(n)
pe <- pe/sum(pe) # ensure that they sum-up to 1

Define the GE-values

The GE-values represent the general genotype x host-type effects. A genotype x host-type effect is the mean trait value of an individual of a given host-type carrying a given pathogen strain in the absence of selection.

# General host-type x strain effects (expected phenotypes for genotype by environment combinations)
GEVs <- matrix(NA, nrow=n, ncol=nrow(genotypes))

# assign random values to each genotype x host-type combination:
for(g in 1:nrow(genotypes)) {
  GEVs[, g] <- rnorm(n=n, mean=2+2/nrow(genotypes)*(g), sd=0.4)
}

To get a visual idea of the different GE-values, we can plot them in the form of an R-matplot:

matplot(GEVs, type='l', lty=1, col=1:6)

Define the host-specific effects distribution

The final phenotype of every infected host is the sum of the GE-value for its currently infecting strain and a random host-specific effect. The toyepidemic package assumes that the host-specific effect is a normally distributed random variable drawn exactly once for each possible infecting strain in a newly infected host. This normal distribution has mean 0 and standard deviation defined by the following parameter $\sigma_e$:

# is the special environmental effect unique for each pathogen genoetype in an individual
eUniqForEachG <- TRUE

sigmae <- .6
# it is possible to specify different standard deviations for the different host-types, i.e. different host-types exhibit stronger or weaker effect on the trait. For this example we keep them fixed for all host-types.
sde <- rep(sigmae, n)

Defining the population parameters

At the between-host level we start by defining the size, the birth- and the natural death rate in the population at equilibrium. These are constants:

#initial population size (equilibrium in the absence of disease)
N <- 1e5

# setting the between-host and within-host dynamics of the simulation:
# natural death rate
mu <- 1/850

# constant birth rate that maintains this equilibrium
nu <- ifelse(is.finite(N), mu*N, 0)

Next, we define the rate parameters for infected hosts:

Infected death-rate

# death-rate as a function of viral load and natural death rate mu
rateDie <- function(z, mu) {
  V <- 10^z
  Dmin <- 2
  Dmax <- 25*12
  D50 <- 10^3
  Dk <- 1.4
  (V^Dk+D50^Dk)/(Dmin*(V^Dk+D50^Dk)+((Dmax-Dmin)*D50^Dk)) + mu
}

curve(rateDie(x, mu), 0, 7, col="red")

Transmission-rate as a function of viral load and rate of risky contacts with susceptible hosts

rateContact <- 1/6

# note that at runtime, rateInfect is scaled by the proportion of susceptible hosts
rateInfect <- function(z, rateContact) {
  V <- 10^z
  Emin <- .3
  Emax <- .6
  E50 <- 10^3
  Ek <- 1.4
  E <- Emin+(Emax-Emin)*V^Ek/(V^Ek+E50^Ek)
  E*rateContact
}

curve(rateInfect(x, mu), 0, 7, col="red")

Within-host strain mutation rate

We define a function rateMutate which calculates the within-host per-locus mutation rate for a number K of infected hosts. The function recieves the GEVs matrix as an argument and 3 vectors of length K as follows:

es: numeric : values of currently active host-specific effects;
envs: integer : host types for the K infected hosts;
genes: integer : currently infecting strains for the K hosts.

In addition, we specify the mode of within-host evolution (in this case - selection)

# per locus mutation rates
rateMutate <- function(GEValues, es, envs, genes) {
  z <- GEValues[cbind(envs, genes)] + es
  V <- 10^z
  Mmin <- 0.00
  Mmax <- 0.2
  M50 <- 10^3
  Mk <- 1.4
  Mmin+(Mmax-Mmin)*V^Mk/(V^Mk+M50^Mk)
}

# are only beneficial (i.e. increasing the trait-value) mutations allowed
selectWithinHost <- TRUE

Sampling rate for an infected host

The event of sampling represents diagnosis and immediate treatment and lifelong immunity and un-infectiousness for the host. For this example, we define a constant sampling rate, independent of the phenotype.

rateSample <- 1/(4*12)

Define the time step and the duration of the simulation

# all events are sampled with this time-step. This means that only one event can happen for an infected host within every next interval of 0.05 (arbitrary time units).
timeStep <- 0.05

# maximum time before starting graceful fadeout of the epidemic (stop the
# transmission events and wait until no more infected hosts live in the population)
maxTime <- 200

# continue the epidemic outbreak until reaching maxNTips diagnosed hosts
maxNTips <- 1000

# time to continue the simulation of transmission after reaching maxNTips 
# (this was introduced in order to study post-outbreak dynamics, i.e. epidemic
# waves after exhaustion of the susceptible pool)
expandTimeAfterMaxNTips <- 0

Running the simulation

To run the simulation we use the function simulateEpidemic with the parameters specified as above:

epidemic <- simulateEpidemic(
  Ninit=N, nu=nu, mu=mu, pe=pe, sde=sde, pg.init=pg.init, GEValues=GEVs,
  rateContact=1/6, rateInfect=rateInfect, rateDie=rateDie,
  rateSample=rateSample,
  rateMutate=rateMutate,
  numsAllelesAtSites=numsAllelesAtSites, eUniqForEachG=eUniqForEachG,
  selectWithinHost=TRUE,
  timeStep=timeStep, maxTime=maxTime, maxNTips=maxNTips,
  expandTimeAfterMaxNTips=expandTimeAfterMaxNTips,
  process="select/select")

# loads the epidemic object
load("epid1.RData")

Analysing the epidemic

Here we glimpse over some of the functions used in analyzing the simulated epidemic. A more elaborate analysis is provided in the vignette for the package patherit.

Extracting the transmission tree connecting sampled host

tree <- extractTree(epidemic)

plot(ape::ladderize(tree), show.tip.label = FALSE)

Extract the population of sampled patients

# extract the population of all sampled individuals
pop <- extractPop(epidemic, ids = tree$tip.label)

# calculate their phenotypic values at the moment of diagnosis
pop[, z:=calcValue(env, gene, e, GEValues = epidemic$GEValues)]

hist(pop[, z])

Extract donor-recipient couples

drc1 <- extractDRCouples(epidemic)

Saving the epidemic for future analysis

save(epidemic, file="epid1.RData")

Packages used

fastExecution <- c("Rcpp")
treeProcessing <- c("ape")
data <- c("data.table")
testing <- c("testthat")

packagesUsed <- c(fastExecution, treeProcessing, data, testing)

printPackages <- function(packs) {
  res <- ""
  for(i in 1:length(packs)) {
    res <- paste0(res, paste0(packs[i], ' v', packageVersion(packs[i]), ' [@R-', packs[i], ']'))
    if(i < length(packs)) {
      res <- paste0(res, ', ')
    }
  }
  res
}

# Write bib information (this line is executed manually and the bib-file is edited manually after that)
# knitr::write_bib(packagesUsed, file = "./REFERENCES-R.bib")

Apart from base R functionality, the toyepidemic package uses a number of 3rd party R-packages:

For fast execution: r printPackages(fastExecution);
For tree processing: r printPackages(treeProcessing);
For reporting: r printPackages(data);
For testing: r printPackages(testing).

References

venelin/toyepidemic documentation built on May 29, 2019, 11:46 a.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

venelin/toyepidemic
Simulating epidemics with within-host strain mutation and between-host SIR dynamics

In venelin/toyepidemic: Simulating epidemics with within-host strain mutation and between-host SIR dynamics

Designing the epidemic

Define the pathogen genotypes

Define the number and the frequencies of host-types

Define the GE-values

Define the host-specific effects distribution

Defining the population parameters

Infected death-rate

Transmission-rate as a function of viral load and rate of risky contacts with susceptible hosts

Within-host strain mutation rate

Sampling rate for an infected host

Define the time step and the duration of the simulation

Running the simulation

Analysing the epidemic

Extracting the transmission tree connecting sampled host

Extract the population of sampled patients

Extract donor-recipient couples

Saving the epidemic for future analysis

Packages used

Further reading

References

R Package Documentation

Browse R Packages

We want your feedback!

venelin/toyepidemic Simulating epidemics with within-host strain mutation and between-host SIR dynamics

In venelin/toyepidemic: Simulating epidemics with within-host strain mutation and between-host SIR dynamics

Designing the epidemic

Define the pathogen genotypes

Define the number and the frequencies of host-types

Define the GE-values

Define the host-specific effects distribution

Defining the population parameters

Infected death-rate

Transmission-rate as a function of viral load and rate of risky contacts with susceptible hosts

Within-host strain mutation rate

Sampling rate for an infected host

Define the time step and the duration of the simulation

Running the simulation

Analysing the epidemic

Extracting the transmission tree connecting sampled host

Extract the population of sampled patients

Extract donor-recipient couples

Saving the epidemic for future analysis

Packages used

Further reading

References

R Package Documentation

Browse R Packages

We want your feedback!

venelin/toyepidemic
Simulating epidemics with within-host strain mutation and between-host SIR dynamics