This vignette shows a case study of influenza data from nine geographic regions, using phylodyn
.
We start by loading the phylodyn
package.
library(phylodyn)
In preparation, we aligned the sequences by region using the software MUSCLE,
and inferred a maximum clade credibility genealogy for each region using the software BEAST[^1].
We packaged the genealogies in a list of phylo
objects regional_flu
, and we load it now.
[^1]: We infer the genealogy branch lengths in units of years using a strict molecular clock, a constant effective population size prior, and an HKY substitution model with the first two nucleotides of a codon sharing the same estimated transition matrix, while the third nucleotide's transition matrix is estimated separately.
data(regional_flu)
We supply region names and year labels.
mains <- list(USACanada = "USA / Canada", Europe = "Europe", NorthChina = "NorthChina", JapanKorea = "Japan / Korea", India = "India", SouthChina = "South China", SouthAmerica = "South America", SoutheastAsia = "Southeast Asia", Oceania = "Oceania") axlabs <- list(USACanada = list(x = seq(0, 14, by=1), labs = 2012 - seq(0, 14, by=1)), Europe = list(x = seq(0, 12, by=1), labs = 2011 - seq(0, 12, by=1)), NorthChina = list(x = seq(0, 12, by=1), labs = 2011 - seq(0, 12, by=1)), JapanKorea = list(x = seq(0, 13, by=1), labs = 2012 - seq(0, 13, by=1)), India = list(x = seq(0, 12, by=1), labs = 2011 - seq(0, 12, by=1)), SouthChina = list(x = seq(0, 12, by=1), labs = 2011 - seq(0, 12, by=1)), SouthAmerica = list(x = seq(0, 12, by=1), labs = 2012 - seq(0, 12, by=1)), SoutheastAsia = list(x = seq(0, 14, by=1), labs = 2012 - seq(0, 14, by=1)), Oceania = list(x = seq(0, 12, by=1), labs = 2011 - seq(0, 12, by=1)))
We select a region (more than one can be chosen) and use BNPR and BNPR_PS.
regions <- list("USACanada") results <- list() for (region in regions) { results[[region]] <- list(bnpr = BNPR(data = regional_flu[[region]], lengthout = 100), bnpr_ps = BNPR_PS(data = regional_flu[[region]], lengthout = 100)) }
We plot the results below.
for (region in regions) { par(mfrow=c(1, 3), cex=0.8, cex.lab=1.5, cex.main=1.4, oma=c(2.5, 2, 0, 0)+0.1, mar=c(2, 1.5, 2, 1), mgp = c(2.5, 1, 0), xpd=NA, fig=c(0, 0.32, 0, 1)) plot_BNPR(results[[region]]$bnpr, main=paste(mains[[region]], "BNPR"), ylim = c(0.1, 30), axlabs = axlabs[[region]], col = rgb(0.829, 0.680, 0.306), heatmap_labels_side = "left") par(fig=c(0.32, 0.64, 0, 1), new=TRUE) plot_BNPR(results[[region]]$bnpr_ps, main=paste(mains[[region]], "BNPR-PS"), ylab = "", ylim = c(0.1, 30), axlabs = axlabs[[region]], col = rgb(0.330, 0.484, 0.828), heatmap_labels = FALSE) par(mar=c(2, 3.5, 2, 1), fig=c(0.64, 1.0, 0, 1), new=TRUE) plot_mrw(BNPR_outs = results[[region]], axlabs = axlabs[[region]], main="", cols = c(rgb(0.829, 0.680, 0.306), rgb(0.330, 0.484, 0.828)), ylab = "Mean Relative Width", legends = c("BNPR", "BNPR-PS"), legend_place = "topright", legend_cex = 0.8) }
R. C. Edgar. MUSCLE: Multiple sequence alignment with high accuracy and high through- put. Nucleic Acids Research, 32:1792–1797, 2004.
A. J. Drummond, M. A. Suchard, D. Xie, and A. Rambaut. Bayesian phylogenetics with BEAUti and the BEAST 1.7. Molecular Biology and Evolution, 29:1969–1973, 2012.
D. Zinder, T. Bedford, E. B. Baskerville, R. J. Woods, M. Roy, and M. Pascual. Seasonality in the migration and establishment of H3N2 influenza lineages with epidemic growth and decline. BMC Evolutionary Biology, 14(1):272, 2014.
M. D. Karcher, J. A. Palacios, T. Bedford, M. A. Suchard, and V. N. Minin. Quantifying and mitigating the effect of preferential sampling on phylodynamic inference. arXiv preprint arXiv:1510.00775, 2015.
Any scripts or data that you put into this service are public.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.