knitr::opts_chunk$set(
  collapse = TRUE,
  comment = "#>",
  eval = FALSE
)
knitr::opts_knit$set(root.dir = "example_dir/DDvTD")

DDvTDtools is an R package accompanying the manuscript:

Pannetier, T., Martinez, C., Bunnefeld, L., and Etienne, R.S. Branching patterns in phylogenies cannot distinguish diversity-dependent diversification from time-dependent diversification

The package consists in a collection of functions that were used to generate the results presented in Pannetier et al. (2021).

Note that the simulations and maximum likelihood optimisations call upon the integration of the DD master system introduced in Etienne et al. (2012), and this can take an awful lot of time. This is specially true for parameter settings with an old age (larger trees) and with extinction, and the computation is heavier for the time-dependent model.

Package installation

install.packages("DDD")
devtools::install_github("TheoPannetier/DDvTDtools")

Set up directory structure

DDvTDtools expects to be working from a folder named DDvTD, and expects to find the data read by its functions at data/sim (simulated trees) and data/optim (maximum likelihood parameter estimation results).

The command below will set up the required directory structure in your working directory.

DDvTDtools::set_DDvTD_dir_struct()

Step 1 - Simulate phylogenetic trees

Diversity-dependent and time-dependent trees can be simulated with DDvTDtools::run_sim(), which is simply a wrapper around DDD::dd_sim() and DDD::td_sim() that standardizes the input and formats the output.

DDvTDtools::run_sim(
  sim = "DD",
  para = 1211,
  nb_trees = 1000
)

The first argument, sim refers to the name of the simulating model, either "DD" or "TD". The second argument, para, is a four-digit number that codes the values of the four parameters (crown age, baseline speciation rate, extinction rate and carrying capacity) used in the study. 1211 for example means

(This is the fastest setting to simulate and run maximum likelihood optimisation on.)

Calling arg_para() will print the different values used through the study.

DDvTDtools::arg_para()

And what values they code can be found in the documentation

?DDvTDtools::arg_para()

It is also possible to directly translate the code into proper values:

DDvTDtools::para_to_pars(1211)

Simulated trees are stored in DDvTD/data/sim, and can be loaded with read_sim()

trees <- DDvTDtools::read_sim(sim = "DD", para = 1211)
trees[[1]][[1]]

Step 2 - Fit the models on simulated trees

Once phylogenetic trees have been simulated, each model should be fitted to each type of trees. This is done by DDD::dd_ML() and DDD::bd_ML(), but again, we have used a wrapper to standardise the input and output.

DDvTDtools::run_optim(
  sim = "DD", 
  para = 1211, 
  optim = "DD"
)
DDvTDtools::run_optim(
  sim = "DD", 
  para = 1211, 
  optim = "TD"
)

Argument optim is equivalent to argument sim introduced above. run_optim() will fetch the simulated trees from DDvTD/data/sim and save its output in DDvTD/data/optim. You can load these tables with

df <- DDvTDtools::read_optim_results(
  sim = "DD",
  para = 1211,
  optim = "TD",
  init_k = "true_k"
)
head(df)

The structure of a results data frame with a description of all the variables can be found in the documentation of DDvTDtools::results_optim_struct(). init_k denotes which value of parameter K was used to initialise the maximum likelihood optimisation. The default value, true_k, means the true value, i.e. 40. The alternative is from_n, where K was instead set to the number of tips in the tree. This is for some settings which produced large trees, for which the likelihood optimisation proved particularly tedious. The values of init_k used for each setting can be called with:

DDvTDtools::get_init_k()

Optimisation using init_k = from_n is run with another function, which itself calls run_optim()

DDvTDtools::run_optim_from_n(
  sim = "DD", 
  para = 1211, 
  optim = "TD"
)

Step 3 - Plot results

The plots below draw results from both models fit to both type of trees, and so require that the four combinations of sim and optim for a single value of para are present in DDvTD\data\optim.

Fig. 2 - Average lineage-through-time plots

DDvTDtools::plot_ltt_nested(para = 1211)

Fig. 3 - log-likelihood ratio distributions

DDvTDtools::plot_lr(para = 1211)


TheoPannetier/DDvTDtools documentation built on Oct. 22, 2020, 2:31 p.m.