MSCquartets-package: Multispecies Coalescent Model Quartet Package

MSCquartets-packageR Documentation

Multispecies Coalescent Model Quartet Package


A package for analyzing quartets displayed on gene trees, under the multispecies coalescent (MSC) model and network multispecies coalescet model (NMSC).


This package contains routines to analyze a collection of gene trees through the displayed quartets on them.

A quartet count concordance factor (qcCF) for a set of 4 taxa is the triple of counts of the three possible resolved quartet trees on those taxa across some set of gene trees. The major routines in this package can:

  1. Tabulate all qcCFs for a collection of gene trees.

  2. Perform hypothesis tests of whether one or more qcCFs are consistent with the MSC model on a species tree \insertCiteMAR19MSCquartets.

  3. Produce simplex plots showing all estimated CFs as well as results of hypothesis tests \insertCiteAMR2020MSCquartets.

  4. Infer a species tree using the qcCFs via the QDC and WQDC methods \insertCiteRho19,YR19MSCquartets.

  5. Infer a level-1 species network via the NANUQ method \insertCiteABR19MSCquartets.

  6. Infer the tree of blobs for a species network via the TINNiK method \insertCiteABMR22MSCquartets,\insertCiteABMR23MSCquartets.

As discussed in the cited works, the inference methods for species trees and networks are statistically consistent under the MSC and Network MSC respectively.

This package, and the theory on which it is based, allows gene trees to have missing taxa (i.e., not all gene trees display all the taxa). It does require that each subset of 4 taxa is displayed on at least one of the gene trees.

Several gene tree data sets, simulated and empirical, are included.

In publications please cite the software announcement \insertCiteRBMA2020MSCquartets, as well as the appropriate paper(s) above developing the theory behind the routines you used.


















MSCquartets documentation built on Nov. 2, 2023, 5:32 p.m.