rearrvisr: rearrvisr: A package to detect, classify, and visualize...

Description Functions to convert and verify input files Functions to identify and summarize rearrangements Functions to visualize rearrangements Data References Examples

Description

The rearrvisr package provides functions to identify and visualize inter- and intrachromosomal rearrangements between a focal genome and an ancestral genome reconstruction, or two extant genomes. Rearrangements, breakpoints, and synteny blocks are identified along the focal genome and output in a tabular format. Rearrangements and synteny blocks can be visualized along the focal genome by two graphical functions.

Functions to convert and verify input files

Functions to identify and summarize rearrangements

Functions to visualize rearrangements

Data

The package includes example data (MEL_markers, SIM_markers, YAK_markers, and MSSYE_PQTREE_HEUR) generated from 12 publicly available Drosophila genome assemblies downloaded from Ensemble Release 91 (http://dec2017.archive.ensembl.org) and Ensemble Metazoa Release 37 (http://oct2017-metazoa.ensembl.org). Orthologs were identified with OMA standalone v2.2.0 (Altenhoff et al. 2015). MEL_markers, SIM_markers, and YAK_markers are maps of extant genomes from D. melanogaster, D. simulans, and D. yakuba, respectively. MSSYE_PQTREE_HEUR is a genome reconstruction of the ancestor of the melanogaster subgroup (Drosophila 12 Genomes Consortium 2007), computed with the software ANGES (Jones et al. 2012). See the package vignette for details.

References

Altenhoff, A.M. et al. (2015). The OMA orthology database in 2015: function predictions, better plant support, synteny view and other improvements. Nucleic Acids Research, 43, D240–D249. doi: 10.1093/nar/gku1158.

Drosophila 12 Genomes Consortium (2007). Evolution of genes and genomes on the Drosophila phylogeny. Nature, 450, 203–218. doi: 10.1038/nature06341.

Jones, B. R. et al. (2012). ANGES: reconstructing ANcestral GEnomeS maps. Bioinformatics, 28, 2388–2390. doi: 10.1093/bioinformatics/bts457

Examples

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
## verify input format of focal genome:
checkInfile(MEL_markers, "focalgenome", checkorder = TRUE)
## convert ancestral genome reconstruction to PQ-structure:
Comp_genome <- convertPQtree(MSSYE_PQTREE_HEUR)
## alternatively, convert extant genome to PQ-stucture:
## (note that minor scaffolds need to be excluded)
Comp_genome <- genome2PQtree(SIM_markers[is.element(SIM_markers$scaff,
                                                    c("2L", "2R", "3L", "3R", "X")), ])
## verify input format of compared genome:
checkInfile(Comp_genome, "compgenome", checkorder = TRUE)

## identify and summarize rearrangements:
SYNT <- computeRearrs(MEL_markers, Comp_genome, doubled = TRUE)
BLOCKS <- summarizeBlocks(SYNT, MEL_markers, Comp_genome,
                          c("2L", "2R", "3L", "3R", "X"))

## visualize rearrangements:
genomeImagePlot(SYNT, MEL_markers, c("2L", "2R", "3L", "3R", "X"))
genomeRearrPlot(BLOCKS, Comp_genome, c("2L", "2R", "3L", "3R", "X"),
                blockwidth = 1.15, y0pad = 3)

dorolin/rearrvisr documentation built on Aug. 6, 2020, 1:32 a.m.