dw974/MyCoV: Predict classifications of partial RdRp Coronavirus sequences

MyCoV is essentially a wrapper for blastn, which it calls to query a database of partial RdRp Coronavirus sequences. The sequences in the database have been classified by "predicted subgenus" based on the level of phylogenetic support for different clade groups in a Bayesian phylogenetic inference and on the relative classifications of holotype reference sequences, as detailed in the associated manuscript. MyCoV takes any input FASTA file (single or multiple sequences) and returns the best hit(s) from the database. It reports the level of certainty with which the best-matching sequence could be classified by subgenus (again, according to the manuscript) and the pairwise identity between each query sequence and its matched reference sequence. To help you judge whether your sequences are likely to belong to the same subgenus, it can also plot the calculated pairwise distances against all pairwise distances for within- and between-subgenus comparisons in the different genera. Host and country-of-origin metadata are also associated with the sequence database. Sequence similarity can be mapped to the phylogeny from the article, with a visual summary of the similarities and the associated metadata.
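The "query with blastn, keep the best hit(s)" step described above can be illustrated with a minimal sketch in base R. This is not MyCoV's own code or API: the function names `parse_blast_tab`, `best_hits` and `run_blastn` are hypothetical, and the sketch assumes blastn is run with its standard tabular output (`-outfmt 6`) and that "best" means the highest bit score per query, with ties kept.

```r
# Default column order of BLAST tabular output (-outfmt 6).
blast_cols <- c("qseqid", "sseqid", "pident", "length", "mismatch", "gapopen",
                "qstart", "qend", "sstart", "send", "evalue", "bitscore")

# Parse tab-separated BLAST output lines into a data frame.
# Returns a zero-row data frame when there are no hits.
parse_blast_tab <- function(lines) {
  if (length(lines) == 0) {
    empty <- data.frame(matrix(nrow = 0, ncol = length(blast_cols)))
    return(setNames(empty, blast_cols))
  }
  read.table(text = lines, sep = "\t", col.names = blast_cols,
             stringsAsFactors = FALSE, quote = "", comment.char = "")
}

# Keep the highest-bitscore hit(s) for each query sequence (ties are retained).
# Assumption: "best hit" is defined by bit score; MyCoV may use another rule.
best_hits <- function(hits) {
  if (nrow(hits) == 0) return(hits)
  top <- ave(hits$bitscore, hits$qseqid, FUN = max)
  hits[hits$bitscore == top, , drop = FALSE]
}

# Hypothetical wrapper: run blastn on a FASTA file against a local nucleotide
# database and return the best hit(s) per query. Requires blastn on the PATH.
run_blastn <- function(query_fasta, db_path) {
  out <- system2("blastn",
                 args = c("-query", shQuote(query_fasta),
                          "-db", shQuote(db_path),
                          "-outfmt", "6"),
                 stdout = TRUE)
  best_hits(parse_blast_tab(out))
}

# Example with made-up tabular lines (no blastn call needed):
example_lines <- c(
  "q1\tref_A\t98.5\t400\t6\t0\t1\t400\t1\t400\t0.0\t700",
  "q1\tref_B\t90.0\t400\t40\t0\t1\t400\t1\t400\t1e-150\t520",
  "q2\tref_C\t95.0\t380\t19\t0\t1\t380\t5\t384\t0.0\t600"
)
example_best <- best_hits(parse_blast_tab(example_lines))
```

In this sketch the `pident` column plays the role of the pairwise identity reported for each query. The subgenus classification, certainty levels and metadata would come from the package's own database and are not modelled here.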

Package details

Author: David A Wilkinson
Bioconductor views: Biostrings, DECIPHER
Maintainer: David A Wilkinson <dwilkin799@gmail.com>
License: MIT + file license
Version: 0.1.0
Package repository: GitHub (dw974/MyCoV)
Installation

Install the latest version of this package by entering the following in R:
install.packages("remotes")
remotes::install_github("dw974/MyCoV")
dw974/MyCoV documentation built on Oct. 9, 2020, 7:29 p.m.