EnvNJ: Whole Genome Phylogenies Using Sequence Environments

Contains utilities for the analysis of protein sequences in a phylogenetic context. Allows the generation of phylogenetic trees base on protein sequences in an alignment-independent way. Two different methods have been implemented. One approach is based on the frequency analysis of n-grams, previously described in Stuart et al. (2002) <doi:10.1093/bioinformatics/18.1.100>. The other approach is based on the species-specific neighborhood preference around amino acids. Features include the conversion of a protein set into a vector reflecting these neighborhood preferences, pairwise distances (dissimilarity) between these vectors, and the generation of trees based on these distance matrices.

Getting started

Package details

AuthorJuan Carlos Aledo [aut, cre] (<https://orcid.org/0000-0002-3497-9945>)
MaintainerJuan Carlos Aledo <caledo@uma.es>
LicenseGPL (>= 2)
Version0.1.3
Package repositoryView on CRAN
Installation Install the latest version of this package by entering the following in R:
install.packages("EnvNJ")

Try the EnvNJ package in your browser

Any scripts or data that you put into this service are public.

EnvNJ documentation built on Sept. 27, 2021, 5:07 p.m.