camilagazolla/ProKlust: Prokariotic Clustering based on Genome and Sequence Identity/Similarity Matrices

ProKlust (“Prokaryotic Clusters”) was written with a focus on taxonomical data. It obtains, filters and visualizes clusters from multiple identity/similarity matrices using maximal clique enumeration (MCE) with settable cut-off points. The input pairwise identity/similarity matrix ) is formatted into triangular matrix using the average. Then, the matrix is formatted using the cut-off values chosen by the user, replacing values according to the criteria. If more than one matrix is used as input, each of the generated logical matrices (boolean) is multiplied. Afterwards, the igraph object is formed by the connection the nodes (for example, genomes or genes) which present the value 1. It uses a modified Bron–Kerbosch algorithm on “igraph”. ProKlust allows the modification of graph data with the use of specific filters (or a combination/intersection of them) and renaming of nodes. It generates relevant cluster data as output and allows the possibility of graph visualization using the networkD3 library.

Getting started

Package details

Maintainer
LicenseGPL (>= 3)
Version0.1.1
Package repositoryView on GitHub
Installation Install the latest version of this package by entering the following in R:
install.packages("remotes")
remotes::install_github("camilagazolla/ProKlust")
camilagazolla/ProKlust documentation built on May 22, 2021, 11:10 p.m.