screening | R Documentation |
This function annotate virulence factors, antibiotic resitance genes and/or biocide and metals
of the genomes (files
). The annotation is performed using
mmseqs2 software for protein sequences (https://github.com/soedinglab/MMseqs2)
or Minimap2 (https://github.com/lh3/minimap2) for nucleotide (wgs or nucl).
screening(
data,
type = "nucl",
database = c("AbR", "VF_A", "VF_B"),
query = "all",
n_cores
)
data |
A |
type |
user must be specified if the data set is nucleotide or protein. |
database |
A vector with the query databases:
|
query |
"all" or "accessory". It perform the annotation from whole protein dataset or just from the accessory |
Databases
Virulence Factor DataBase (Set_A and Set_B) (http://www.mgc.ac.cn/cgi-bin/VFs/v5/main.cgi)
ResFinder (https://cge.cbs.dtu.dk/services/ResFinder/).
BacMet (http://bacmet.biomedicine.gu.se/).
The function can re-use the
previous computational steps of mmseqs
or create a new index database from
the files. Re-use option shorten the computational time. This method use the algorithm
search of mmseqs2 so it olny return high identity matchs.
A data.frame
with the annotation information
Genome: Genome query
Protein: Proteins query
target: Protein subject (AbR o VF)
pident: Percentage of identical matches
alnlen: Alingment length
mismatch: number of mismatchs
gapopen: number of gaps
qstart: query start alingment
qend: query end alingment
tstart: target start alingment
tend: target end alingment
evalue: evalue
bits: Bitscore
DataBase: Database (AbR, VF_A or VF_B, bacmet)
Gene: Gene name
Description: Functional annotation (VF) or category (AbR)
Keep in mind that the results from accesory are based on the annotation of the representative protein of the homologous cluster and therefore does not mean that all the genomes have the same allele of the gene.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.