Description Usage Arguments Value Author(s) References
Takes a list of genome fasta files and perform blastn searches to identify the sequence type for each of the genes/loci available in a mlst scheme.
1 2 3 4 5 6 7 8 9 10 11 12 13 |
infiles |
A vector of genome sequence file paths in fasta format. Could also be multifasta files of predicted Open Reading Frames. |
org |
A valid organism from pubmlst.org. Run listPubmlst_orgs to see available ones. |
scheme |
|
schemeFastas |
A vector with the path to fasta sequences from each
loci of the specified mlst scheme. If it is |
schemeProfile |
The path to the profile file (.tab). If left |
write |
|
ddir |
A non-existing directory where to download the loci fasta files
in case they are not provided by the user. Default:
|
fdir |
A non-existing directory where to write fasta files of found
sequences (see |
n_threads |
|
pid |
Percentage identity threshold to be consider as a novel allele. An
|
scov |
Subject coverage threshold to be consider as a novel allele. A
|
An object of class "mlst", which consists on a list of 2 dataframes.
The first one is the alleles called for the infiles
and the ST
detected, whereas the second is the scheme profile. The "result" data.frame
shows one genome per row and one gene from the selected scheme per column.
The last column is the ST detected for each genome. "NA" values means that
no allele were found in the respective genome. A 'u' (from 'unknown') plus
an integer means that the allele found was not reported in pubmlst.org
database; a fasta file with the new allele is written in this case if
"write"
is set to "new" or "all" (see above). New alleles are
compared with each other so the names are properly chosen in case 2 or more
of those novel alleles were the same (in that cases, the names would be the
same). A number indicates the allele id number of the reported alleles in
pubmlst.org . The second data.frame is the profile definition chosen, the
logic is the same as the first one but in this case rows are the combination
of alleles reported at pubmlst.org . A series of attributes are also given
in the "mlst" object, mainly refering to the parameters used by the function.
Ignacio Ferres
Altschul, Gish, Miller, Myers & Lipman. (1990) "Basic local alignment search tool." J. Mol. Biol. 215:403-410.
Jolley KA, Bray JE, Maiden MCJ. A RESTful application programming interface for the PubMLST molecular typing and genome databases. Database: The Journal of Biological Databases and Curation. 2017;2017:bax060. doi:10.1093/database/bax060.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.