utilities | R Documentation |
UniProt uses a custom coding for the names of the organisms whose protein sequences it stores. These taxon names are also used in the protein names (but not in the UniProt IDs!). These functions help translate those names into standard scientific (Latin) taxon names and other useful identifiers.
taxname2species(): converts UniProt taxonomy names to scientific species names
taxname2taxid(): converts UniProt taxonomy names to NCBI Taxonomy IDs
taxname2domain(): converts UniProt taxonomy names to the following taxonomical domains:
'A' for archaea (=archaebacteria)
'B' for bacteria (=prokaryota or eubacteria)
'E' for eukaryota (=eukarya)
'V' for viruses and phages (=viridae)
'O' for others (such as artificial sequences)
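The one-letter symbols above can be expanded into readable labels with a named vector. This is a minimal sketch; `domain_labels` and `expand_domain` are hypothetical names used for illustration and are not part of the package.

```r
# Hypothetical helper, not part of the package: expand one-letter
# domain symbols into readable labels.
domain_labels <- c(
  A = "archaea",
  B = "bacteria",
  E = "eukaryota",
  V = "viruses and phages",
  O = "others"
)

expand_domain <- function(codes) {
  # unname() drops the symbol names so only the labels are returned;
  # symbols that are not in the table come back as NA.
  unname(domain_labels[codes])
}

expand_domain(c("E", "B", "V"))
```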
taxname2species(taxname, specfile)
taxname2taxid(taxname, specfile)
taxname2domain(taxname, specfile)
taxname |
A character string of up to 6 uppercase characters, such as HUMAN, MOUSE, or AERPX. A vector of such taxon names also works. |
specfile |
An optional local file where speclist.RData is saved from
UniProt.org. When |
taxname2species: a character vector of scientific taxon names matching the UniProt taxon names supplied as taxname.
taxname2taxid: a numeric vector of Taxonomy IDs matching the UniProt taxon names supplied as taxname.
taxname2domain: a character vector of one-letter domain symbols matching the UniProt taxon names supplied as taxname.
Csaba Ortutay
UniProt controlled vocabulary of species, which defines the taxon names.
taxname2species("PIG")
taxname2species(c("PIG","HUMAN","TRIHA"))
taxname2taxid("PIG")
taxname2taxid(c("PIG","HUMAN","TRIHA"))
taxname2domain("PIG")
taxname2domain(c("PIG","HUMAN","TRIHA"))
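To illustrate the kind of vectorized lookup these functions perform, here is a rough sketch in base R that parses a few lines written in the layout of UniProt's species list and looks names up with `match()`. The line layout (code, domain letter, Taxonomy ID, `N=` scientific name) is an assumption about the source file, and `spec_lines`, `spec_table`, and `lookup_species` are hypothetical names; this is not the package's own implementation.

```r
# Illustrative lines in the assumed layout of UniProt's species list:
# code, domain letter, Taxonomy ID followed by ":", then "N=" scientific name.
spec_lines <- c(
  "HUMAN E    9606: N=Homo sapiens",
  "MOUSE E   10090: N=Mus musculus",
  "PIG   E    9823: N=Sus scrofa"
)

# Capture groups: 1 = taxon name, 2 = domain, 3 = Taxonomy ID, 4 = species.
pattern <- "^([A-Z0-9]+) +([ABEVO]) +([0-9]+): N=(.*)$"
parts <- regmatches(spec_lines, regexec(pattern, spec_lines))

# Element 1 of each match is the full line, so the groups start at index 2.
spec_table <- data.frame(
  taxname = vapply(parts, `[`, character(1), 2),
  domain  = vapply(parts, `[`, character(1), 3),
  taxid   = as.numeric(vapply(parts, `[`, character(1), 4)),
  species = vapply(parts, `[`, character(1), 5),
  stringsAsFactors = FALSE
)

# Hypothetical lookup: match() is vectorized, so a vector of taxon names
# works as well as a single string; unknown names yield NA.
lookup_species <- function(taxname, tbl = spec_table) {
  tbl$species[match(taxname, tbl$taxname)]
}

lookup_species(c("PIG", "HUMAN"))
```

Using `match()` keeps the result aligned with the input vector, which mirrors the documented behaviour of returning one value per supplied taxon name.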