utilities: Utility functions

Description Usage Arguments Value Author(s) See Also Examples

Description

UniProt uses custom coding of organism names from which protein sequences they store. These taxon names are used also in the protein names (not in the UniProt IDs!). These functions help to translate those names to standard scientific (Latin) taxon names and other useful identifiers.

Usage

1
2
3
4
taxname2species(taxname, specfile) 
taxname2taxid(taxname, specfile) 
taxname2domain(taxname, specfile) 
updatespecfile()

Arguments

taxname

Character string up to 6 uppercase characters, like HUMAN, MOUSE, or AERPX. Also works for a vector of such taxon names.

specfile

An optional local file where speclist.RData is saved from UniProt.org. When specfile is missing, a cached file from the extdata/ package directory is used.

Value

Function taxname2species returns a character vector of scientific taxon names matching to the UniProt taxon names supplied as taxname.

Function taxname2taxid returns a numeric vector of Taxonomy IDs matching to the UniProt taxon names supplied as taxname.

Function taxname2domain returns a character vector of one letter domain symbols matching to the UniProt taxon names supplied as taxname.

Function updatespecfile returns a file location where specfile.txt is downloaded. If the download fails, path is the location of the archived version in the package.

Author(s)

Csaba Ortutay

See Also

UniProt controlled vocabulary of species, which defines the taxon names.

Examples

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
taxname2species("PIG")
taxname2species(c("PIG","HUMAN","TRIHA"))

taxname2taxid("PIG")
taxname2taxid(c("PIG","HUMAN","TRIHA"))

taxname2domain("PIG")
taxname2domain(c("PIG","HUMAN","TRIHA"))

newspecfile <- updatespecfile()
taxname2domain("PIG", specfile = newspecfile)
taxname2domain(c("PIG","HUMAN","TRIHA"), specfile = newspecfile)

UniProt.ws documentation built on Nov. 8, 2020, 5:58 p.m.