Retrieve Protein Sequence in various Formats from Databases

Share:

Description

Retrieve Protein Sequence in various Formats from Databases

Usage

1
2
getProt(id, from = c("uniprot", "kegg", "pdb"), type = c("fasta", "pdb",
  "aaseq"), parallel = 5)

Arguments

id

A character vector, as the protein ID(s).

from

The database, one of 'uniprot', 'kegg', or 'pdb'.

type

The returned protein format, one of fasta, pdb, or aaseq.

parallel

An integer, the parallel parameter, indicates how many process the user would like to use for retrieving the data (using RCurl), default is 5. For regular cases, we recommend a number less than 20.

Details

This function retrieves protein sequence in various formats from three databases.

Value

A length of id character list, each element containing the corresponding protein sequence(s) or file(s).

Author(s)

Nan Xiao <http://nanx.me>

See Also

See getDrug for retrieving drug molecules from five databases.

Examples

1
2
3
id = c('P00750', 'P00751', 'P00752')

getProt(id, from = 'uniprot', type = 'aaseq')

Want to suggest features or report bugs for rdrr.io? Use the GitHub issue tracker.