read.proteome: Read the proteome of a given organism

Description

Read an organism-specific proteome stored in a fasta or fastq file.

Usage

read.proteome(file, format, ...)

Arguments

file

a character string specifying the path to the file storing the proteome.

format

a character string specifying the file format used to store the proteome, e.g. format = "fasta" or format = "fastq".

...

additional arguments passed on to the readAAStringSet function (Biostrings package).

Details

This function takes a string specifying the path to the proteome file of interest as its first argument.

Proteomes can be read from different file formats, such as fasta or fastq.

Proteomes stored in fasta format can be downloaded from http://www.ebi.ac.uk/reference_proteomes.

Value

A data.frame storing the gene id in the first column, the corresponding sequence as string in the second column, and the sequence length in the third column.
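
To illustrate the shape of the returned value, here is a minimal base-R sketch that parses a fasta file into a three-column data.frame. It is not the seqreadr implementation (which relies on readAAStringSet). The function name read_fasta_sketch, the column names geneids, seqs and aa_length, and the toy sequences are all assumptions made for this illustration. The sketch also assumes a well-formed file in which every header line is followed by at least one sequence line.

```r
# Hypothetical helper, for illustration only: parse a fasta file into a
# data.frame holding id, sequence, and sequence length.
read_fasta_sketch <- function(file) {
  lines <- readLines(file)
  lines <- lines[nzchar(lines)]            # drop empty lines
  is_header <- startsWith(lines, ">")      # header lines start with ">"
  record <- cumsum(is_header)              # record index for every line

  # The id is the first whitespace-delimited token after ">".
  ids <- sub("^>[[:space:]]*([^[:space:]]+).*$", "\\1", lines[is_header])

  # Concatenate the sequence lines that belong to the same record.
  seqs <- unname(vapply(
    split(lines[!is_header], record[!is_header]),
    paste, character(1), collapse = ""
  ))

  data.frame(
    geneids   = ids,          # assumed column name
    seqs      = seqs,         # assumed column name
    aa_length = nchar(seqs),  # assumed column name
    stringsAsFactors = FALSE
  )
}

# Toy input written to a temporary file (made-up sequences).
tmp <- tempfile(fileext = ".fasta")
writeLines(c(">AT1G01010.1 example protein",
             "MEDQVGFGF",
             "RPNDEEL",
             ">AT1G01020.1",
             "MAASEHRCV"), tmp)

proteome <- read_fasta_sketch(tmp)
str(proteome)
```

Multi-line sequences are joined per record, so the length column reflects the full protein rather than a single line of the file.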

Author(s)

Hajk-Georg Drost

Examples

# reading a proteome stored in a fasta file
Ath.proteome <- read.proteome(system.file('seqs/ortho_thal_aa.fasta', package = 'seqreadr'),
                               format = "fasta")

dplyr::glimpse(Ath.proteome)

HajkD/seqreadr documentation built on May 6, 2019, 10:55 p.m.