at_nsp: 2700 Arabidopsis thaliana protein sequences.

Description Usage Format Source

Description

A dataset containing the sequences of 2706 Arabidopsis thaliana proteins, with a length of 100 to 400 amino acid residues predicted to contain a N-signal peptide by phobius web server or by SignalP 4.1 The dataset represents a cleaned version of all protein sequences obtained from Phytozome V12 as Athaliana_167_TAIR9.fa.gz.

Usage

1

Format

A data frame with 2706 rows and 4 variables:

Transcript.id

TAIR Gene Model id

sequence

amino acid sequence of the protein

is.signalP

logical, predicted as N-sp by SignalP 4.1

phobius

logical, predicted as N-sp by phobius web server

Source

http://genome.jgi.doe.gov/pages/dynamicOrganismDownload.jsf?organism=Phytozome


missuse/ragp documentation built on Jan. 4, 2022, 10:49 a.m.