get_signalp5: Query SignalP 5.0 web server.

Description Usage Arguments Value Note Source References See Also Examples

View source: R/get_signalp5.R

Description

The SignalP 5.0 server predicts the presence of signal peptides and the location of their cleavage sites in proteins from Archaea, Gram-positive Bacteria, Gram-negative Bacteria and Eukarya.

Usage

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
get_signalp5(data, ...)

## S3 method for class 'character'
get_signalp5(
  data,
  org_type = c("euk", "gram-", "gram+", "archea"),
  splitter = 2500L,
  attempts = 2,
  progress = FALSE,
  ...
)

## S3 method for class 'data.frame'
get_signalp5(data, sequence, id, ...)

## S3 method for class 'list'
get_signalp5(data, ...)

## Default S3 method:
get_signalp5(data = NULL, sequence, id, ...)

## S3 method for class 'AAStringSet'
get_signalp5(data, ...)

Arguments

data

A data frame with protein amino acid sequences as strings in one column and corresponding id's in another. Alternatively a path to a .fasta file with protein sequences. Alternatively a list with elements of class SeqFastaAA resulting from read.fasta call. Alternatively an AAStringSet object. Should be left blank if vectors are provided to sequence and id arguments.

...

currently no additional arguments are accepted apart the ones documented bellow.

org_type

One of "euk", "gram-", "gram+" or "archea". Default is "euk". Are the protein sequences from Eukarya, Gram-negative Bacteria, Gram-positive Bacteria or Archaea.

splitter

An integer indicating the number of sequences to be in each .fasta file that is to be sent to the server. Default is 2500. Change only in case of a server side error. Accepted values are in range of 1 to 5000.

attempts

Integer, number of attempts if server unresponsive, at default set to 2.

progress

Boolean, whether to show messages of the job id for each batch. Default is FALSE.

sequence

A vector of strings representing protein amino acid sequences, or the appropriate column name if a data.frame is supplied to data argument. If .fasta file path, or list with elements of class "SeqFastaAA" provided to data, this should be left blank.

id

A vector of strings representing protein identifiers, or the appropriate column name if a data.frame is supplied to data argument. If .fasta file path, or list with elements of class "SeqFastaAA" provided to data, this should be left blank.

Value

if org_type is "euk" a data frame with columns:

id

Character, as from input

Prediction

Integer, The type of signal peptide predicted: Sec/SPI, Tat/SPI, Sec/SPII or Other if no signal peptide predicted

SP.Sec.SPI

Numeric, marginal probability that the protein contains a Sec N-terminal signal peptide (Sec/SPI).

Other

Numeric, the probability that the sequence does not have any kind of signal peptid.

CS_pos

Character, cleavage site position.

Pr

Numeric, probability of the predicted cleavage site position.

cleave.site

Numeric,llocal amino acid sequence arround the predicted cleavage site.

is.signalp

Logical, did SignalP5 predict the presence of a signal peptide.

sp.length

Integer, length of the predicted signal peptide.

if org_type is one of "gram-", "gram+" or "archea" the returned data frame will have two additional columns between 'SP.Sec.SPI' and 'Other':

TAT.Tat.SPI

Numeric, marginal probability that the protein contains a Tat N-terminal signal peptide (Tat/SPI).

LIPO.Sec.SPII

Numeric, marginal probability that the protein contains a Lipoprotein N-terminal signal peptide (Sec/SPII).

Note

This function creates temporary files in the working directory. If something goes wrong during communication with the server and progress was set to TRUE, predictions can be obtained using 'file.path("http://www.cbs.dtu.dk/services/SignalP-5.0/tmp", jobid, "output_protein_type.txt")' eg 'read.delim(file.path(...), header = TRUE, skip = 1)'.

Source

https://services.healthtech.dtu.dk/service.php?SignalP-5.0

References

Almagro Armenteros JJ, Tsirigos KD, Sønderby CK, Petersen TN, Winther O, Brunak S, von Heijne G, Nielsen H. (2019) SignalP 5.0 improves signal peptide predictions using deep neural networks. Nature Biotechnology, 37:420-423, doi:10.1038/s41587-019-0036-z

See Also

get_signalp get_targetp

Examples

1
2
3
4
5
library(ragp)
sp5_pred <- get_signalp5(data = at_nsp[1:10,],
                         sequence,
                         Transcript.id)
sp5_pred

missuse/ragp documentation built on Jan. 4, 2022, 10:49 a.m.