Description Usage Arguments Value Note Source References See Also Examples
The SignalP 5.0 server predicts the presence of signal peptides and the location of their cleavage sites in proteins from Archaea, Gram-positive Bacteria, Gram-negative Bacteria and Eukarya.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 | get_signalp5(data, ...)
## S3 method for class 'character'
get_signalp5(
data,
org_type = c("euk", "gram-", "gram+", "archea"),
splitter = 2500L,
attempts = 2,
progress = FALSE,
...
)
## S3 method for class 'data.frame'
get_signalp5(data, sequence, id, ...)
## S3 method for class 'list'
get_signalp5(data, ...)
## Default S3 method:
get_signalp5(data = NULL, sequence, id, ...)
## S3 method for class 'AAStringSet'
get_signalp5(data, ...)
|
data |
A data frame with protein amino acid sequences as strings in one column and corresponding id's in another. Alternatively a path to a .fasta file with protein sequences. Alternatively a list with elements of class |
... |
currently no additional arguments are accepted apart the ones documented bellow. |
org_type |
One of "euk", "gram-", "gram+" or "archea". Default is "euk". Are the protein sequences from Eukarya, Gram-negative Bacteria, Gram-positive Bacteria or Archaea. |
splitter |
An integer indicating the number of sequences to be in each .fasta file that is to be sent to the server. Default is 2500. Change only in case of a server side error. Accepted values are in range of 1 to 5000. |
attempts |
Integer, number of attempts if server unresponsive, at default set to 2. |
progress |
Boolean, whether to show messages of the job id for each batch. Default is FALSE. |
sequence |
A vector of strings representing protein amino acid sequences, or the appropriate column name if a data.frame is supplied to data argument. If .fasta file path, or list with elements of class "SeqFastaAA" provided to data, this should be left blank. |
id |
A vector of strings representing protein identifiers, or the appropriate column name if a data.frame is supplied to data argument. If .fasta file path, or list with elements of class "SeqFastaAA" provided to data, this should be left blank. |
if org_type is "euk" a data frame with columns:
Character, as from input
Integer, The type of signal peptide predicted: Sec/SPI, Tat/SPI, Sec/SPII or Other if no signal peptide predicted
Numeric, marginal probability that the protein contains a Sec N-terminal signal peptide (Sec/SPI).
Numeric, the probability that the sequence does not have any kind of signal peptid.
Character, cleavage site position.
Numeric, probability of the predicted cleavage site position.
Numeric,llocal amino acid sequence arround the predicted cleavage site.
Logical, did SignalP5 predict the presence of a signal peptide.
Integer, length of the predicted signal peptide.
if org_type is one of "gram-", "gram+" or "archea" the returned data frame will have two additional columns between 'SP.Sec.SPI' and 'Other':
Numeric, marginal probability that the protein contains a Tat N-terminal signal peptide (Tat/SPI).
Numeric, marginal probability that the protein contains a Lipoprotein N-terminal signal peptide (Sec/SPII).
This function creates temporary files in the working directory. If something goes wrong during communication with the server and progress was set to TRUE, predictions can be obtained using 'file.path("http://www.cbs.dtu.dk/services/SignalP-5.0/tmp", jobid, "output_protein_type.txt")' eg 'read.delim(file.path(...), header = TRUE, skip = 1)'.
https://services.healthtech.dtu.dk/service.php?SignalP-5.0
Almagro Armenteros JJ, Tsirigos KD, Sønderby CK, Petersen TN, Winther O, Brunak S, von Heijne G, Nielsen H. (2019) SignalP 5.0 improves signal peptide predictions using deep neural networks. Nature Biotechnology, 37:420-423, doi:10.1038/s41587-019-0036-z
1 2 3 4 5 | library(ragp)
sp5_pred <- get_signalp5(data = at_nsp[1:10,],
sequence,
Transcript.id)
sp5_pred
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.