Description Usage Arguments Details Author(s) Examples
Sequences are coded based on their composition, transition and distribution.
featureCTD(seq, class=elements("aminoacid"))
seq: a string vector of protein, DNA, or RNA sequences.
class: a list defining the classes of biological properties. It can
be produced by helper functions such as elements() or aaClass(), as in the
examples.
featureCTD returns a matrix with M+M*(M-1)/2+M*5 columns, where M is the
number of classes in class. Each row holds the features of one sequence,
coded as a numeric vector of dimension M+M*(M-1)/2+M*5. Three kinds of
coding are used: composition (C), transition (T) and distribution (D).
C is the number of amino acids of a particular property (such as
hydrophobicity) divided by the total number of amino acids. T characterizes
the percent frequency with which amino acids of a particular property are
followed by amino acids of a different property. D measures the chain length
within which the first, 25%, 50%, 75% and 100% of the amino acids of a
particular property are located.
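The three codings can be illustrated with a small base R sketch. This is not the package's implementation: the function name ctd_sketch, the three-class grouping toy_class, and the choice to report fractions rather than percentages are all assumptions made for illustration, and featureCTD may normalise or order its columns differently.

```r
# Hypothetical three-class grouping of amino acids (illustrative only;
# not taken from the package's own class definitions).
toy_class <- list(
  polar       = c("R", "K", "E", "D", "Q", "N"),
  neutral     = c("G", "A", "S", "T", "P", "H", "Y"),
  hydrophobic = c("C", "L", "V", "I", "M", "F", "W")
)

# Sketch of CTD coding for a single sequence, under the assumptions above.
ctd_sketch <- function(sequence, class) {
  residues <- strsplit(sequence, "")[[1]]
  n <- length(residues)
  M <- length(class)

  # Map each residue to the index of its class (NA if unclassified).
  lookup <- unlist(lapply(seq_len(M), function(i) {
    setNames(rep(i, length(class[[i]])), class[[i]])
  }))
  label <- unname(lookup[residues])

  # Composition: fraction of residues falling in each class (M values).
  comp <- tabulate(label, nbins = M) / n

  # Transition: fraction of adjacent pairs that switch between two given
  # classes, in either direction (M*(M-1)/2 values).
  from <- label[-n]
  to <- label[-1]
  pairs <- combn(M, 2)
  trans <- apply(pairs, 2, function(p) {
    sum((from == p[1] & to == p[2]) | (from == p[2] & to == p[1]),
        na.rm = TRUE) / (n - 1)
  })

  # Distribution: relative position of the first, 25%, 50%, 75% and 100%
  # of the residues of each class (5 values per class).
  dist <- unlist(lapply(seq_len(M), function(i) {
    pos <- which(label == i)
    if (length(pos) == 0) return(rep(0, 5))
    k <- pmax(1, floor(length(pos) * c(0, 0.25, 0.5, 0.75, 1)))
    pos[k] / n
  }))

  c(comp, trans, dist)
}

features <- ctd_sketch("RGCRGCGG", toy_class)
length(features)  # 21 = 3 + 3*2/2 + 3*5
```

With M = 3 classes the vector has 3 composition, 3 transition and 15 distribution values, which matches the column count M+M*(M-1)/2+M*5 given above.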
Hong Li
if(interactive()){
  # Read the example protein sequences shipped with BioSeqClass
  file = file.path(path.package("BioSeqClass"), "example", "acetylation_K.fasta")
  library(Biostrings)
  tmp = readAAStringSet(file)
  proteinSeq = as.character(tmp)
  # CTD coding with each amino acid as its own class
  CTD1 = featureCTD(proteinSeq, class=elements("aminoacid") )
  # CTD coding with the "aaV" amino acid classification
  CTD2 = featureCTD(proteinSeq, class=aaClass("aaV") )
}