View source: R/423-extractProtPSSMAcc.R
extractProtPSSMAcc | R Documentation |
Profile-based protein representation derived by PSSM (Position-Specific Scoring Matrix) and auto cross covariance
extractProtPSSMAcc(pssmmat, lag)
pssmmat |
The PSSM computed by |
lag |
The lag parameter. Must be less than the number of amino acids in the sequence (i.e. the number of columns in the PSSM matrix). |
This function calculates the feature vector based on the PSSM by running PSI-Blast and auto cross covariance tranformation.
A length lag * 20^2
named numeric vector,
the element names are derived by the amino acid name abbreviation
(crossed amino acid name abbreviation) and lag index.
Wold, S., Jonsson, J., Sjörström, M., Sandberg, M., & Rännar, S. (1993). DNA and peptide sequences and chemical processes multivariately modelled by principal component analysis and partial least-squares projections to latent structures. Analytica chimica acta, 277(2), 239–253.
extractProtPSSM extractProtPSSMFeature
x = readFASTA(system.file('protseq/P00750.fasta', package = 'Rcpi'))[[1]]
dbpath = tempfile('tempdb', fileext = '.fasta')
invisible(file.copy(from = system.file('protseq/Plasminogen.fasta', package = 'Rcpi'), to = dbpath))
pssmmat = extractProtPSSM(seq = x, database.path = dbpath)
pssmacc = extractProtPSSMAcc(pssmmat, lag = 3)
tail(pssmacc)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.