extractPSSMAcc: Profile-based protein representation derived by PSSM...

View source: R/desc-15-PSSMAcc.R

Description

This function calculates a feature vector from the PSSM (generated by running PSI-BLAST) using the auto cross covariance transformation.

Usage

extractPSSMAcc(pssmmat, lag)

Arguments

pssmmat

The PSSM computed by extractPSSM.

lag

The lag parameter. Must be less than the number of amino acids in the sequence (i.e. the number of columns in the PSSM matrix).

Value

A named numeric vector of length lag * 20^2. The element names are derived from the amino acid abbreviations (or the crossed amino acid abbreviations) and the lag index.
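To show where the length lag * 20^2 comes from, here is a minimal sketch of an auto cross covariance transformation on a toy 20-row matrix. The helper acc_sketch, the toy data, and the element naming scheme are assumptions made for illustration; they are not part of protr, and the ordering, naming, and any scaling used by extractPSSMAcc may differ.

```r
# Hypothetical helper (not part of protr): for every ordered pair of rows
# (j1, j2) and every lag l, average m[j1, i] * m[j2, i + l] over positions i.
acc_sketch <- function(m, lag) {
  n <- ncol(m)
  # Same constraint as documented for `lag`: it must be smaller than the
  # number of columns (the sequence length).
  stopifnot(lag >= 1, lag < n)
  out <- numeric(0)
  for (l in seq_len(lag)) {
    head_idx <- seq_len(n - l)  # positions i
    tail_idx <- head_idx + l    # positions i + l
    # p x p matrix: diagonal entries are auto covariances,
    # off-diagonal entries are cross covariances.
    cov_l <- m[, head_idx, drop = FALSE] %*%
      t(m[, tail_idx, drop = FALSE]) / (n - l)
    v <- as.vector(cov_l)
    # Assumed naming scheme: "<row1>.<row2>.lag<l>"
    pair_names <- as.vector(outer(rownames(m), rownames(m), paste, sep = "."))
    names(v) <- paste0(pair_names, ".lag", l)
    out <- c(out, v)
  }
  out
}

# Toy stand-in for a PSSM: 20 rows (amino acids) by 30 columns (positions).
set.seed(1)
aa <- c("A", "R", "N", "D", "C", "Q", "E", "G", "H", "I",
        "L", "K", "M", "F", "P", "S", "T", "W", "Y", "V")
toy <- matrix(rnorm(20 * 30), nrow = 20, dimnames = list(aa, NULL))

acc <- acc_sketch(toy, lag = 3)
length(acc)  # 3 * 20^2 = 1200
```

Each lag contributes 20 auto covariance terms and 20^2 - 20 cross covariance terms, which gives 20^2 values per lag.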

Author(s)

Nan Xiao <https://nanx.me>

References

Wold, S., Jonsson, J., Sjöström, M., Sandberg, M., & Rännar, S. (1993). DNA and peptide sequences and chemical processes multivariately modelled by principal component analysis and partial least-squares projections to latent structures. Analytica Chimica Acta, 277(2), 239–253.

See Also

extractPSSM extractPSSMFeature

Examples

library("protr")

# The example needs the NCBI BLAST+ binaries on the PATH
if (Sys.which("makeblastdb") == "" || Sys.which("psiblast") == "") {

  cat("Cannot find makeblastdb or psiblast. Please install NCBI Blast+\n")

} else {

  # Query sequence shipped with protr
  x = readFASTA(system.file(
    "protseq/P00750.fasta", package = "protr"))[[1]]
  # Copy the example database to a temporary file
  dbpath = tempfile("tempdb", fileext = ".fasta")
  invisible(file.copy(from = system.file(
    "protseq/Plasminogen.fasta", package = "protr"), to = dbpath))

  # Compute the PSSM, then the ACC-transformed feature vector
  pssmmat = extractPSSM(seq = x, database.path = dbpath)
  pssmacc = extractPSSMAcc(pssmmat, lag = 3)
  tail(pssmacc)

}

protr documentation built on Sept. 29, 2017, 9:02 a.m.