Nothing
#' DPC-PSSM,AAC-PSSM and AADP-PSSM feature vectors
#' @description This feature is combination of amino asid composition and dipeptide composition feature
#'vectors. DPC feature stands for dipeptide composition, to get this feature vector for both different columns
#'the elements in two consecutive rows corresponds to this columns, would be multiplied and this scenario is done
#'for all L-1 consecutive rows (L is protein length) and finally summation of these numbers divides by L-1. since
#'the result depends on two different columns, eventually DPC Feature vector would be of length 400. AAC-PSSM is actually
#'mean of PSSM Matrix columns which its length is 20. eventually AADP-PSSM is combination of these vectors
#'and with length 420.
#' @param pssm_name name of PSSM Matrix file
#' @import utils
#'
#' @return feature vector of length 420
#' @references
#' Liu, T., Zheng, X. and Wang, J. (2010) Prediction of protein structural class for
#' low-similarity sequences using support vector machine and PSI-BLAST profile, Biochimie, 92, 1330-1334.
#' @export
#' @examples
#' X<-aadp_pssm(system.file("extdata", "C7GQS7.txt.pssm", package="PSSMCOOL"))
aadp_pssm <- function(pssm_name){
x<-read.delim(pssm_name,skip = 2,sep = "",header = FALSE)
x<-x[-1,-c(1,23:44)]
d<-which(x=="Lambda")
if(length(d)!=0){
x<-x[-c(d:dim(x)[1]),]
}
x<-x[,-1]
colnames(x)<-NULL
rownames(x)<-NULL
x<-as.matrix(x)
mode(x)<-"integer"
m2<-x
#m2<-1/(1+exp(-m2))
L<-dim(m2)[1]
AAC<-apply(m2,2,mean)
names(AAC)<-NULL
AAC<-round(AAC,digits = 4)
y<-matrix(0,nrow = 20,ncol = 20)
for (i in 1:20) {
for (j in 1:20) {
for (k in 1:(L-1)) {
y[i,j]<-y[i,j]+m2[k,i]*m2[k+1,j]
}
}
}
y<-(1/(L-1))*y
vec<-c()
for(i in 1:20){
vec<-c(vec,y[i,])
}
vec<-round(vec,digits = 4)
AADP<-c(AAC,vec)
return(AADP)
}
Any scripts or data that you put into this service are public.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.