WMM.Feature: Weighted matrix model based mapping of nucleotide sequences...

Description Usage Arguments Details Value Author(s) References See Also Examples

View source: R/WMM.Feature.R

Description

The weighted matrix model (WMM) was developed by Staden (1984) for prediction of splice sites. In this technique, position weight matrix are computed from the aligned positive dataset which is then used for computing the liklihood of any nucleotide sequence being a positive or negative class. The position weight matrix was later on used by Meher et al. (2016) for encoding of splice site motifs for prediction using supervised learning models.

Usage

1
WMM.Feature(positive_class, negative_class, test_seq)

Arguments

positive_class

Sequence dataset of the positive class, must be an object of class DNAStringSet.

negative_class

Sequence dataset of the negative class, must be an object of class DNAStringSet.

test_seq

Sequences to be encoded into numeric vector, must be an object of class DNAStringSet.

Details

In this encoding approach, a vector of two observations will be obtained for each sequence, corresponds to the situation when only positive class and both positive and neagtive datasets are used for encoding. This encoding scheme is also invariant to the length of the sequence.

Value

A numeric matrix of order m*2, where m is the number of sequences in test_seq.

Author(s)

Prabina Kumar Meher, Indian Agricultural Statistics Research Institute, New Delhi-110012, INDIA

References

  1. Staden, R. (1984). Computer methods to locate signals in nucleic acid sequences. Nucleic Acids Research, 12: 505-519.

  2. Meher, P.K., Sahu, T.K., Rao, A.R. and Wahi, S.D. (2016). Identification of donor splice sites using support vector machine: a computational approach based on positional, compositional and dependency features. Algorithms for Molecular Biology, 11(1), 16.

See Also

Bayes.Feature, MN.Fdtf.Feature

Examples

1
2
3
4
5
6
7
8
9
data(droso)
positive <- droso$positive
negative <- droso$negative
test <- droso$test
pos <- positive[1:200]
neg <- negative[1:200]
tst <- test
enc <- WMM.Feature(positive_class=pos, negative_class=neg, test_seq=tst)
enc

EncDNA documentation built on May 28, 2019, 9 a.m.

Related to WMM.Feature in EncDNA...