Dataset of human donor splice sites

Description

Dataset of 400 human donor splice sites with a sequence length of 7 base pairs.

Usage

1

Format

y

binary response. True (1) or false (0) splice site.

Pos.x

DNA letter (A, C, G, T) at position x, where x ranges from 1 to 7.

Details

The dataset is a random subset of the MEMset Donor dataset used in Gene et al. (2004).

References

Gene, Y. and Burge, C. (2004) Maximum Entropy Modeling of Short Sequence Motifs with Applications to RNA Splicing Signals, Journal of Computational Biology, 11, 475 - 494.

Examples

1