promotergene: E. coli promoter gene sequences (DNA)

Description Usage Format Source References Examples

Description

Promoters have a region where a protein (RNA polymerase) must make contact and the helical DNA sequence must have a valid conformation so that the two pieces of the contact region spatially align. The data contains DNA sequences of promoters and non-promoters.

Usage

1

Format

A data frame with 106 observations and 58 variables. The first variable Class is a factor with levels + for a promoter gene and - for a non-promoter gene. The remaining 57 variables V2 to V58 are factors describing the sequence. The DNA bases are coded as follows: a adenine c cytosine g guanine t thymine

Source

UCI Machine Learning data repository
ftp://ftp.ics.uci.edu/pub/machine-learning-databases/molecular-biology/promoter-gene-sequences

References

Towell, G., Shavlik, J. and Noordewier, M.
Refinement of Approximate Domain Theories by Knowledge-Based Artificial Neural Networks.
In Proceedings of the Eighth National Conference on Artificial Intelligence (AAAI-90)

Examples

1
2
3
4
5
data(promotergene)

## Create model using Distance Weighted Discrimination

promsv <- kdwd(Class~.,data=promotergene,C=100,cross=4)

DWD documentation built on May 2, 2019, 5 p.m.

Related to promotergene in DWD...