ecoli: Data set Ecoli: Protein Localization Sites

ecoliR Documentation

Data set Ecoli: Protein Localization Sites

Description

This data set contains information of Escherichia coli. It is a bacterium of the genus Escherichia that is commonly found in the lower intestine of warm-blooded organism.

Format

A data frame with 336 rows, 8 variables and the class.

Details

Sequence Name

Accession number for the SWISS-PROT database.

mcg

McGeoch's method for signal sequence recognition.

gvh

Von Heijne's method for signal sequence recognition.

lip

Von Heijne's Signal Peptidase II consensus sequence score. Binary attribute.

chg

Presence of charge on N-terminus of predicted lipoproteins. Binary attribute.

aac

Score of discriminant analysis of the amino acid content of outer membrane and periplasmic proteins.

alm1

Score of the ALOM membrane spanning region prediction program.

alm2

Score of ALOM program after excluding putative cleavable signal regions from the sequence.

Class

Class variable. 8 possibles states.

Source

http://archive.ics.uci.edu/ml/datasets/Ecoli


MoTBFs documentation built on April 18, 2022, 5:06 p.m.

Related to ecoli in MoTBFs...