Description Usage Format Source References
This validation dataset gives the names, the primary structure (amino acid sequences), and the secondary structure of 203 individual proteins from the targets used in CASP9 experiments as used in the paper cited below.
1 |
A data frame containing 203 observations on the following 3 variables.
name: protein name;
primary: protein primary structure (amino acid sequence) in 20 letters denoting the 20 amino acids;
hetc: secondary structure in 4 letters denoting the 4 structure types: helix (h), strand (e), turn (t) and coil (c).
Moult J, Fidelis K, Kryshtafovych A, Tramontano A (2011) Critical assessment of methods of protein structure prediction (casp) round ix. Proteins: Structure, Function, and Bioinformatics 79: 1-5. <DOI:10.1002/prot.23200>
Q. Li, D. B. Dahl, M. Vannucci, H. Joo, J. W. Tsai (2014), Bayesian Model of Protein Primary Sequence for Secondary Structure Prediction, PLOS ONE, 9(10), e109832. <DOI:10.1371/journal.pone.0109832>
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.