protein: Protein Secondary Structure Data

protein   R Documentation

Protein Secondary Structure Data

Description

This dataset contains protein sequences and their corresponding secondary structures, with each structure labeled as beta-sheet (E), helix (H), or coil (_).

Usage

protein

Format

A data frame of protein sequences and their secondary structures, with the following columns:

  • Sequence: Amino acid sequence (using 3-letter codes).

  • Structure: Secondary structure of the protein (E for beta-sheet, H for helix, _ for coil).

  • Parameters: Additional parameters for neural networks (to be ignored).

  • Biophysical_Constants: Biophysical constants (to be ignored).
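Since two of the four columns are meant to be ignored, a common first step is to keep only Sequence and Structure. The sketch below illustrates this on a small stand-in data frame rather than on protein itself: the object toy and all of its values are invented for illustration, and the one-residue-per-row layout is an assumption, not something this page confirms.

```r
# Hypothetical stand-in for `protein`: column names follow the Format
# section, but every value here is invented for illustration.
toy <- data.frame(
  Sequence = c("Ala", "Gly", "Leu", "Ser"),
  Structure = c("H", "H", "E", "_"),
  Parameters = NA,              # placeholder for the ignored column
  Biophysical_Constants = NA,   # placeholder for the ignored column
  stringsAsFactors = FALSE
)

# Keep only the two columns described as meaningful.
core <- toy[, c("Sequence", "Structure")]
print(core)
```

The same subsetting expression should apply to protein, provided its column names match those listed above.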

Examples

# Load the dataset
data(protein)

# Print the first few rows of the dataset
print(head(protein))
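
# A possible follow-up is to count how often each secondary-structure code
# occurs. The sketch below uses an invented vector of codes (toy_structure)
# in place of protein$Structure, and assumes one code per element; whether
# the real column is laid out this way is not stated on this page.

```r
# Hypothetical structure codes; invented for illustration only.
toy_structure <- c("H", "H", "E", "_", "_", "H")

# Map the one-character codes to the names used in the Description.
code_names <- setNames(c("beta-sheet", "helix", "coil"), c("E", "H", "_"))

# Tabulate the codes, keeping all three classes even if one is absent.
counts <- table(factor(unname(code_names[toy_structure]),
                       levels = unname(code_names)))
print(counts)
```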

LFM documentation built on June 11, 2025, 9:07 a.m.
