erythromycin_BlastP | R Documentation |
This dataset contains detailed information on genes involved in the biosynthesis of Erythromycin, an antibiotic produced by the bacterium Saccharopolyspora erythraea and several homologous gene clusters identified by antiSMASH. It includes gene identifiers, chromosomal positions, orientations, and annotations regarding their products and functions, as well as similarity and identity scores from BlastP analysis.
erythromycin_BlastP
A data frame with 148 observations and 16 variables:
Unique protein identifiers. A character vector.
The chromosomal region of the gene, indicating start and end positions and strand. A character vector.
Amino acid sequence of the protein encoded by the gene. A character vector.
Identifier of the gene cluster to which the gene belongs. A character vector.
The strand orientation ("forward" or "complement") of the gene. A character vector.
The start position of the gene on the chromosome. A numeric vector.
The end position of the gene on the chromosome. A numeric vector.
A unique identifier for each row in the dataset. An integer vector.
The identity score from BlastP analysis, representing the percentage of identical matches. A numeric vector.
The similarity score from BlastP analysis, often reflecting the functional or structural similarity. A numeric vector.
Reference to the protein_id after BlastP comparison, or NA if not applicable. A character vector.
Score assigned based on the BlastP analysis, quantifying the match quality. A numeric vector.
Gene name or identifier if available, otherwise NA. A character vector.
Formatted string indicating the gene's position and orientation on the chromosome. A character vector.
Description of the gene product. A character vector.
Functional categorization of the gene. A character vector.
Derived from BlastP analysis of Saccharopolyspora erythraea genes involved in Erythromycin production.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.