| gc_add | R Documentation |
Expands gene cluster tables to include all ORFs (annotated and hypothetical) within contigs, normalizing cluster representations for downstream analysis and plotting. Ensures consistent ORF spacing/length across clusters by inserting missing rows.
gc_add(
Data = sbgc,
Annotation = bin_genes,
orf_before_first = 0,
orf_after_last = 0,
orf_range = "All"
)
Data |
A
|
Annotation |
A |
orf_before_first |
Integer. Hypothetical ORFs to insert before the first annotated ORF
in each cluster (bounded by contig start). Default: |
orf_after_last |
Integer. Hypothetical ORFs to append after the last annotated ORF
(bounded by contig end). Default: |
orf_range |
Character. Controls ORF inclusion and annotation merging:
|
Hypothetical ORFs are inserted as rows with gene = NA.
Output is always sorted by gene_cluster and orf_position.
Progress messages are printed to console with timestamps.
Contig bounds are respected—insertions never exceed actual ORF positions in Annotation.
A data.frame with one row per ORF (real/hypothetical), sorted by gene_cluster and
orf_position. Added columns:
GC_orf_positionRelative position within cluster (1-indexed).
GC_present_lengthCount of annotated ORFs in the cluster.
GC_absent_lengthCount of inserted hypothetical ORFs.
GC_lengthTotal ORFs (GC_present_length + GC_absent_length).
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.