genome_abundances.int: Genome Dinucleotide Abundance Intervals

genome_abundances.intR Documentation

Genome Dinucleotide Abundance Intervals

Description

Interval-valued dataset of dinucleotide relative abundances for 14 genome classes. Each class aggregates multiple genomes; the intervals represent the range of observed abundance values within each class for 10 dinucleotide pairs, plus a count variable.

Usage

data(genome_abundances.int)

Format

A symbolic data frame (symbolic_tbl) with 14 observations (genome classes) and 11 variables:

  • CG: Interval-valued CG dinucleotide relative abundance.

  • GC: Interval-valued GC dinucleotide relative abundance.

  • TA: Interval-valued TA dinucleotide relative abundance.

  • AT: Interval-valued AT dinucleotide relative abundance.

  • CC: Interval-valued CC dinucleotide relative abundance.

  • AA: Interval-valued AA dinucleotide relative abundance.

  • AC: Interval-valued AC dinucleotide relative abundance.

  • AG: Interval-valued AG dinucleotide relative abundance.

  • CA: Interval-valued CA dinucleotide relative abundance.

  • GA: Interval-valued GA dinucleotide relative abundance.

  • n: Number of genomes in the class (integer).

Row names are Class_1 through Class_14.

Metadata

Sample size (n) 14
Variables (p) 11
Subject area Genomics
Symbolic format Interval
Analytical tasks Clustering, Descriptive statistics

Source

Billard, L. and Diday, E. (2020), Table 3-16.

References

Billard, L. and Diday, E. (2020). Clustering Methodology for Symbolic Data. Wiley, Chichester. Table 3-16.

Examples

data(genome_abundances.int)

dataSDA documentation built on June 12, 2026, 9:06 a.m.