| genome_abundances.int | R Documentation |
Interval-valued dataset of dinucleotide relative abundances for 14 genome classes. Each class aggregates multiple genomes; the intervals represent the range of observed abundance values within each class for 10 dinucleotide pairs, plus a count variable.
data(genome_abundances.int)
A symbolic data frame (symbolic_tbl) with 14 observations
(genome classes) and 11 variables:
CG: Interval-valued CG dinucleotide relative abundance.
GC: Interval-valued GC dinucleotide relative abundance.
TA: Interval-valued TA dinucleotide relative abundance.
AT: Interval-valued AT dinucleotide relative abundance.
CC: Interval-valued CC dinucleotide relative abundance.
AA: Interval-valued AA dinucleotide relative abundance.
AC: Interval-valued AC dinucleotide relative abundance.
AG: Interval-valued AG dinucleotide relative abundance.
CA: Interval-valued CA dinucleotide relative abundance.
GA: Interval-valued GA dinucleotide relative abundance.
n: Number of genomes in the class (integer).
Row names are Class_1 through Class_14.
| Sample size (n) | 14 |
| Variables (p) | 11 |
| Subject area | Genomics |
| Symbolic format | Interval |
| Analytical tasks | Clustering, Descriptive statistics |
Billard, L. and Diday, E. (2020), Table 3-16.
Billard, L. and Diday, E. (2020). Clustering Methodology for Symbolic Data. Wiley, Chichester. Table 3-16.
data(genome_abundances.int)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.