oils.int: Oils and Fats Interval Dataset

oils.intR Documentation

Oils and Fats Interval Dataset

Description

Classic benchmark interval-valued data for 8 oils and fats described by 4 physico-chemical properties. Originally from Ichino (1988).

Usage

data(oils.int)

Format

A data frame with 8 observations and 9 columns (4 interval variables in _l/_u Min-Max pairs, plus a label):

  • sample: Oil/fat sample name (character).

  • specific_gravity_l, specific_gravity_u: Specific gravity range.

  • freezing_point_l, freezing_point_u: Freezing point range (degrees Celsius).

  • iodine_value_l, iodine_value_u: Iodine value range.

  • saponification_value_l, saponification_value_u: Saponification value range.

Details

The 8 samples are: Linseed oil, Perilla oil, Cottonseed oil, Sesame oil, Camellia oil, Olive oil, Beef tallow, Hog fat. The expected 3-cluster structure is: {Beef tallow, Hog fat}, {Cottonseed, Sesame, Camellia, Olive}, and {Linseed, Perilla}. Widely used for comparing clustering methods and distance measures in symbolic data analysis.

Metadata

Sample size (n) 8
Variables (p) 9
Subject area Chemistry
Symbolic format Interval
Analytical tasks Clustering

References

Ichino, M. (1988). General metrics for mixed features. Proc. IEEE Conf. Systems, Man, and Cybernetics, pp. 494-497.

Diday, E. and Noirhomme-Fraiture, M. (Eds.) (2008). Symbolic Data Analysis and the SODAS Software. Wiley. Table 13.7, p.253.

Examples

data(oils.int)

dataSDA documentation built on June 12, 2026, 9:06 a.m.