census.mix: Census Mixed Symbolic Dataset

census.mix R Documentation

Census Mixed Symbolic Dataset

Description

Mixed symbolic dataset of 10 census regions described by 6 variables of 4 symbolic types: two histograms (age, home value), two distributions (gender, tenure), one multi-valued set (fuel), and one interval (income).

Usage

data(census.mix)

Format

A symbolic data frame (symbolic_tbl) with 10 observations (regions) and 6 variables:

  • age: Histogram-valued age distribution (12 age bins).

  • home_value: Histogram-valued home value distribution (7 value bins, in $1000s).

  • gender: Distribution over gender (male, female).

  • fuel: Multi-valued set of fuel types used.

  • tenure: Distribution over housing tenure (owner, renter, vacant).

  • income: Interval-valued household income range (in $1000s).

Row names are Region_1 through Region_10.

Metadata

Sample size (n): 10
Variables (p): 6
Subject area: Demographics
Symbolic format: Mixed (interval, histogram, distribution, multi-valued)
Analytical tasks: Clustering

Source

Billard, L. and Diday, E. (2020), Table 7-23.

References

Billard, L. and Diday, E. (2020). Clustering Methodology for Symbolic Data. Wiley, Chichester. Table 7-23.

Examples

data(census.mix)
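
A slightly longer sketch for checking the loaded object against the Format section. It uses only base R functions and assumes the dataSDA package is installed and that the symbolic_tbl behaves like an ordinary data frame for dim() and names(); the guard skips the inspection if the package is missing.

```r
# Assumption: the dataSDA package is installed. requireNamespace() checks
# this without attaching the package.
if (requireNamespace("dataSDA", quietly = TRUE)) {
  # Load the dataset from the package into the current environment.
  data(census.mix, package = "dataSDA")

  # Dimensions; the Format section documents 10 regions and 6 variables.
  print(dim(census.mix))

  # Variable names; documented as age, home_value, gender, fuel, tenure, income.
  print(names(census.mix))

  # Class vector; the Format section describes the object as a symbolic_tbl.
  print(class(census.mix))

  # Compact overview of how each symbolic column is stored.
  str(census.mix, max.level = 1)
}
```

If the printed dimensions or names differ from the Format section, the installed package version may not match this documentation.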

dataSDA documentation built on June 12, 2026, 9:06 a.m.