USvote: U.S. 2000 Election Data

Description Usage Format Details Source References Examples

Description

Data from a post-election survey following the year 2000 U.S. presidential elections.

Usage

1
data("USvote")

Format

A data frame with 10645 observations on the following 6 variables.

vote3

candidate voted for Gore or Bush

gender

gender, a factor with levels male and female

ager

age group, an ordered factor with levels 18-24 < 25-34 < 35-44 < 45-54 < 55-64 < 65+

empstat

status of employment, a factor with levels yes, no or retired

educr

status of education, an ordered factor with levels <HS < HS < >HS < College < Post Coll

marstat

status of living situation, a factor with levels married, widowed, divorced or never married

Details

The dataset is based on data from a post-election survey on persons who voted for either Bush or Gore in the 2000 U.S. election. The specific variables are related to the publication of Magidson and Vermunt (2005).

Further information (and datasets) about the 2000 U.S. election and other National Election Studies is available on the American National Election Studies Web site (http://www.electionstudies.org/).

Source

http://www.statisticalinnovations.com/products/latentgold_datasets.html#USselection2000reg

References

Jay Magidson and Jeroen K. Vermunt (2005). An Extension of the CHAID Tree-based Segmentation Algorithm to Multiple Dependent Variables. In: C. Weihs & W. Gaul (Editors), Classification: The Ubiquitous Challenge, pages 176–183. Heidelberg: Springer http://www.statisticalinnovations.com/products/8pagearticle.pdf

Examples

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
  library("CHAID")

  ### fit tree to subsample
  set.seed(290875)
  USvoteS <- USvote[sample(1:nrow(USvote), 2000),]
  chaidUS <- chaid(vote3 ~ gender + ager + empstat + educr + marstat, 
                   data = USvoteS)

  print(chaidUS)
  plot(chaidUS)

CHAID documentation built on May 2, 2019, 4:47 p.m.