A dataset with information on background characteristics and salary of 473 employees.
A data frame with 473 observations on the following 9 variables:
a numeric variable, used as response variable: current salary in US dollars
a numeric variable: age in years
a numeric variable: educational level in years
a numeric variable: beginning salary in US dollars
a numeric variable: months since hire
a numeric variable: previous work experience in months
a factor variable: minority classification, with levels min (minority) and no_min (no minority)
a factor variable: gender, with levels f (female) and m (male)
a factor variable: type of job with levels
This is an example dataset from the statistical software program SPSS, Version 20.0. If you use this dataset, cite IBM Corp. (2011); see the references. The dataset is used as a benchmark in Dusseldorp, Conversano, and Van Os (2010).
IBM Corp. (2011). IBM SPSS Statistics for Windows, Version 20.0. Armonk, NY: IBM Corp.
Dusseldorp, E., Conversano, C., and Van Os, B. J. (2010). Combining an additive and tree-based regression model simultaneously: STIMA. Journal of Computational and Graphical Statistics, 19(3), 514-530.