Employee Dataset

Share:

Description

A dataset with information on background characteristics and salary of 473 employees.

Usage

1

Format

A data frame with 473 observations on the following 9 variables:

salary

a numeric variable, used as response variable: current salary in US dollars

age

a numeric variable: age in years

edu

a numeric variable: educational level in years

startsal

a numeric variable: beginning salary in US dollars

jobtime

a numeric variable: months since hire

prevexp

a numeric variable: previous work experience in months

minority

a factor variable: minority classification with levels min, indicating minority, and no_min, no minority

gender

a factor variable: gender type with levels f, indicating female, and m, indicating male

jobcat

a factor variable: type of job with levels Clerical, Custodial, and manager

Source

This is an example dataset from the statistical software program SPSS, Version 20.0. If you use this dataset, refer to IBM Corp. (2011), see references. The dataset is used as a benchmark dataset in Dusseldorp, Conversano, and Van Os (2010).

References

IBM Corp. (2011). IBM SPSS Statistics for Windows, Version 20.0. Armonk, NY: IBM Corp.

Dusseldorp, E. Conversano, C., and Os, B.J. (2010). Combining an additive and tree-based regression model simultaneously: STIMA. Journal of Computational and Graphical Statistics, 19(3), 514-530.