census: census

Description

This is the census (also known as adult) dataset. It is used to predict a person's salary class (<=50K or >50K) from socio-demographic and economic predictors.

Usage

data("census")

Format

A data frame with 32561 observations on the following 15 variables.

age

a numeric vector

employment

a factor with levels Federal-gov Local-gov Never-worked Private Self-emp-inc Self-emp-not-inc State-gov Without-pay

a3

a numeric vector (by column position this appears to be fnlwgt, the final sampling weight, in the UCI adult data)

education

a factor with levels 10th 11th 12th 1st-4th 5th-6th 7th-8th 9th Assoc-acdm Assoc-voc Bachelors Doctorate HS-grad Masters Preschool Prof-school Some-college

education.num

a numeric vector

marital.status

a factor with levels Divorced Married-AF-spouse Married-civ-spouse Married-spouse-absent Never-married Separated Widowed

job

a factor with levels Adm-clerical Armed-Forces Craft-repair Exec-managerial Farming-fishing Handlers-cleaners Machine-op-inspct Other-service Priv-house-serv Prof-specialty Protective-serv Sales Tech-support Transport-moving

relationship

a factor with levels Husband Not-in-family Other-relative Own-child Unmarried Wife

race

a factor with levels Amer-Indian-Eskimo Asian-Pac-Islander Black Other White

gender

a factor with levels Female Male

a11

a numeric vector (by column position this appears to be capital-gain in the UCI adult data)

a12

a numeric vector (by column position this appears to be capital-loss in the UCI adult data)

hours.per.week

a numeric vector

native.country

a factor with levels Cambodia Canada China Columbia Cuba Dominican-Republic Ecuador El-Salvador England France Germany Greece Guatemala Haiti Holand-Netherlands Honduras Hong Hungary India Iran Ireland Italy Jamaica Japan Laos Mexico Nicaragua Outlying-US(Guam-USVI-etc) Peru Philippines Poland Portugal Puerto-Rico Scotland South Taiwan Thailand Trinadad&Tobago United-States Vietnam Yugoslavia

salary

a factor with levels <=50K >50K
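
The variable list above can be checked against the loaded data. The following is a minimal sketch, not part of dprep: summarise_columns is a hypothetical helper written for illustration, and the guarded block assumes the dprep package is installed and that data("census") loads an object named census, as the Usage section indicates.

```r
# Hypothetical helper (not part of dprep): returns one row per column with
# its class, number of factor levels (NA for non-factors), and NA count.
summarise_columns <- function(df) {
  data.frame(
    column    = names(df),
    class     = unname(vapply(df, function(x) class(x)[1], character(1))),
    n_levels  = unname(vapply(df, function(x) {
      if (is.factor(x)) nlevels(x) else NA_integer_
    }, integer(1))),
    n_missing = unname(vapply(df, function(x) sum(is.na(x)), integer(1))),
    row.names = NULL,
    stringsAsFactors = FALSE
  )
}

# Assumption: dprep is installed. If it is not, this block is skipped.
if (requireNamespace("dprep", quietly = TRUE)) {
  data("census", package = "dprep")
  print(dim(census))                             # rows and columns
  print(summarise_columns(census))               # per-variable overview
  print(table(census$salary, useNA = "ifany"))   # class balance of the target
}
```

The helper works on any data frame, so it can also be applied to a subset of census or to a cleaned copy to see how the missing-value counts change.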

Source

The UCI Machine Learning Database Repository at:

Examples

## Not run: 
data(census)
imagmiss(census, "census")

## End(Not run)

dprep documentation built on May 29, 2017, 11:01 a.m.