This is the census (also known as adult) dataset. It is used to predict whether a person's salary exceeds 50K, based on socio-demographic and economic predictors.
data("census")
A data frame with 32561 observations on the following 15 variables.
age
a numeric vector
employment
a factor with levels Federal-gov, Local-gov, Never-worked, Private, Self-emp-inc, Self-emp-not-inc, State-gov, Without-pay
a3
a numeric vector
education
a factor with levels 10th, 11th, 12th, 1st-4th, 5th-6th, 7th-8th, 9th, Assoc-acdm, Assoc-voc, Bachelors, Doctorate, HS-grad, Masters, Preschool, Prof-school, Some-college
education.num
a numeric vector
marital.status
a factor with levels Divorced, Married-AF-spouse, Married-civ-spouse, Married-spouse-absent, Never-married, Separated, Widowed
job
a factor with levels Adm-clerical, Armed-Forces, Craft-repair, Exec-managerial, Farming-fishing, Handlers-cleaners, Machine-op-inspct, Other-service, Priv-house-serv, Prof-specialty, Protective-serv, Sales, Tech-support, Transport-moving
relationship
a factor with levels Husband, Not-in-family, Other-relative, Own-child, Unmarried, Wife
race
a factor with levels Amer-Indian-Eskimo, Asian-Pac-Islander, Black, Other, White
gender
a factor with levels Female, Male
a11
a numeric vector
a12
a numeric vector
hours.per.week
a numeric vector
native.country
a factor with levels Cambodia, Canada, China, Columbia, Cuba, Dominican-Republic, Ecuador, El-Salvador, England, France, Germany, Greece, Guatemala, Haiti, Holand-Netherlands, Honduras, Hong, Hungary, India, Iran, Ireland, Italy, Jamaica, Japan, Laos, Mexico, Nicaragua, Outlying-US(Guam-USVI-etc), Peru, Philippines, Poland, Portugal, Puerto-Rico, Scotland, South, Taiwan, Thailand, Trinadad&Tobago, United-States, Vietnam, Yugoslavia
salary
a factor with levels <=50K, >50K
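Because the dataset is meant for predicting salary from the other columns, a first look at the class balance and at one predictor might be sketched as follows. The small toy data frame is a made-up stand-in that borrows a few of the documented column names and factor levels (its values are invented, not rows from the dataset), and share_high_salary is a hypothetical helper, not part of any package. With the package attached, the real data frame loaded by data("census") could presumably be passed in place of toy.

```r
# Toy stand-in using a few documented columns and levels; all values are invented.
toy <- data.frame(
  age = c(39, 50, 38, 53, 28, 37),
  gender = factor(c("Male", "Male", "Male", "Male", "Female", "Female"),
                  levels = c("Female", "Male")),
  hours.per.week = c(40, 13, 40, 40, 40, 40),
  salary = factor(c("<=50K", "<=50K", ">50K", "<=50K", ">50K", "<=50K"),
                  levels = c("<=50K", ">50K"))
)

# Hypothetical helper: share of rows with salary ">50K"
# within each level of the factor column named by `by`.
share_high_salary <- function(df, by) {
  tapply(df$salary == ">50K", df[[by]], mean)
}

str(toy)                                  # column types and factor levels
table(toy$salary)                         # class balance of the target
res <- share_high_salary(toy, "gender")   # share of ">50K" per gender
print(res)
```

The same helper works for any of the factor columns listed above (for example education or marital.status), since it only relies on the salary column and its two documented levels.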
The UCI Machine Learning Database Repository at: