adult | R Documentation |
The adult dataset, containing 48842 instances with 16 continuous, binary and discrete variables, was extracted by Barry Becker from the 1994 census bureau database.
data(adult)
adult is a data frame with 48842 cases (rows) and 16 variables (columns) named:
Type
binary: train or test.
Age
continuous.
Workclass
one of the 8 discrete values private, self-emp-not-inc, self-emp-inc, federal-gov, local-gov, state-gov, without-pay or never-worked.
Fnlwgt
continuous; stands for final weight.
Education
one of the 16 discrete values bachelors, some-college, 11th, hs-grad, prof-school, assoc-acdm, assoc-voc, 9th, 7th-8th, 12th, masters, 1st-4th, 10th, doctorate, 5th-6th or preschool.
Education.Num
continuous.
Marital.Status
one of the 7 discrete values married-civ-spouse, divorced, never-married, separated, widowed, married-spouse-absent or married-af-spouse.
Occupation
one of the 14 discrete values tech-support, craft-repair, other-service, sales, exec-managerial, prof-specialty, handlers-cleaners, machine-op-inspct, adm-clerical, farming-fishing, transport-moving, priv-house-serv, protective-serv or armed-forces.
Relationship
one of the 6 discrete values wife, own-child, husband, not-in-family, other-relative or unmarried.
Race
one of the 5 discrete values white, asian-pac-islander, amer-indian-eskimo, other or black.
Sex
binary: female or male.
Capital.Gain
continuous.
Capital.Loss
continuous.
Hours.Per.Week
continuous.
Native.Country
one of the 41 discrete values united-states, cambodia, england, puerto-rico, canada, germany, outlying-us(guam-usvi-etc), india, japan, greece, south, china, cuba, iran, honduras, philippines, italy, poland, jamaica, vietnam, mexico, portugal, ireland, france, dominican-republic, laos, ecuador, taiwan, haiti, columbia, hungary, guatemala, nicaragua, scotland, thailand, yugoslavia, el-salvador, trinadad&tobago, peru, hong or holand-netherlands.
Income
binary: <=50k or >50k.
A. Asuncion and D. J. Newman. UCI Machine Learning Repository, 2007. http://archive.ics.uci.edu/ml/.
data(adult)
# Keep only the complete cases (rows without missing values).
adult <- adult[complete.cases(adult),]
# Show level attributes for binary and discrete variables.
levels(adult[["Type"]])
levels(adult[["Workclass"]])
levels(adult[["Education"]])
levels(adult[["Marital.Status"]])
levels(adult[["Occupation"]])
levels(adult[["Relationship"]])
levels(adult[["Race"]])
levels(adult[["Sex"]])
levels(adult[["Native.Country"]])
levels(adult[["Income"]])
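The Type variable marks each row as train or test, so it can be used to split the data into two partitions. The following is a minimal sketch of that idea, not part of the original examples. So that it runs without the package, it uses a small made-up data frame named toy (a hypothetical stand-in with only three of the 16 columns and invented rows); with the real data, adult would take its place.

```r
# Hypothetical stand-in for `adult`: invented rows, three of the 16 columns.
toy <- data.frame(
  Type   = factor(c("train", "train", "test", "train", "test")),
  Age    = c(39, 50, NA, 28, 37),
  Income = factor(c("<=50k", ">50k", "<=50k", "<=50k", ">50k"))
)

# Keep only the complete cases, as in the example above.
toy <- toy[complete.cases(toy), ]

# Split into training and test partitions using the Type column.
train <- toy[toy[["Type"]] == "train", ]
test  <- toy[toy[["Type"]] == "test", ]

# Proportion of each Income level within each partition.
train_balance <- prop.table(table(train[["Income"]]))
test_balance  <- prop.table(table(test[["Income"]]))

print(train_balance)
print(test_balance)
```

Filtering on Type before modelling keeps the test rows out of any fitting step; whether to drop incomplete cases first, as done here, depends on the analysis.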