BostonHousing: Boston Housing Data

BostonHousingR Documentation

Boston Housing Data

Description

Housing data for 506 census tracts of Boston from the 1970 census. The data frame BostonHousing contains the original data by \bibcitetHarrison+Rubinfeld:1978, the dataf rame BostonHousing2 the corrected version with additional spatial information (see references below).

Usage

data("BostonHousing", package = "mlbench")
data("BostonHousing2", package = "mlbench")

Format

The original data are 506 observations on 14 variables, medv being the target variable:

crim per capita crime rate by town
zn proportion of residential land zoned for lots over 25,000 sq.ft
indus proportion of non-retail business acres per town
chas Charles River dummy variable (= 1 if tract bounds river; 0 otherwise)
nox nitric oxides concentration (parts per 10 million)
rm average number of rooms per dwelling
age proportion of owner-occupied units built prior to 1940
dis weighted distances to five Boston employment centres
rad index of accessibility to radial highways
tax full-value property-tax rate per USD 10,000
ptratio pupil-teacher ratio by town
b 1000(B - 0.63)^2 where B is the proportion of blacks by town
lstat percentage of lower status of the population
medv median value of owner-occupied homes in USD 1000's

The corrected data set has the following additional columns:

cmedv corrected median value of owner-occupied homes in USD 1000's
town name of town
tract census tract
lon longitude of census tract
lat latitude of census tract

Source

The original data were taken from the UCI Repository Of Machine Learning Databases \bibcitepBlake+Merz:1998 and no longer seem to be available from the UC Irvine Machine Learning Repository (now at https://archive.ics.uci.edu/). The corrected data were taken from Statlib at https://lib.stat.cmu.edu/datasets/. See Statlib and references there for details on the corrections. Both were converted to R format by Friedrich Leisch.

References

\bibinfo

Gilley+Pace:1996note[Provided corrections and examined censoring.] \bibinfoPace+Gilley:1997note[Added georeferencing and spatial estimation.] \bibshowBlake+Merz:1998, Gilley+Pace:1996, Harrison+Rubinfeld:1978, Pace+Gilley:1997

Examples

data("BostonHousing", package = "mlbench")
summary(BostonHousing)

data("BostonHousing2", package = "mlbench")
summary(BostonHousing2)

mlbench documentation built on March 26, 2026, 5:09 p.m.