boston: Corrected Boston Housing Data

Description Format Note Source References Examples

Description

The boston.c data frame has 506 rows and 20 columns. It contains the Harrison and Rubinfeld (1978) data corrected for a few minor errors and augmented with the latitude and longitude of the observations. Gilley and Pace also point out that MEDV is censored, in that median values at or over USD 50,000 are set to USD 50,000. The original data set without the corrections is also included in package mlbench as BostonHousing. In addition, a matrix of tract point coordinates projected to UTM zone 19 is included as boston.utm, and a sphere of influence neighbours list as boston.soi.

Format

This data frame contains the following columns:

Note

Details of the creation of the tract shapefile given in final don't run block; tract boundaries for 1990: http://www.census.gov/geo/cob/bdy/tr/tr90shp/tr25_d90_shp.zip, counties in the BOSTON SMSA http://www.census.gov/population/metro/files/lists/historical/63mfips.txt; tract conversion table 1980/1970: https://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/7913?q=07913&permit[0]=AVAILABLE, http://www.icpsr.umich.edu/cgi-bin/bob/zipcart2?path=ICPSR&study=7913&bundle=all&ds=1&dups=yes. The shapefile contains corrections and extra variables (tract 3592 is corrected to 3593; the extra columns are:

Source

http://lib.stat.cmu.edu/datasets/boston_corrected.txt

References

Harrison, David, and Daniel L. Rubinfeld, Hedonic Housing Prices and the Demand for Clean Air, Journal of Environmental Economics and Management, Volume 5, (1978), 81-102. Original data.

Gilley, O.W., and R. Kelley Pace, On the Harrison and Rubinfeld Data, Journal of Environmental Economics and Management, 31 (1996),403-405. Provided corrections and examined censoring.

Pace, R. Kelley, and O.W. Gilley, Using the Spatial Configuration of the Data to Improve Estimation, Journal of the Real Estate Finance and Economics, 14 (1997), 333-340.

Bivand, Roger. Revisiting the Boston data set - Changing the units of observation affects estimated willingness to pay for clean air. REGION, v. 4, n. 1, p. 109-127, 2017. http://openjournals.wu.ac.at/ojs/index.php/region/article/view/107.

Examples

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
if (requireNamespace("spdep", quietly = TRUE)) {
  library(spdep)
  data(boston)
  hr0 <- lm(log(MEDV) ~ CRIM + ZN + INDUS + CHAS + I(NOX^2) + I(RM^2) +
                    AGE + log(DIS) + log(RAD) + TAX + PTRATIO + B + log(LSTAT), data = boston.c)
  summary(hr0)
  logLik(hr0)
  gp0 <- lm(log(CMEDV) ~ CRIM + ZN + INDUS + CHAS + I(NOX^2) + I(RM^2) +
                    AGE + log(DIS) + log(RAD) + TAX + PTRATIO + B + log(LSTAT), data = boston.c)
  summary(gp0)
  logLik(gp0)
  lm.morantest(hr0, nb2listw(boston.soi))
}
## Not run: 
library(rgdal)
boston.tr <- readOGR(system.file("shapes/boston_tracts.shp",
                           package="spData")[1])
boston_nb <- poly2nb(boston.tr)

## End(Not run)
## Not run: 
if (requireNamespace("spatialreg", quietly = TRUE)) {
  library(spatialreg)
  gp1 <- errorsarlm(log(CMEDV) ~ CRIM + ZN + INDUS + CHAS + I(NOX^2)
                             + I(RM^2) +  AGE + log(DIS) + log(RAD) +
                              TAX + PTRATIO + B + log(LSTAT),
                             data=boston.c, nb2listw(boston.soi), method="Matrix", 
                             control=list(tol.opt = .Machine$double.eps^(1/4)))
  summary(gp1)
  gp2 <- lagsarlm(log(CMEDV) ~ CRIM + ZN + INDUS + CHAS + I(NOX^2) + I(RM^2)
                  +  AGE + log(DIS) + log(RAD) + TAX + PTRATIO + B + log(LSTAT),
                  data=boston.c, nb2listw(boston.soi), method="Matrix")
  summary(gp2)
}

## End(Not run)

spData documentation built on Oct. 14, 2021, 5:06 p.m.