Data on 32 nuclear power plants. The response variable is cost and there are ten covariates.
A data frame with 32 observations on the following 12 variables. All quantitative variables, except date, have been logged and standardized to have mean 0 and variance 1.
Quantitative covariate. The date on which the construction permit was issued. The data are measured in years since January 1 1990 to the nearest month.
Quantitative covariate. The time between application for and issue of the construction permit.
Quantitative covariate. The time between issue of operating license and construction permit.
Quantitative covariate. The net capacity of the power plant (MWe).
Binary covariate. Value 1, indicates the prior existence of a LWR plant at the same site.
Binary covariate, located in North-East USA
Binary covariate, presence of cooling tower
Binary covariate, where 1 indicates that the nuclear steam supply system was manufactured by Babcock-Wilcox.
Quantitative covariate. The cumulative number of power plants constructed by each architect-engineer.
Binary covariate, partial turnkey guarantee.
Outcome. The capital cost of construction in millions of dollars adjusted to 1976 base.
Davison (2003) explores fitting models to this data using forward and backward stepwise regression. In this modelling logs of quantiative variables are used. We have also standardized this data to facilitate comparison with other techniques such as LARS and principal component regression.
Davison and Hinkley (1997, Example 6.8, 6.10, 6.12) use this data in a series of examples. Example 6.8: estimation of prediction error. Example 6.10: prediction error using cross-validation and bootstrapping. Example 6.12: subset model selection using cross-validation.
Obtained from the CRAN package boot.
Davison, A. C. (2003). Statistical Models. Cambridge: Cambridge University Press.
Davison, A.C. and Hinkley, D.V. (1997). Bootstrap Methods and their Application. Cambridge University Press.