Wine: Wine: Wines from Vintage 1991 - 2000

WineR Documentation

Wine: Wines from Vintage 1991 - 2000

Description

This data set contains prices and production information for 1450 red wines produced on grapes harvested between the year 1991 and 2000. The wines were produced in The United States: Washington, and seven regions of California.

Usage

Wine

Format

A data frame with 1450 observations (rows) and 11 variables (columns).

Column name Data type Description Values
[,1] age integer Years of aging before commercialization (1 - 6)
[,2] cases numeric Number of cases produced in thousands (0.016 - 47)
[,3] estate integer 1 if estate grown, otherwise 0 (0, 1)
[,4] price numeric CPI adjusted bottle price in dollars (38.52 - 150)
[,5] reserve integer 1 if aged before sale, otherwise 0 (0 - 1)
[,6] score integer Quality represented by the WSM tasting score (76 - 99)
[,7] vineyard integer 1 if vineyard information is provided (0 - 1)
[,8] region character 8 production areas in the USA ("napa"..."sierra")
[,9] vintage integer The year of grape harvest (1991 - 2000)
[,10] grape character 5 different types of grapes ("cabernet"..."shiraz")
[,11] category character 2 wine categories ("premium", "ultrapremium")

A wine is defined as estate grown if it is produced only from grapes in a vineyard (where the grapes are grown) owned or controlled by the winery (where the wine is produced). score is a tasting score out of 100 provided by the Wine Spectator Magazine (WSM).

Details

This data is a subset of the observations from WSM. Binary columns from the original data related to the production region, grape variety and wine vintage are combined into three categorical variables: region, grape and vintage.

Source

Costanigro, M., Mittelhammer, R.C., McCluskey, J.J. (2009) Estimating Class-specific Parametric Models under Class Uncertainty: Local Polynomial Regression Clustering in an Hedonic Analysis of Wine Markets", Journal of Applied Econometrics, 24, 1117-1135.

The data can be found here.

Examples


# The unique regions
unique(Wine$region)

# Structure of the object
str(Wine)

# Price for each age
boxplot(price ~ age, data = Wine,
        col = c("thistle4", "violetred4", "skyblue",
                "sienna", "navajowhite4", "moccasin"))


thoree/stat340 documentation built on June 30, 2024, 4:04 p.m.