Wine | R Documentation |
This data set contains prices and production information for 1450 red wines produced on grapes harvested between the year 1991 and 2000. The wines were produced in The United States: Washington, and seven regions of California.
Wine
A data frame with 1450 observations (rows) and 11 variables (columns).
Column name | Data type | Description | Values | |
[,1] | age | integer | Years of aging before commercialization | (1 - 6) |
[,2] | cases | numeric | Number of cases produced in thousands | (0.016 - 47) |
[,3] | estate | integer | 1 if estate grown, otherwise 0 | (0, 1) |
[,4] | price | numeric | CPI adjusted bottle price in dollars | (38.52 - 150) |
[,5] | reserve | integer | 1 if aged before sale, otherwise 0 | (0 - 1) |
[,6] | score | integer | Quality represented by the WSM tasting score | (76 - 99) |
[,7] | vineyard | integer | 1 if vineyard information is provided | (0 - 1) |
[,8] | region | character | 8 production areas in the USA | ("napa"..."sierra") |
[,9] | vintage | integer | The year of grape harvest | (1991 - 2000) |
[,10] | grape | character | 5 different types of grapes | ("cabernet"..."shiraz") |
[,11] | category | character | 2 wine categories | ("premium", "ultrapremium") |
A wine is defined as estate grown if it is produced only from grapes in a
vineyard (where the grapes are grown) owned or controlled by the winery
(where the wine is produced). score
is a tasting score out of 100
provided by the Wine Spectator Magazine (WSM).
This data is a subset of the observations from WSM. Binary columns from the original data related to the production region, grape variety and wine vintage are combined into three categorical variables: region, grape and vintage.
Costanigro, M., Mittelhammer, R.C., McCluskey, J.J. (2009) Estimating Class-specific Parametric Models under Class Uncertainty: Local Polynomial Regression Clustering in an Hedonic Analysis of Wine Markets", Journal of Applied Econometrics, 24, 1117-1135.
The data can be found here.
# The unique regions
unique(Wine$region)
# Structure of the object
str(Wine)
# Price for each age
boxplot(price ~ age, data = Wine,
col = c("thistle4", "violetred4", "skyblue",
"sienna", "navajowhite4", "moccasin"))
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.