whisky_collection_corrupted | R Documentation |
This dataset is a corrupted version of the whisky_collection dataset of this package. It contains the most common data quality problems like missing data or errors, the columns and structures are not unified, there are outliers or values that will break your prediction model and other pitfalls for educational issues.
data(whisky_collection_corrupted)
A data.frame with 42 rows and 15 variables:
Name of the whisky
Distiller of the specific whisky
Production location of the whisky (mostly countries or regions)
Specification of the whisky type like e.g. single malt or blended
Region of the whisky production (mostly relevant for scotchs)
Year of the first whisky production
Latitude and longitude values of the distillery
Link to the related article of the English Wikipedia
My personal rating of this whisky. I am open to discuss it, just write me an email if you see it otherwise ;-)
The average rating of this whisky based on consumer reviews from many whisky online shops in 2023
The average rating of this whisky based on reviews from professional critics until 2023
My measure of how smoky vs. delicate it tastes, negative values implicate delicate
My measure of how rich vs. light it tastes, negative values implicate light
The average price level in Euro of the youngest 10/12 year or consumer version in the whiskyexchange 2023
Alternative feature to LOCATION, contains location of the whisky (mostly countries or regions)
data(whisky_collection)
library("ggplot2")
ggplot(whisky_collection, aes(x=NAME, y=RATING, fill=RATING)) +
geom_bar(stat="identity") +
coord_flip() +
xlab("")
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.