tidy_movies: Tidy version of the movies dataset from the ggplot2 package
In ggupset: Combination Matrix Axis for 'ggplot2' to Create 'UpSet' Plots

tidy_movies

R Documentation

Tidy version of the movies dataset from the ggplot2 package

Description

The original ggplot2movies::movies dataset has 7 columns that contain indicators if a movies belongs to a certain genre. In this version the 7 columns are collapsed to a single list column to create a tidy dataset. It also has information on only 5,000 movies to reduce the size of the dataset. Furthermore each star rating is in its on row.

Usage

tidy_movies

Format

a data frame with 50,000 rows and 10 columns

title. The title of the movie.
year. Year of release.
budget. Total budget (if known) in US dollars.
length. Length in minutes.
rating. Average IMDB user rating.
votes. Number of IMDB user who rated this movie.
mpaa. MPAA rating
Genres. List column with all genres the movie belongs to
stars, percent_rating. The number of stars and the corresponding percentage of people rating the movie with this many stars.