tidy_movies: Tidy version of the movies dataset from the ggplot2 package

tidy_movies    R Documentation

Tidy version of the movies dataset from the ggplot2 package

Description

The original ggplot2movies::movies dataset has 7 indicator columns, each marking whether a movie belongs to a certain genre. In this version, the 7 columns are collapsed into a single list column to create a tidy dataset. To reduce the size of the dataset, it contains information on only 5,000 movies. Furthermore, each star rating is in its own row, so every movie appears in multiple rows.
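To illustrate what collapsing indicator columns into a list column means, here is a minimal base R sketch. It is not the code used to build tidy_movies. The data frame movies_wide, its rows, and the restriction to three genre columns are assumptions made for illustration only.

```r
# Toy stand-in for the wide layout (hypothetical rows; only three of the
# seven genre indicator columns are shown to keep the example short)
movies_wide <- data.frame(
  title  = c("A", "B", "C"),
  Action = c(1L, 0L, 0L),
  Comedy = c(1L, 1L, 0L),
  Drama  = c(0L, 1L, 0L),
  stringsAsFactors = FALSE
)
genre_cols <- c("Action", "Comedy", "Drama")

# For every row, keep the names of the genre columns whose indicator is 1
genres <- lapply(seq_len(nrow(movies_wide)), function(i) {
  genre_cols[unlist(movies_wide[i, genre_cols]) == 1L]
})

# Replace the indicator columns with a single list column
movies_tidy <- movies_wide[, "title", drop = FALSE]
movies_tidy$Genres <- genres

str(movies_tidy$Genres)
```

A movie without any genre ends up with an empty character vector in the list column, which keeps the row in the dataset instead of dropping it.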

Usage

tidy_movies

Format

A data frame with 50,000 rows and 10 columns:

  • title. The title of the movie.

  • year. Year of release.

  • budget. Total budget (if known) in US dollars.

  • length. Length in minutes.

  • rating. Average IMDB user rating.

  • votes. Number of IMDB users who rated this movie.

  • mpaa. MPAA rating.

  • Genres. List column with all genres the movie belongs to.

  • stars, percent_rating. The number of stars and the corresponding percentage of people rating the movie with this many stars.
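Because each star level has its own row, per-movie information is repeated across rows. The following base R sketch shows one way to work with that layout. The data frame ratings_long and its values are hypothetical, and it uses 3 star levels only to keep the example small.

```r
# Toy stand-in with the same long layout (hypothetical values)
ratings_long <- data.frame(
  title          = rep(c("A", "B"), each = 3),
  year           = rep(c(1999L, 2004L), each = 3),
  stars          = rep(1:3, times = 2),
  percent_rating = c(10, 30, 60, 50, 25, 25),
  stringsAsFactors = FALSE
)

# One row per movie: keep only the per-movie columns, then drop repeated rows
per_movie <- unique(ratings_long[, c("title", "year")])

# Star level chosen most often for each movie
top_star <- sapply(split(ratings_long, ratings_long$title), function(d) {
  d$stars[which.max(d$percent_rating)]
})

per_movie
top_star
```

When applying the same idea to tidy_movies, identifying a movie by title alone may not be enough if two movies share a title, so combining title with year (as in the sketch) is a safer assumption.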

Examples

dim(tidy_movies)
head(tidy_movies)


ggupset documentation built on June 24, 2024, 5:08 p.m.