marathon_sample: A resampled set of runners from all marathons with more 50...

Description Usage Format Source Examples

Description

Each marathon will have 100 runners (50 male, 50 female) per year. So any marathon with less than 50 runners in the group will have multiple resampled runners. This data set has over 500k runners. The original data had close to 10 million runners and a few more columns. The NYT had a good article - https://www.nytimes.com/2014/04/23/upshot/what-good-marathons-and-bad-investments-have-in-common.html?rref=upshot&_r=1

Usage

1

Format

A data frame with columns:

age

The variable is numeric. The age of the runner

gender

The variable is character. The gender of the runner (M/F)

chiptime

The variable is numeric. The time in minutes for the runner

year

The variable is integer. The year of the marathon

marathon

The variable is character. The name of the marathon

country

The variable is character. The country where the marathon was held

finishers

The variable is integer. The number of finishers at the marathon

Source

http://faculty.chicagobooth.edu/george.wu/research/marathon/data.htm

Examples

1
2
3
4
## Not run: 
marathon_sample

## End(Not run)

byuidatascience/data4marathons documentation built on July 19, 2021, 5:29 a.m.