green_2016_01_sample: NYC Taxi trip data from NYC's Taxi and Limousine Commission...

Description Usage Format Details Author(s) Source References Examples

Description

There are three data tables containing information about NYC Taxi trips attached to this package.

Usage

1
2
3

Format

yellow_2016_01_sample datatable (tbl_df) has 10,000 rows and 19 variables. green_2016_01_sample datatable (tbl_df) has 10,000 rows and 21 variables. The following variables are in yellow_2016_01_sample or green_2016_01_sample.

vendorid

A code indicating the LPEP provider that provided the record.1= Creative Mobile Technologies, LLC; 2= VeriFone Inc.

tpep_pickup_datetime

The date and time when the meter was engaged.

tpep_dropoff_datetime

The date and time when the meter was disengaged.

passenger_count

The number of passengers in the vehicle. This is a driver-entered value.

trip_distance

The elapsed trip distance in miles reported by the taximeter.

pickup_longitude

Longitude where the meter was engaged.

pickup_latitude

Latitude where the meter was engaged.

ratecodeid

The final rate code in effect at the end of the trip. 1= Standard rate, 2=JFK, 3=Newark, 4=Nassau or Westchester, 5=Negotiated fare, 6=Group ride

store_and_fwd_flag

This flag indicates whether the trip record was held in vehicle memory before sending to the vendor, aka "store and forward," because the vehicle did not have a connection to the server. Y= store and forward trip, N= not a store and forward trip

dropoff_longtitude

Longitude where the meter was timed off.

dropoff_latitude

Latitude where the meter was timed off.

payment_type

A numeric code signifying how the passenger paid for the trip. 1= Credit card, 2= Cash, 3= No charge, 4= Dispute, 5= Unknown, 6= Voided trip

fare_amount

The time-and-distance fare calculated by the meter.

extra

Miscellaneous extras and surcharges. Currently, this only includes the $0.50 and $1 rush hour and overnight charges.

mta_tax

$0.50 MTA tax that is automatically triggered based on the metered rate in use.

tip_amount

Tip amount - This field is automatically populated for credit card tips. Cash tips are not included.

tolls_amount

Total amount of all tolls paid in trip.

improvement_surcharge

$0.30 improvement surcharge assessed on hailed trips at the flag drop. The improvement surcharge began being levied in 2015.

total_amount

The total amount charged to passengers. Does not include cash tips.

ehail_fee

The description of this variable is not available on TLC website.

trip_type

A code indicating whether the trip was a street-hail or a dispatch that is automatically assigned based on the metered rate in use but can be altered by the driver. 1= Street-hail, 2= Dispatch

Details

NYC's Taxi and Limousine Commission (TLC) Trip Data is a collection of trip records including fields capturing pick-up and drop-off locations, times, trip distances, fares, rate types, and driver-reported passenger counts. The data was collected and provided to the NYC TLC by technology providers under the Taxicab & Livery Passenger Enhancement Programs. The fist two tables are random samples of 100 observations generated by the sample function from the 2016 January green and yellow taxi trip data.

Author(s)

Wencong (Priscilla) Li

Source

These data are recorded at http://www.nyc.gov/html/tlc/html/about/trip_record_data.shtml

References

The description of the data is available on http://www.nyc.gov/html/tlc/html/about/trip_record_data.shtml

Examples

1
2
3
4
5
6
7
8
9
## Not run: 
#' # 
if (require(dplyr) & require(ggplot2)) {
 green_2016_01_sample %>%
 filter(Trip_distance <= 1) %>%
   ggplot(aes(x = Dropoff_longtitude, y = Pickup_latitude))
}

## End(Not run)

beanumber/nyctaxi documentation built on May 12, 2019, 9:43 a.m.