preprocess_dataset: Preprocess the New York taxi trip CSV data for analysis


Description

Preprocessing checks the validity of the dataset, drops unneeded columns so that only the necessary ones remain, parses the data into the appropriate data types, creates the new columns needed for analysis, and adds and fills borough columns for the pickup and drop-off locations.
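
The steps above can be sketched in base R as follows. This is only an illustration, not the nytaxi implementation: the function name sketch_preprocess, the column names (taken from the public NYC yellow taxi trip schema), the derived duration_min column, and the toy zone_lookup table are all assumptions. The real function works with the zones sf object, whereas this sketch uses a plain lookup table to stay dependency-free.

```r
# Illustrative sketch only; names and columns are assumptions, not the package's code.
sketch_preprocess <- function(trips, zone_lookup) {
  # 1. Validity check: fail early if an expected column is missing.
  required <- c("tpep_pickup_datetime", "tpep_dropoff_datetime",
                "trip_distance", "PULocationID", "DOLocationID")
  missing_cols <- setdiff(required, names(trips))
  if (length(missing_cols) > 0) {
    stop("Missing columns: ", paste(missing_cols, collapse = ", "))
  }

  # 2. Keep only the columns needed for analysis.
  trips <- trips[, required, drop = FALSE]

  # 3. Parse columns into appropriate types.
  trips$tpep_pickup_datetime  <- as.POSIXct(trips$tpep_pickup_datetime, tz = "UTC")
  trips$tpep_dropoff_datetime <- as.POSIXct(trips$tpep_dropoff_datetime, tz = "UTC")
  trips$trip_distance <- as.numeric(trips$trip_distance)

  # 4. Create a derived column: trip duration in minutes.
  trips$duration_min <- as.numeric(difftime(trips$tpep_dropoff_datetime,
                                            trips$tpep_pickup_datetime,
                                            units = "mins"))

  # 5. Add and fill borough columns for pickup and drop-off locations.
  trips$pickup_borough <-
    zone_lookup$borough[match(trips$PULocationID, zone_lookup$LocationID)]
  trips$dropoff_borough <-
    zone_lookup$borough[match(trips$DOLocationID, zone_lookup$LocationID)]

  trips
}

# Toy input: two trips plus one column that preprocessing drops.
trips <- data.frame(
  tpep_pickup_datetime  = c("2019-01-01 00:10:00", "2019-01-01 01:00:00"),
  tpep_dropoff_datetime = c("2019-01-01 00:25:00", "2019-01-01 01:30:00"),
  trip_distance = c("2.5", "7.1"),
  PULocationID = c(4L, 7L),
  DOLocationID = c(7L, 4L),
  fare_amount = c(12.5, 28),
  stringsAsFactors = FALSE
)

# Toy lookup table standing in for the zones object.
zone_lookup <- data.frame(
  LocationID = c(4L, 7L),
  borough = c("Manhattan", "Queens"),
  stringsAsFactors = FALSE
)

result <- sketch_preprocess(trips, zone_lookup)
```

Matching on a location ID keeps the sketch simple; a spatial join against the zones sf object would be needed when only coordinates are available.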

Usage

preprocess_dataset(taxidata, zones_newyork)

Arguments

taxidata

The dataset returned by read_NYC_trip_dataset.

zones_newyork

The New York zones sf object. If it is not supplied, it is downloaded from the NYC website.

Value

A data.frame containing the preprocessed dataset.

See Also

read_NYC_trip_dataset

Examples

## Not run: 
   preprocess_dataset(read_NYC_trip_dataset())

## End(Not run)

alaacs/nytaxi documentation built on May 9, 2019, 7:31 p.m.