Chicago Data Package

My name is Ryan Wang, and I am a Master's student at the University of Chicago, concentrating on quantitative methods of social analysis. For this project, I further developed an open Chicago data package from a past GIS 3 class.

This project continues Clyde Schwab's Chicago Data Package (https://github.com/cschwab1/chicagodatapackage) from GIS 3. Clyde is an undergraduate at the University of Chicago studying geography and environmental and urban studies. A year ago, Clyde initiated this project to "provide a starting point for those interested in learning geocomputation with R using real Chicago data", and "building a larger collection of civic data for academic and amateur, journalist and researcher, non-profit and concerned citizens alike." The original package provided some of the most commonly used datasets, along with Clyde's initial processing code and vignettes. I completed the documentation of the datasets and added two new ones: Crime (2001 to present) and Divvy Trips.

Cleaned datasets are available in /data, and the original, unprocessed data is included in /data-raw. The data was collected primarily from the Chicago data portal (https://data.cityofchicago.org/), with some datasets from the CDC and Cook County. Additional documentation is available in /R.

Installation

The project is not yet available on CRAN, so install it directly from GitHub:

library(devtools)
install_github("ryanwyg/Chicago-Data-Package")

The installation might take several minutes depending on your internet connection, as some of the datasets in this package are large.

Datasets Included

The package is set to include the following datasets (all accessed through the SODA API):

The datasets above all come from Clyde's original package. I have linked their online API locations and will update the code accordingly. I also included an additional dataset below, which I think is quite important:

More datasets will be added after the first stage of the project is complete.
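
Since the datasets are accessed through the SODA API, a request comes down to a URL built from a dataset ID. The sketch below is only an illustration and is not the package's own code: `soda_url()` is a hypothetical helper, and the ID `ijzp-q8t2` is assumed to be the portal's identifier for "Crimes - 2001 to Present", so please verify it on the data portal before relying on it.

```r
# Hypothetical helper (not part of this package): build a SODA CSV endpoint URL.
soda_url <- function(dataset_id, limit = 1000L,
                     domain = "data.cityofchicago.org") {
  # `$limit` is the SODA query parameter that caps the number of rows returned.
  sprintf("https://%s/resource/%s.csv?$limit=%d",
          domain, dataset_id, as.integer(limit))
}

# Assumed dataset ID for "Crimes - 2001 to Present"; verify on the portal.
crime_url <- soda_url("ijzp-q8t2", limit = 5)

# Uncomment to download the first five rows (requires an internet connection):
# crimes <- read.csv(crime_url)
# str(crimes)
```

Keeping the row limit small is a cheap way to inspect a dataset's columns before committing to a full download, which matters for large tables such as the crime records.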

The Chicago Data Explorer

This is a flexdashboard app that incorporates RShiny code. It uses the data and functionality of this package and is intended as a tool for an initial assessment of the Chicago data, helping users quickly find directions for further data discovery.

The app is currently published on the RShiny server here: https://ryanwyg.shinyapps.io/ChicagoDataExplorer/#section-the-maps

The app has two sections: a spatial data explorer ("The Maps") and a non-spatial data explorer ("The Tables"). Both use a sidebar to select specific variables of interest, and both let you download the data.

Known Issues

Data Package

Data Explorer App

Scrolling is buggy in the data table visualization (second tab).

Future Improvements

This project is ongoing, and new functionality will come soon, mainly as expansions of the data. You can come back and explore updates through the Data Explorer, which is constantly updated with new datasets from the city's data portal. I plan to add more sections for different categories of data, following the way the data portal website classifies its data ("Buildings", "Community", "Education", etc.). Please do check back for updates.



ryanwyg/Chicago-Data-Package documentation built on June 22, 2020, 2:55 p.m.