README.md

Build Status Build status codecov.io

Introduction to the data and the repo

The U.S. Embassy and Consulates General in India maintain an air quality monitoring program with on-site measuring instruments and put the corresponding data on this website. There are csv files for 2013 and 2014, for 2015 except December which is in a pdf, and various csv/pdf for the months of the beginning of 2016. From August 2016 I chose to use OpenAQ and in particular ropenaq to get new data.

In this repository I have made a copy of these files and provide a R code for wrangling them to get a single csv with all measures for Delhi, Mumbai, Kolkata, Hyderabad and Chennai. Refer to original source for licensing questions.

Useful even for non R users: You will find the raw data in this folder and my wrangling code is here. The resulting csv is here. I will try to update the repository as new data comes in on the embassy website.

R package

I have made a R package out of the data so that my fellow R users can easily play with the data. The package imports ggTimeSeries for doing the calendar plot. It can be installed this way:

devtools::install_github("Ather-Energy/ggTimeSeries")
devtools::install_github("masalmon/usaqmindia")

Data format

The data is in a long format:

library("usaqmindia")
library("dplyr")
data("pm25_india")
pm25_india %>% head(n = 20) %>% knitr::kable()

| datetime | city | conc| |:--------------------|:------|------:| | 2013-01-01 01:00:00 | Delhi | 324.4| | 2013-01-01 02:00:00 | Delhi | 366.8| | 2013-01-01 03:00:00 | Delhi | 290.7| | 2013-01-01 04:00:00 | Delhi | 245.4| | 2013-01-01 05:00:00 | Delhi | 220.3| | 2013-01-01 06:00:00 | Delhi | 180.2| | 2013-01-01 07:00:00 | Delhi | 140.0| | 2013-01-01 08:00:00 | Delhi | 125.0| | 2013-01-01 09:00:00 | Delhi | 111.0| | 2013-01-01 10:00:00 | Delhi | 112.0| | 2013-01-01 11:00:00 | Delhi | 129.4| | 2013-01-01 12:00:00 | Delhi | 184.1| | 2013-01-01 13:00:00 | Delhi | 284.8| | 2013-01-01 14:00:00 | Delhi | 372.3| | 2013-01-01 15:00:00 | Delhi | 456.7| | 2013-01-01 16:00:00 | Delhi | 404.8| | 2013-01-01 17:00:00 | Delhi | 328.1| | 2013-01-01 18:00:00 | Delhi | 274.4| | 2013-01-01 19:00:00 | Delhi | 285.0| | 2013-01-01 20:00:00 | Delhi | 354.9|

Time series plot

Below is an example plot of concentrations (see the code here). Check out the Diwali peaks!

library("usaqmindia")
usaqmindia_plot()

Calendar plot

And this is a calendar plot of daily median concentrations in one city using the ggTimeSeries package (see the code here):

usaqmindia_calendar(cityplot = "Delhi")

Contributing

Please note that this project is released with a Contributor Code of Conduct. By participating in this project you agree to abide by its terms.



masalmon/usaqmindia documentation built on May 21, 2019, 12:41 p.m.