Pronounced “Data Teacher”
The goal of datateachr
is to provide an easy way to access open source
data for educational purposes. It contains several semi-tidied data sets
obtained from multiple sources.
You can install the development version from GitHub with:
install.packages("devtools")
devtools::install_github("UBC-MDS/datateachr")
You may also access this information through our vignette. First use this command for installation:
install.packages("devtools")
devtools::install_github("UBC-MDS/datateachr", build_vignettes = TRUE)
And to read the vignette use:
library(datateachr)
browseVignettes("datateachr")
This package currently contains the following data sets:
apt_buildings
: Acquired courtesy of The City of Toronto’s Open
Data Portal. It currently has 3455 rows
and 37 columns.
building_permits
: Acquired courtesy of The City of Vancouver’s
Open Data Portal. It
currently has 20680 rows and 14 columns.
cancer_sample
: Acquired courtesy of UCI Machine Learning
Repository. It currently
has 569 rows and 32 columns.
flow_sample
: Acquired courtesy of The Government of Canada’s
Historical Hydrometric
Database.
It currently has 218 rows and 7 columns.
parking_meters
: Acquired courtesy of The City of Vancouver’s Open
Data Portal. It
currently has 10032 rows and 22 columns.
steam_games
: Acquired courtesy of
Kaggle.
It currently has 40833 rows and 21 columns.
vancouver_trees
: Acquired courtesy of The City of Vancouver’s
Open Data Portal. It
currently has 146611 rows and 20 columns.
This is a basic example which shows you how to solve a common problem:
library(dplyr)
library(datateachr)
vancouver_trees %>%
filter(genus_name == "ACER") %>%
arrange(desc(diameter)) %>%
head()
#> # A tibble: 6 x 20
#> tree_id civic_number std_street genus_name species_name cultivar_name
#> <dbl> <dbl> <chr> <chr> <chr> <chr>
#> 1 149285 1906 W 14TH AV ACER SPECIES <NA>
#> 2 51001 849 W 60TH AV ACER SACCHARINUM <NA>
#> 3 84751 8250 HUDSON ST ACER PLATANOIDES <NA>
#> 4 54498 6611 ADERA ST ACER PLATANOIDES SCHWEDLERI
#> 5 78588 1333 W GEORGIA… ACER RUBRUM <NA>
#> 6 220644 1438 RICHARDS … ACER PLATANOIDES EMERALD QUEEN
#> # … with 14 more variables: common_name <chr>, assigned <chr>,
#> # root_barrier <chr>, plant_area <chr>, on_street_block <dbl>,
#> # on_street <chr>, neighbourhood_name <chr>, street_side_name <chr>,
#> # height_range_id <dbl>, diameter <dbl>, curb <chr>, date_planted <date>,
#> # longitude <dbl>, latitude <dbl>
Run this command to get information about how to cite this package:
citation("datateachr")
#>
#> To cite package 'datateachr' in publications use:
#>
#> Hayley Boyce and Jordan Bourak (2020). datateachr: Data collected to
#> use for teaching at the University of British Columbia. R package
#> version 0.2.1. https://github.com/UBC-MDS/datateachr
#>
#> A BibTeX entry for LaTeX users is
#>
#> @Manual{,
#> title = {datateachr: Data collected to use for teaching at the University of British
#> Columbia},
#> author = {Hayley Boyce and Jordan Bourak},
#> year = {2020},
#> note = {R package version 0.2.1},
#> url = {https://github.com/UBC-MDS/datateachr},
#> }
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.