pull_last_update: Read meta-data on the last dates scraper data was updated by...

View source: R/pull_last_update.R

pull_last_update    R Documentation

Read meta-data on the last dates scraper data was updated by agency

Description

Reads, from files on our web server, extracted data on the last time COVID-related data was updated.

Usage

pull_last_update(all_dates = FALSE, scraper_name = NULL)

Arguments

all_dates

logical, whether to get data from all dates recorded by the web scraper. Defaults to FALSE.

scraper_name

character, a string naming a particular scraper to pull data for. Defaults to NULL, which pulls data for all scrapers.

Value

A data frame with the number of days since the last update.
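
As a rough illustration of what "days since the last update" means, the base R sketch below derives such a column from a toy data frame. It is not the package's implementation: the `last_update` column, the `as_of` reference date, and every value are invented for illustration, and only the `id`, `State`, and `days_late` names are borrowed from the examples on this page.

```r
# Toy data: hypothetical scrapers and the last date each one saw new data.
# `last_update` and all values are assumptions made up for this sketch.
toy_updates <- data.frame(
    id = c("arizona", "federal", "immigration"),
    State = c("Arizona", "Federal", "ICE"),
    last_update = as.Date(c("2022-04-20", "2022-04-15", "2022-04-01")),
    stringsAsFactors = FALSE
)

# Reference date to measure against (fixed here so the result is reproducible)
as_of <- as.Date("2022-04-22")

# Days since last update: reference date minus the last update date
toy_updates$days_late <- as.integer(as_of - toy_updates$last_update)

print(toy_updates[order(toy_updates$days_late), c("State", "days_late")])
```

A fixed `as_of` date keeps the sketch deterministic; a live calculation would more likely measure against the current date.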

Examples


pull_last_update(all_dates = TRUE, scraper_name = "arizona")

## Not run: 
# the pipeline below uses dplyr verbs and ggplot2 functions
library(dplyr)
library(ggplot2)

# this takes a minute
last_df <- pull_last_update()

last_df %>%
    filter(!is.na(days_late)) %>%
    # keep only these kinds of scrapers
    filter(jurisdiction %in% c("state", "federal", "immigration")) %>%
    # remove population scrapers here
    filter(!stringr::str_detect(id, "(?i)population")) %>%
    # remove youth scrapers here
    filter(!stringr::str_detect(id, "(?i)youth")) %>%
    # relabel immigration and federal rows as ICE and BOP
    mutate(State = ifelse(jurisdiction == "immigration", "ICE", State)) %>%
    mutate(State = ifelse(jurisdiction == "federal", "BOP", State)) %>%
    # average across all scrapers for a given agency
    group_by(State) %>%
    summarize(days_late = mean(days_late), .groups = "drop") %>%
    mutate(State = forcats::fct_reorder(State, days_late)) %>%
    ggplot(aes(x = State, y = days_late, xend = State, yend = 0)) +
    geom_point(size = 3, color = "#D7790F") +
    geom_segment(size = 1.5, color = "#D7790F") +
    coord_flip() +
    theme_behindbars() +
    theme(
        panel.grid.major.y = element_blank(),
        axis.text.y = element_text(color = "#555526", size = 13),
        axis.text.x = element_text(color = "#555526", size = 18),
        panel.grid.major.x = element_line(
            color = "#92926C", linetype = "dotted"),
        axis.title.x = element_text(margin = margin(r = 20)),
        axis.title.y = element_blank(),
        legend.position = "none") +
    scale_color_bbdiscrete() +
    labs(x = "", y = "") +
    ggtitle(
        "Days Since Agency Updated Covid Data",
        stringr::str_c("As of ", as.character(first(last_df$Date))))

## End(Not run)
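
The filtering and averaging steps in the example above can be tried offline. The sketch below reproduces them in base R on a toy data frame standing in for the output of `pull_last_update()`. The column names follow the example, but every row and value is invented, and `grepl()` and `aggregate()` are swapped in for the `stringr` and `dplyr` calls so that no packages or network access are needed.

```r
# Toy stand-in for the output of pull_last_update(); all values are invented.
toy_last <- data.frame(
    id = c("arizona", "arizona_population", "federal",
           "federal_youth", "immigration", "texas"),
    State = c("Arizona", "Arizona", "Federal",
              "Federal", "Federal", "Texas"),
    jurisdiction = c("state", "state", "federal",
                     "federal", "immigration", "state"),
    days_late = c(2, 30, 7, 12, 21, NA),
    stringsAsFactors = FALSE
)

# Drop rows with no lateness value, plus population and youth scrapers
# (case-insensitive match, like the "(?i)" flag in the example)
keep <- !is.na(toy_last$days_late) &
    !grepl("population|youth", toy_last$id, ignore.case = TRUE)
kept <- toy_last[keep, ]

# Relabel immigration and federal rows, as in the example
kept$State[kept$jurisdiction == "immigration"] <- "ICE"
kept$State[kept$jurisdiction == "federal"] <- "BOP"

# Average days_late across all remaining scrapers for each agency
avg_late <- aggregate(days_late ~ State, data = kept, FUN = mean)
print(avg_late[order(avg_late$days_late), ])
```

The resulting `avg_late` has one row per agency, which is the shape the plotting code in the example expects before reordering `State` by `days_late`.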


uclalawcovid19behindbars/behindbarstools documentation built on April 22, 2022, 4:08 a.m.