data-raw/README.md

Data scraping

house_reps

house_reps is a minimally processed version of the data published on the Chief Clerk's webpage.

To re-generate house_reps, run the scripts in order:

  1. 01-scrape.Rmd
  2. 02-parse.Rmd
  3. 03-clean.Rmd
  4. 04-patch.Rmd
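The ordering above can be scripted. The following is a minimal sketch, not the project's own build step. It assumes the four files live in data-raw/ relative to the working directory, that the rmarkdown package is installed, and that rendering each file is enough to run it. The helper name render_in_order is hypothetical.

```r
# Hypothetical helper: render each script in the order given, stopping early
# if any file is missing. Assumes the rmarkdown package is installed.
render_in_order <- function(paths) {
  missing <- paths[!file.exists(paths)]
  if (length(missing) > 0) {
    stop("Missing scripts: ", paste(missing, collapse = ", "))
  }
  for (path in paths) {
    message("Rendering ", path)
    # Rendering an .Rmd file executes its code chunks as a side effect.
    rmarkdown::render(path, quiet = TRUE)
  }
  invisible(paths)
}

# Script names are taken from the list above; the data-raw/ location is an
# assumption based on this README's own path.
scripts <- c("01-scrape.Rmd", "02-parse.Rmd", "03-clean.Rmd", "04-patch.Rmd")
paths <- file.path("data-raw", scripts)

# Only run when the scripts are actually present.
if (all(file.exists(paths))) {
  render_in_order(paths)
}
```

Because render() raises an error when a chunk fails, the loop stops at the first failing script rather than continuing to later steps.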

house_reps_regular

house_reps_regular is a more heavily processed data set. The code to generate it is in 05-regular.Rmd.

Patches

To see the changes made by patches:

library(daff)
library(tidyverse)
library(here)
clean_csv <- here("data-raw", "chronology-clean.csv")
patched_csv <- here("data-raw", "chronology-patched.csv")

# read both files with the column spec inferred from the clean file
spec <- spec_csv(clean_csv)
## Parsed with column specification:
## cols(
##   legislator = col_character(),
##   role = col_character(),
##   district = col_integer(),
##   party = col_character(),
##   residence = col_character(),
##   gender = col_character(),
##   desk_number = col_integer(),
##   profession = col_character(),
##   notes = col_character(),
##   session_name = col_character(),
##   session_year = col_integer(),
##   regular = col_logical()
## )
clean <- read_csv(clean_csv, col_types = spec)
patched <- read_csv(patched_csv, col_types = spec)

diffs <- diff_data(clean, patched, ordered = FALSE)
diff_html <- render_diff(diffs, view = FALSE, fragment = TRUE,
  summary = TRUE)
writeLines(diff_html)
| @@ | session_name | legislator | district | role | party | residence | gender | desk_number | profession | notes | session_year | regular |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... |
|  | 1909 Session | Hawley, C. A. | 11 | null | R | McCoy, Polk | Male | NA | null | null | 1909 | true |
| ⇒ | 1909 Session | Jones, Benjamin F. | 12 | null | F → R | Independence, Polk | Male | NA | null | null | 1909 | true |
|  | 1909 Session | Bones, J. W. | 13 | null | R | Carlton, Yamhill | Male | NA | null | null | 1909 | true |
| ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... |
|  | 1939 Session | Turner, Howard W. | 26 | null | R | Madras, Crook, Jefferson | Male | 58 | null | null | 1939 | true |
| ⇒ | 1939 Session | Grant, A. S. | 27 | null | D | Baker, Baker | NULL → Male | 28 | null | null | 1939 | true |
|  | 1939 Session | Hosch, J. F. | 28 | null | I | Bend, Deschutes | Male | 12 | null | null | 1939 | true |
| ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... |
|  | 1965 Session | Willits, Howard | 6 | null | D | Portland, Multnomah | Male | 45 | null | null | 1965 | true |
| +++ | 1965 Session | Roberts, Betty R. | 6 | null | D | Portland, Multnomah | Female | NA | null | null | 1965 | true |
|  | 1965 Session | Groener, Richard E. | 7 | null | D | Milwaukie, Clackamas | Male | 48 | null | null | 1965 | true |
| ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... |
|  | 1975 Session | Byers, Bernard (Bud) | 37 | null | D | Lebanon, Linn | Male | 35 | null | null | 1975 | true |
| +++ | 1975 Session | Rijken, Max | 38 | null | D | Newport, Benton, Lane, Lincoln | Male | NA | null | null | 1975 | true |
|  | 1975 Session | Kerans, Grattan | 39 | null | D | Eugene, Lane | Male | 10 | null | null | 1975 | true |
| ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... |

In an interactive session, use this for a full-color view:

render_diff(diffs)
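To get a feel for the diff markers without the project files, the same calls can be tried on toy data. This is an illustrative sketch only: the two small data frames are made up (they mimic one changed party value and one added row), and it assumes the daff package is installed.

```r
library(daff)

# Made-up stand-ins for the clean and patched tables (not real data).
clean <- data.frame(
  legislator = c("Legislator A", "Legislator B"),
  district = c(1L, 2L),
  party = c("F", "R"),
  stringsAsFactors = FALSE
)
patched <- data.frame(
  legislator = c("Legislator A", "Legislator B", "Legislator C"),
  district = c(1L, 2L, 3L),
  party = c("R", "R", "D"),
  stringsAsFactors = FALSE
)

# Same calls as in the Patches section, applied to the toy tables.
diffs <- diff_data(clean, patched, ordered = FALSE)
diff_html <- render_diff(diffs, view = FALSE, fragment = TRUE, summary = TRUE)
writeLines(diff_html)
```

In the rendered output, the changed party cell should show the old and new values joined by an arrow, and the extra row should be flagged as added, in the same way as the → and +++ markers in the project diff.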


or-house-vis/history documentation built on May 15, 2019, 1:11 p.m.