stephbuon/hansardr: Read the Hansard 19th-Century British Parliamentary Debates

Read cleaned, decade subsets of the 19th-century British Parliamentary debates, also known as Hansard. The data was scrapped from UK Parliament's historic Hansard API and underwent corrections for systemic issues which had caused some debates to be marked as "missing" from the corpus. Speaker names were disambiguated using pairwise matching and distance measurements.

Getting started

Package details

MaintainerSteph Buongiorno <steph.buon@gmail.com>
LicenseMIT + file LICENSE
Version1.0.0
URL https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi%3A10.7910%2FDVN%2FZCYJH8&version=DRAFT
Package repositoryView on GitHub
Installation Install the latest version of this package by entering the following in R:
install.packages("remotes")
remotes::install_github("stephbuon/hansardr")
stephbuon/hansardr documentation built on March 1, 2023, 6:42 p.m.