file_metadata_1900 | R Documentation |
The Hansard corpus file metadata retains source information from the digitized debates hosted by the UK Parliament. It also includes indexing information added by the author. This data set can be used to locate passages within the original debates or to cite the debates. The variables are as follows:
file_metadata_1900
A data frame with 1817907 rows and 6 variables:
sentence_id A unique ID assigned to each sentence of the Hansard corpus.
speech_id A unique ID assigned to each run of consecutive sentences stated by a speaker during a debate.
debate_id A unique ID assigned to each debate of the Hansard corpus.
src_file_id An ID assigned to the digitized file from which the present dataset was scraped, taken from the digitized parliamentary debates.
src_image An ID assigned to the image of the digitized file, taken from the digitized parliamentary debates.
src_column The column in which the sentence appears, taken from the digitized parliamentary debates.
Harvard Dataverse: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/ZCYJH8
Buongiorno, Steph; Kalescky, Robert; Godat, Eric; Cerpa, Omar Alexander; Guldi, Jo (2021). https://doi.org/10.7910/DVN/ZCYJH8
data(file_metadata_1900)
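As a rough illustration of the two uses described above (locating sentences in the source files and summarizing by debate), the following sketch works on a small stand-in data frame rather than the real data. The name toy_metadata, the helper locate_sentences, and every ID value are hypothetical; only the six column names come from this page, and the column types in the real data set may differ.

```r
# Hypothetical stand-in for file_metadata_1900: same six column names as
# documented on this page, but every value is invented for illustration.
toy_metadata <- data.frame(
  sentence_id = c("S1", "S2", "S3", "S4"),
  speech_id   = c("P1", "P1", "P2", "P3"),
  debate_id   = c("D1", "D1", "D1", "D2"),
  src_file_id = c("F1", "F1", "F1", "F2"),
  src_image   = c("I10", "I10", "I11", "I20"),
  src_column  = c(101L, 101L, 102L, 7L),
  stringsAsFactors = FALSE
)

# Hypothetical helper: return the source location (file, image, column)
# for the requested sentence IDs.
locate_sentences <- function(metadata, ids) {
  hits <- metadata[metadata$sentence_id %in% ids, , drop = FALSE]
  hits[, c("sentence_id", "src_file_id", "src_image", "src_column")]
}

located <- locate_sentences(toy_metadata, c("S3", "S4"))
print(located)

# Count how many sentences belong to each debate.
sentences_per_debate <- table(toy_metadata$debate_id)
print(sentences_per_debate)
```

With the package that ships this data set attached, the same calls should apply to file_metadata_1900 in place of toy_metadata, provided the sentence IDs are matched using the type in which the real column is stored.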