speaker_metadata_1840: The 19th-century British Parliamentary debates speaker...

speaker_metadata_1840R Documentation

The 19th-century British Parliamentary debates speaker metadata for the decade 1840

Description

The Hansard corpus debate metadata retains information about speakers. The variables are as follows:

Usage

speaker_metadata_1840

Format

A data frame with 981188 rows and 6 variables:

sentence_id A unique ID assigned to each sentence of the corpus, taken from the digitized parliamentary debates.

speaker The name of the speaker originally recorded in the transciptions of the debates.

suggested_speaker The suggested true identity of the speaker after undergoing disambiguation.

ambiguous Binary. Speaker is assigned the value of "1" if our algorithm marked them as ambiguous (e.g. multiple speakers were matched during the disambiguation process).

fuzzy_matched Binary. Speaker is assigned the value of "1" if our algorithm fuzzy matched the names of the speakers.

ignored Binary. The speaker is ignored by our disambiguation process if they are not a Member of Parliament (MP). Ignored speakers are assigned the value of "1."

Source

\hrefhttps://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/ZCYJH8Harvard Dataverse

References

Buongiorno, Steph; Kalescky, Robert; Godat, Eric; Cerpa, Omar Alexander; Guldi, Jo (2021) (\hrefhttps://doi.org/10.7910/DVN/ZCYJH8)

Examples

data(speaker_metadata_1840)


stephbuon/hansardr documentation built on March 1, 2023, 6:42 p.m.