parse_protocols: Parse all plenary protocols from xml format to tibbles in...

Description Usage Arguments Value

View source: R/parse_protocol.R

Description

Uses the xml structure of a plenary protocols to create three tibbles for further data analysis with the help of parse_protocol().

Usage

1
2
3
4
5
6
7
parse_protocols(
  path = "protokolle",
  start = NULL,
  end = NULL,
  instance_count = NULL,
  check_schema = TRUE
)

Arguments

path

A string containing the path to the xml files you want to parse

start

Name of the protocol you want to start with (optional)

end

Name of the protocol you want to end with (optional). Must succeed start in alphabetical order

instance_count

Specifies the number of r instances that will be used to parse the protocols. The default is the machines core count.

check_schema

A logical value indicating whether you want to check compatibility of the xml schema used in your xml file.

Value

Three tibbles in a named list:

"speakers": A tibble of all speaking politicians containing speaker id, name, party and similar information.

"paragraphs": A tibble of all paragraphs in speeches, containing speaker id, speech id and content of the paragraph.

"comments": A tibble of all comments given during speeches and about reactions to speeches, containing speech id and comment id as well as content of the comment.


bockstaller/dbtprotokoll documentation built on Dec. 31, 2020, 8:56 p.m.