pdf_to_xml: Parse pdf document as XML.

Description Usage Arguments Value Examples

View source: R/utils.R

Description

Parse pdf document as XML.

Usage

1
pdf_to_xml(filename, first, last)

Arguments

filename

pdf file to process

first

first page to process

last

last page to process

Value

an xml_document class object from package xml2

Examples

1
2
3
4
5
6
7
8
unmd_pdf <- system.file(package = "trickypdf", "extdata", "pdf", "UN_Millenium_Declaration.pdf")
unmd_xml <- pdf_to_xml(filename = unmd_pdf)

cdu_manifesto_pdf <- system.file(package = "trickypdf", "extdata", "pdf", "cdu.pdf")
cdu_manifesto_xml <- pdf_to_xml(filename = cdu_manifesto_pdf)

unga_pdf <- system.file(package = "trickypdf", "extdata", "pdf", "N9586353.pdf")
unga_xml <- pdf_to_xml(filename = unga_pdf)

PolMine/trickypdf documentation built on Nov. 20, 2019, 8:01 p.m.