pdfx: PDF-to-XML conversion of scientific articles using pdfx

pdfx    R Documentation

PDF-to-XML conversion of scientific articles using pdfx

Description

Converts PDFs of scientific articles to XML using the pdfx web service provided by Utopia at http://pdfx.cs.man.ac.uk/.

Usage

pdfx(file = NULL, what = "parsed", ...)

Arguments

file

(character) Path to a file, or paths to several files, on your machine.

what

(character) One of "parsed", "text", or "html". Default: "parsed".

...

Further arguments passed to GET. These aren't named, so pass them directly, e.g. verbose() or timeout(3).
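
Because what comes before ... in the signature, an unnamed option placed directly after file is matched to what by R's positional argument matching. The sketch below illustrates this with a stand-in function that has the same signature as pdfx. The name pdfx_signature_demo is hypothetical, and the function contacts no web service. The commented call at the end assumes that GET is httr's GET and that timeout() comes from httr.

```r
# Hypothetical stand-in with the same signature as pdfx().
# It only reports how its arguments were matched.
pdfx_signature_demo <- function(file = NULL, what = "parsed", ...) {
  list(file = file, what = what, n_extra = length(list(...)))
}

# Unnamed option directly after `file`: positional matching assigns it to `what`
bad <- pdfx_signature_demo(file = "paper.pdf", "some_option")

# Naming `what` lets the unnamed option fall through to `...`
good <- pdfx_signature_demo(file = "paper.pdf", what = "parsed", "some_option")

# With the real function this might look like (assumes httr is installed):
# pdfx(file = path, what = "parsed", httr::timeout(3))
```

Naming both file and what whenever options are supplied keeps the unnamed options in ..., where they can be forwarded to GET.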

Value

Raw XML text, XML parsed to an XMLInternalDocument, or HTML text, depending on what.

Author(s)

Scott Chamberlain myrmecocystus@gmail.com

Examples

## Not run: 
path <- system.file("examples", "example1.pdf", package = "extractr")
pdfx(file = path)

## End(Not run)
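
The what argument selects the return type, so a thin wrapper can check it before any request is sent. The following is a minimal sketch and not part of extractr. The helper pdfx_as is hypothetical, and calling it successfully assumes that extractr is installed and that the web service is reachable.

```r
# Hypothetical helper (not part of extractr): validates inputs, then calls pdfx()
pdfx_as <- function(path, what = c("parsed", "text", "html"), ...) {
  # Stops with an error unless `what` is one of the three documented values
  what <- match.arg(what)
  # Fail early if any of the supplied paths does not exist
  if (!all(file.exists(path))) {
    stop("File not found: ", paste(path[!file.exists(path)], collapse = ", "))
  }
  extractr::pdfx(file = path, what = what, ...)
}

## Not run:
# path <- system.file("examples", "example1.pdf", package = "extractr")
# xml_doc <- pdfx_as(path, "parsed")
# xml_txt <- pdfx_as(path, "text")
## End(Not run)
```

Checking the inputs first means a typo in what or a wrong path fails locally, without uploading anything to the service.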

sckott/extractr documentation built on May 20, 2022, 1:12 p.m.