xml2eco: XML to EcoJSON
In dsidavis/SpilloverDA: Munging and Exploring the Viral Spillover Data Set

Description Usage Arguments Value Author(s) Examples

Takes an XML file from a PDF created using pdftohtml, and runs the augmented keyword extractor on it. The intermediate text, broken into sections, and the results can be cached to disk.

1 2	xml2eco(XML, ecoextractLoc = getEcoExtractPyScript(), results_dir = character(), cache.dir = character())

`XML`	the name of the XML document generated by converting the PDF document to XML via pdftohtml or indirectly via the `convertPDF2XML` function in ReadPDF.
`ecoextractLoc`	the file path to the ecoextract.py script. Defaults to the included script in python/ecoextract.py.
`results_dir`	optional directory to save the results into. Will be created if it does not exist.
`cache.dir`	optional directory to save the intermediate text, broken into sections.