scrapeaway: Primary function to download and analyze a PDF file.


View source: R/ScraperScript.R

Description

scrapeaway processes a batch of PDF files and checks each one for mentions of species names.

Usage

scrapeaway(chunksize)

Arguments

chunksize

The number of files to download and process in a single call of the function.

Details

This function downloads files from the NPS Data Store Application, extracts the text from the PDF files, and compares that text to a taxonomic (species-level) dictionary distributed with this package. It is called by a sister function (runlots), where the user can set the number of files to process in a single run. The recommended number of files is between 25 and 100, depending on available bandwidth and processing power.
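
The chunk-then-match idea described above can be illustrated with a minimal base R sketch. It is not the package's implementation: the dictionary entries, the sample text, and the helper names split_into_chunks and find_species are all hypothetical stand-ins, and the download and PDF text extraction steps are skipped by starting from text that is assumed to be already extracted.

```r
# Hypothetical stand-in for the species-level dictionary shipped with the package.
species_dictionary <- c("Ursus americanus", "Canis latrans", "Lynx rufus")

# Assumption: these strings represent text already extracted from downloaded PDFs.
extracted_text <- c(
  report_a = "Tracks of Ursus americanus were recorded near the trailhead.",
  report_b = "No mammal sightings were reported during the survey.",
  report_c = "Both Canis latrans and Lynx rufus were photographed."
)

# Hypothetical helper: group file identifiers into chunks of at most `chunksize`.
split_into_chunks <- function(ids, chunksize) {
  split(ids, ceiling(seq_along(ids) / chunksize))
}

# Hypothetical helper: return the dictionary entries that appear in one document.
find_species <- function(text, dictionary) {
  hits <- vapply(
    dictionary,
    function(name) grepl(name, text, fixed = TRUE),
    logical(1)
  )
  dictionary[hits]
}

# Process the documents one chunk at a time, as a batch run would.
chunks <- split_into_chunks(names(extracted_text), chunksize = 2)

results <- list()
for (chunk in chunks) {
  for (id in chunk) {
    results[[id]] <- find_species(extracted_text[[id]], species_dictionary)
  }
}

print(results)
```

Exact, case-sensitive matching with fixed = TRUE is used here only to keep the sketch simple; the package may match names differently.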

Examples
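
A hedged usage sketch follows, based only on the signature shown under Usage. It assumes the package is installed under the name pdfSpeciesScraper, that scrapeaway is exported, and that it can be called directly without setup that runlots might otherwise perform; none of these points is confirmed by this page. The call is guarded so that nothing is downloaded in a non-interactive session.

```r
# Assumption: the package was installed beforehand, for example with
# remotes::install_github("joe-devivo/pdfSpeciesScraper")
chunksize <- 25  # lower end of the recommended range of 25 to 100 files

if (interactive() && requireNamespace("pdfSpeciesScraper", quietly = TRUE)) {
  # Download and analyze one batch of `chunksize` files.
  pdfSpeciesScraper::scrapeaway(chunksize)
}
```

A larger chunksize means fewer calls for the same number of files, at the cost of more bandwidth and processing time per call.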


joe-devivo/pdfSpeciesScraper documentation built on May 17, 2019, 2:15 a.m.