download_pdf: Download a PDF from a URL


View source: R/download_pdf.R

Description

A simple function to download a PDF robustly.

Usage

download_pdf(url, file, quiet = FALSE, overwrite = FALSE, pause = TRUE)

Arguments

url

The URL for a PDF

file

File to which the PDF will be downloaded

quiet

Suppress a message about which URL is being processed [default=FALSE]

overwrite

Overwrite an existing file of the same name [default=FALSE]

pause

Whether to pause for 0.5-3 seconds during scraping [default=TRUE]

Details

Scraping PDFs from the web can run into small hitches that make writing a scraper annoying. This function simplifies PDF scraping by bundling the download with support functions that, for example, test whether the downloaded file is actually a PDF. It ensures the URL is encoded and handles missing URLs gracefully. The filename is the basename of the URL with " " replaced by "_". The pause parameter limits the rate at which requests hit the hosting servers.
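
The sketch below is not the package's implementation; it only illustrates the kinds of steps described above (URL encoding, deriving a filename, pausing between requests, and checking for "PDFness"). The URL and the is_pdf helper are hypothetical.

  ## Illustration only -- not pdfdown's actual code
  url <- "https://example.org/reports/annual report.pdf"    # hypothetical URL
  enc_url  <- utils::URLencode(url)                         # ensure the URL is encoded
  filename <- gsub(" ", "_", basename(url))                 # basename with " " -> "_"
  Sys.sleep(stats::runif(1, min = 0.5, max = 3))            # pause 0.5-3 seconds between requests

  ## Hypothetical "PDFness" test: a real PDF begins with the magic bytes "%PDF"
  is_pdf <- function(path) {
    identical(readBin(path, what = "raw", n = 4L), charToRaw("%PDF"))
  }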

TODO: Have the overwrite check work on the MD5 hash of files in the download subdirectory rather than relying on file names.

Value

A data.frame with columns url, destination, success, and pdfCheck
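
An illustration of the returned structure (values are made up; only the column names come from this page, and the column types shown are assumptions):

  ## Illustrative only -- made-up values, assumed types
  data.frame(
    url = "https://example.org/report.pdf",
    destination = "~/Downloads/report.pdf",
    success = TRUE,
    pdfCheck = TRUE,
    stringsAsFactors = FALSE
  )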

Examples

## Not run: 
  result <- download_pdf(url = "https://goo.gl/I3P3A3",
                         file = "~/Downloads/test.pdf")

## End(Not run)
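
A hedged sketch of batch use, not taken from the package documentation: the URLs are placeholders, and the success and pdfCheck columns are assumed to be logical.

## Not run: 
  urls <- c("https://example.org/a.pdf", "https://example.org/b.pdf")  # placeholder URLs
  results <- do.call(rbind, lapply(urls, function(u) {
    download_pdf(url = u, file = file.path("~/Downloads", basename(u)))
  }))
  subset(results, !success | !pdfCheck)   # downloads that failed or are not PDFs

## End(Not run)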
