ocr_pdf: Perform optical character recognition on a PDF


Description

Uses ImageMagick and Tesseract to perform optical character recognition (OCR) on a PDF. Both programs must be installed and available on the system PATH.
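Because both external programs must be on the PATH, it can help to check for them before calling ocr_pdf. The following is a minimal sketch using base R's Sys.which. The helper name has_tool and the executable names ("magick" or "convert" for ImageMagick, "tesseract" for Tesseract) are assumptions for illustration; this page does not say which commands ocr_pdf invokes.

```r
# Hypothetical helper, not part of pdftext: TRUE if any of the given
# command names resolves to an executable on the PATH.
has_tool <- function(cmds) {
  any(nzchar(Sys.which(cmds)))
}

# Assumed executable names. ImageMagick 7 ships "magick"; older releases
# ship "convert". Note that Windows also has an unrelated system utility
# named "convert", so a hit on that name alone is not conclusive there.
deps <- c(
  imagemagick = has_tool(c("magick", "convert")),
  tesseract   = has_tool("tesseract")
)

missing_deps <- names(deps)[!deps]
if (length(missing_deps) > 0) {
  message("Not found on PATH: ", paste(missing_deps, collapse = ", "))
} else {
  message("ImageMagick and Tesseract both appear to be available.")
}
```

Sys.which returns an empty string for any command it cannot find, so nzchar is enough to tell found from missing.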

Usage

ocr_pdf(file, verbose = TRUE)

Arguments

file

Path to the PDF from which text will be extracted

verbose

Print messages if TRUE; silent if FALSE

See Also

pdf_to_txt

Examples

## Not run: 
res <- ocr_pdf("test.pdf")

## End(Not run)
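A slightly more defensive variant of the example above might look like the sketch below. The wrapper name safe_ocr is hypothetical, and the sketch assumes the package is installed under the name pdftext and exports ocr_pdf. The structure of the return value is not documented on this page, so the sketch only inspects it with str.

```r
# Hypothetical wrapper, not part of pdftext: returns NULL instead of
# failing when the input file or the package is unavailable.
safe_ocr <- function(path, verbose = FALSE) {
  if (!file.exists(path)) {
    message("File not found: ", path)
    return(NULL)
  }
  if (!requireNamespace("pdftext", quietly = TRUE)) {
    message("Package 'pdftext' is not installed.")
    return(NULL)
  }
  # Assumes ocr_pdf is exported by pdftext with the signature shown in Usage.
  pdftext::ocr_pdf(path, verbose = verbose)
}

res <- safe_ocr("test.pdf")
if (!is.null(res)) str(res)
```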

jacob-ogre/pdftext documentation built on May 18, 2019, 8:01 a.m.