ocr_pdf: Perform optical character recognition on a PDF
In jacob-ogre/pdftext: Extract Text from Text- and Image-based PDFs

Uses Imagemagick and Tesseract (which must be installed and on the $PATH) to perform optical character recognition (OCR) on a PDF.

1	ocr_pdf(file, verbose = TRUE)

`file`	Path to the PDF from which text will be extracted
`verbose`	Print messages if TRUE; silent if FALSE

## Not run: 
res <- ocr_pdf("test.pdf")

## End(Not run)

jacob-ogre/pdftext documentation built on May 18, 2019, 8:01 a.m.

jacob-ogre/pdftext index

README.md

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

Description