ocr_pages: Perform optical character recognition on PNGs.
In jacob-ogre/pdftext: Extract Text from Text- and Image-based PDFs

Description Usage Arguments Value See Also Examples

Uses Tesseract (which must be installed and on the $PATH) to perform optical character recognition (OCR) on a pdf. Uses options()$pdftext.tess_conf to specify a custom config for tesseract, which can be set with set_tess_conf.

1	ocr_pages(pngs, fin_file, verbose = TRUE)

`pngs`	A listing of the temp PNG directory for a PDF
`fin_file`	Path to the 'final' text file to be written
`verbose`	Whether to print processing messages [TRUE]

The path to the OCR'd text file

pdf_to_txt

## Not run: 
res <- ocr_pages("test.pdf")

## End(Not run)

jacob-ogre/pdftext documentation built on May 18, 2019, 8:01 a.m.

jacob-ogre/pdftext index

README.md

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

jacob-ogre/pdftext
Extract Text from Text- and Image-based PDFs

ocr_pages: Perform optical character recognition on PNGs.
In jacob-ogre/pdftext: Extract Text from Text- and Image-based PDFs

Description

Usage

Arguments

Value

See Also

Examples

Related to ocr_pages in jacob-ogre/pdftext...

R Package Documentation

Browse R Packages

We want your feedback!

jacob-ogre/pdftext Extract Text from Text- and Image-based PDFs

ocr_pages: Perform optical character recognition on PNGs. In jacob-ogre/pdftext: Extract Text from Text- and Image-based PDFs

Description

Usage

Arguments

Value

See Also

Examples

Related to ocr_pages in jacob-ogre/pdftext...

R Package Documentation

Browse R Packages

We want your feedback!

jacob-ogre/pdftext
Extract Text from Text- and Image-based PDFs

ocr_pages: Perform optical character recognition on PNGs.
In jacob-ogre/pdftext: Extract Text from Text- and Image-based PDFs