Some PDFs have an embedded text layer that was produced by OCR in the scanner or other equipment that created the PDF. Such documents are more likely to contain fundamental errors, e.g., mis-OCR'd columnar text, that can be resolved by running OCR on the document rather than extracting its text layer.
file | Path to the PDF whose embedded text layer should be checked
Returns a logical: TRUE if the embedded text layer is good (not derived from OCR), FALSE if it was produced by OCR.
## Not run:
# res <- summarize_gold("test.pdf", text)
## End(Not run)