Uses pdftools::pdf_text to extract the text layer from the PDF 'file'.
That text layer serves as the 'gold standard' against which OCR'd versions
are compared. The function also checks that the text layer was distilled
from the original document rather than generated by OCR, e.g., by a scanner
that OCRs its output.
file | Path to the PDF to be processed
write | Whether to write the text to file [FALSE]
save | Whether to save the text as a .rda [TRUE]
A list of pages of text taken from the text layer, if that layer did not come from OCR; otherwise NULL.
# res <- get_gold("test.pdf", "GOLDs")
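The behaviour described above can be sketched roughly as follows. This is a minimal illustration, not the package's actual implementation: the names `looks_like_ocr` and `get_gold_sketch` are hypothetical, and detecting OCR from the PDF's Producer/Creator metadata is an assumption about how such a check might work. The pattern list is illustrative only, and the 'write' and 'save' options are left out because the source does not say where output files go.

```r
# Hypothetical helper: TRUE when the PDF's Producer/Creator metadata names
# a known OCR or scanner tool. The pattern list is illustrative, not exhaustive.
looks_like_ocr <- function(producer = NULL, creator = NULL) {
  ocr_patterns <- c("tesseract", "ocrmypdf", "abbyy", "finereader",
                    "paperport", "scansnap", "omnipage")
  # Collapse to a single lower-case string; missing metadata becomes "".
  meta <- tolower(paste(c(producer, creator), collapse = " "))
  any(vapply(ocr_patterns, grepl, logical(1), x = meta, fixed = TRUE))
}

# Hypothetical sketch of the documented behaviour: return the text layer as
# a list of pages, or NULL when the layer appears to come from OCR.
get_gold_sketch <- function(file) {
  if (!requireNamespace("pdftools", quietly = TRUE)) {
    stop("The 'pdftools' package is required for this sketch.")
  }
  info <- pdftools::pdf_info(file)
  if (looks_like_ocr(info$keys$Producer, info$keys$Creator)) {
    return(NULL)
  }
  pages <- pdftools::pdf_text(file)  # character vector, one element per page
  if (!any(nzchar(trimws(pages)))) {
    return(NULL)                     # no usable text layer at all
  }
  as.list(pages)
}

# res <- get_gold_sketch("test.pdf")
```

Metadata is only a heuristic: a PDF can carry an OCR text layer without naming the OCR tool, so a NULL result is more trustworthy than a non-NULL one.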