Description Usage Arguments Details Value Author(s) See Also Examples
Extract text from a file
1 | extract_text(file, pages = NULL, password = NULL, encoding = NULL)
|
file |
A character string specifying the path or URL to a PDF file. |
pages |
An optional integer vector specifying pages to extract from. |
password |
Optionally, a character string containing a user password to access a secured PDF. |
encoding |
Optionally, a character string specifying an encoding for the text, to be passed to the assignment method of |
This function converts the contents of a PDF file into a single unstructured character string.
If pages = NULL
(the default), a length 1 character vector, otherwise a vector of length length(pages)
.
Thomas J. Leeper <thosjleeper@gmail.com>
extract_tables
, extract_areas
, split_pdf
1 2 3 4 5 6 7 8 9 10 11 | ## Not run:
# simple demo file
f <- system.file("examples", "data.pdf", package = "tabulizer")
# extract all text from page 1 only
extract_text(f, from = 1, to = 1)
# extract all text
extract_text(f)
## End(Not run)
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.