get_page_dims: Page length and dimensions

Description Usage Arguments Details Value Author(s) References See Also Examples

View source: R/get_page_dims.R

Description

Get Page Length and Dimensions

Usage

1
2
3
get_page_dims(file, doc, pages = NULL, password = NULL, copy = FALSE)

get_n_pages(file, doc, password = NULL, copy = FALSE)

Arguments

file

A character string specifying the path or URL to a PDF file.

doc

Optionally,, in lieu of file, an rJava reference to a PDDocument Java object.

pages

An optional integer vector specifying pages to extract from.

password

Optionally, a character string containing a user password to access a secured PDF.

copy

Specifies whether the original local file(s) should be copied to tempdir() before processing. FALSE by default. The argument is ignored if file is URL.

Details

get_n_pages returns the page length of a PDF document. get_page_dims extracts the dimensions of specified pages in a PDF document. This can be useful for figuring out how to specify the area argument in extract_tables

Value

For get_n_pages, an integer. For get_page_dims, a list of two-element numeric vectors specifying the width and height of each page, respectively.

Author(s)

Thomas J. Leeper <[email protected]>

References

Tabula

See Also

extract_tables, extract_text, make_thumbnails

Examples

1
2
3
4
5
# simple demo file
f <- system.file("examples", "data.pdf", package = "tabulizer")

get_n_pages(file = f)
get_page_dims(f)

ropensci/tabulizer documentation built on June 10, 2018, 3:11 p.m.