gpluck_extract_tables: Extract Tables from a PDF

View source: R/pdf.R

gpluck_extract_tablesR Documentation

Extract Tables from a PDF

Description

This function returns the tables in a PDF file as parsed by the tabulapdf package.

Usage

gpluck_extract_tables(
  file,
  pages = NULL,
  area = NULL,
  guess = FALSE,
  method = c("decide", "lattice", "stream"),
  output = c("matrix", "data.frame", "character", "asis", "csv", "tsv", "json"),
  ...
)

Arguments

file

The path to the PDF file.

pages

A single page number or vector of page numbers, Default: NULL

area

An area on the page given as c(x0, y0, x1, y1). Default: NULL

guess

Whether to attempt to detect tables when the coordinates are not given. Default: FALSE

method

Method to use for parsing. Default: c("decide", "lattice", "stream")

output

Output format (see extract_tables for more details). Default: c("matrix", "data.frame", "character", "asis", "csv", "tsv", "json")

...

Other arguments to extract_tables

Details

This is a wrapper around the extract_tables function that allows for easier access to the tables in a PDF document.

Value

A parsed table in either matrix, data frame, character, asis, csv, tsv or json format.

See Also

extract_tables


jtrampush/npsych.data documentation built on Feb. 25, 2025, 12:30 a.m.