Home

/

CRAN

/

tabulapdf: Extract Tables from PDF Documents

Bindings for the 'Tabula' <https://tabula.technology/> 'Java' library, which can extract tables from PDF files. This tool can reduce time and effort in data extraction processes in fields like investigative journalism. It allows for automatic and manual table extraction, the latter facilitated through a 'Shiny' interface, enabling manual areas selection\ with a computer mouse for data retrieval.

Package overview README.md Introduction to tabulapdf

Vignettes Man pages API and functions Files

Package details
Author	Thomas J. Leeper [aut] (<https://orcid.org/0000-0003-4097-6326>), Mauricio Vargas Sepulveda [aut, cre] (<https://orcid.org/0000-0003-1017-7574>), Tom Paskhalis [aut] (<https://orcid.org/0000-0001-9298-8850>), Manuel Aristaran [ctb], David Gohel [ctb] (rOpenSci reviewer), Lincoln Mullen [ctb] (rOpenSci reviewer), Munk School of Global Affairs and Public Policy [fnd]
Maintainer	Mauricio Vargas Sepulveda <m.sepulveda@mail.utoronto.ca>
License	Apache License (>= 2)
Version	1.0.5-5
URL	https://docs.ropensci.org/tabulapdf/ (website) https://github.com/ropensci/tabulapdf/
Package repository	View on CRAN
Installation	Install the latest version of this package by entering the following in R: `install.packages("tabulapdf")`