extract_text_from_pdfs: Extract text from PDF files


View source: R/extract_text_from_pdfs.R

Description

Extract text from PDF files and save it as a VCorpus object and/or as individual text files.

Usage

extract_text_from_pdfs(pdf_dir, output_dir, save_Rdata = TRUE,
  save_txt_files = FALSE, language = "en")

Arguments

pdf_dir

Directory containing PDF files to extract text from.

output_dir

Directory to save output to.

save_Rdata

Logical indicating whether to save the VCorpus object, named "all_reports", as an .Rdata file.

save_txt_files

Logical indicating whether to save documents as individual text files.

language

Language in which the documents are written (default "en").
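
Examples

The call below is a minimal sketch of how the arguments above might be combined; it is not taken from the package itself. The directory paths are hypothetical, and the example assumes that gensci.stm is installed and exports extract_text_from_pdfs. The name and location of the saved .Rdata file are not documented here, so the sketch assumes it is written directly into output_dir and looks it up by extension rather than guessing a file name.

```r
# Hypothetical paths; replace with your own directories.
pdf_dir    <- file.path("data", "pdfs")
output_dir <- file.path("data", "extracted")

# Run only when the package is installed and the input directory exists.
if (requireNamespace("gensci.stm", quietly = TRUE) && dir.exists(pdf_dir)) {
  dir.create(output_dir, recursive = TRUE, showWarnings = FALSE)

  gensci.stm::extract_text_from_pdfs(
    pdf_dir        = pdf_dir,
    output_dir     = output_dir,
    save_Rdata     = TRUE,   # save the VCorpus named "all_reports"
    save_txt_files = TRUE,   # also write the documents as text files
    language       = "en"
  )

  # Assumption: the .Rdata file is written directly into output_dir.
  rdata_files <- list.files(output_dir, pattern = "\\.Rdata$",
                            ignore.case = TRUE, full.names = TRUE)
  if (length(rdata_files) > 0) {
    load(rdata_files[1])
    # The documentation states the saved object is named "all_reports".
    if (exists("all_reports")) {
      print(length(all_reports))  # number of documents in the corpus
    }
  }
}
```

Setting both save_Rdata and save_txt_files to TRUE keeps the corpus available for later analysis in R while also leaving plain-text copies that can be inspected by hand.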


dtburk/gensci.stm documentation built on Nov. 13, 2019, 12:33 a.m.