pdf_02_filter_lines: Filter lines in a PDF file

View source: R/pdf.R

pdf_02_filter_linesR Documentation

Filter lines in a PDF file

Description

Filter lines from text extracted from a PDF for given words.

Usage

pdf_02_filter_lines(extracted_text, pdf_file, page = NULL, scale)

Arguments

extracted_text

A list of strings, containing text extracted from the PDF file.

pdf_file

Path to the PDF file.

page

Optional integer: Page number in the PDF used for extraction, defaults to NULL.

scale

The word which is searched in lines and then filtered.

Value

A vector of strings, for all lines that contains the word in scale.


jtrampush/npsych.data documentation built on June 12, 2025, 2:20 p.m.