getTextByCols: Get the text arranged by each column.

getTextByColsR Documentation

Get the text arranged by each column.

Usage

getTextByCols(p, threshold = 0.1, asNodes = FALSE, txtNodes = getNodeSet(p, getXPathDocFontQuery(p, docFont)), bbox = getBBox2(txtNodes, TRUE), breaks = getColPositions(if (perPage) p else as(p, "XMLInternalDocument"), threshold = threshold, bbox = bbox, perPage = perPage, docFont = docFont, ...), perPage = FALSE, docFont = FALSE, order = FALSE, ...)

Arguments

p
threshold
asNodes
txtNodes
bbox
breaks
perPage
docFont
order
...

dsidavis/ReadPDF documentation built on June 12, 2025, 6:39 a.m.