Description Usage Arguments Value Examples
Extract PDF
document
1 2 3 4 5 6 7 8 9 | layout_control(
line_overlap = 0.5,
char_margin = 2,
line_margin = 0.5,
word_margin = 0.1,
boxes_flow = 0.5,
detect_vertical = FALSE,
all_texts = FALSE
)
|
line_overlap |
a double, if two characters have more overlap than this they are considered to be on the same line. The overlap is specified relative to the minimum height of both characters. |
char_margin |
a double, if two characters are closer together than this margin they are considered part of the same line. The margin is specified relative to the width of the character. |
line_margin |
a double, if two characters on the same line are further apart than this margin then they are considered to be two separate words, and an intermediate space will be added for readability. The margin is specified relative to the width of the character. |
word_margin |
a double, if two lines are are close together they are considered to be part of the same paragraph. The margin is specified relative to the height of a line. |
boxes_flow |
a double, Specifies how much a horizontal and vertical
position of a text matters when determining the order of text boxes.
The value should be within the range of |
detect_vertical |
a logical, If vertical text should be considered during layout analysis |
all_texts |
a logical, If layout analysis should be performed on text in figures. |
Returns a list with the layout control variables.
1 |
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.