ocr_pdfR_fr: ocr a pdf file via R with multicore support

Description Usage Arguments

View source: R/read_pdf.R

Description

wrap of the ocr pdf with multicore support

Usage

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
ocr_pdfR_fr(
  pdf_file,
  hmax = 300,
  cropped_tm_dir,
  azure_creds,
  box_push_to_az = F,
  box_highlight = F,
  remove_fl = F,
  dpi = 400,
  main_cl = 1,
  sub_cl = 3
)

Arguments

pdf_file

pdf file location

hmax

maximum box height to include

cropped_tm_dir

temporary dir for cropped img

azure_creds

azure credential

box_push_to_az

whether to push the individual boxes to azure

box_highlight

whether to highlight boxes, this allows the check to use results from the main push if it seems to work well

remove_fl

whether to remove files written by the box level push

dpi

dpi of the converted images

main_cl

number of cores for main processes

sub_cl

number of cores for sub processes


Atan1988/alvision documentation built on Dec. 15, 2020, 7:03 a.m.