Man pages for daiR
Interface with Google Cloud Document AI API

build_block_df            Build block dataframe
build_token_df            Build token dataframe
create_processor          Create processor
dai_async                 OCR documents asynchronously
dai_auth                  Check authentication
dai_notify                Notify on job completion
dai_status                Check job status
dai_sync                  OCR document synchronously
dai_token                 Produce access token
dai_user                  Get user information
defunct                   Defunct functions
delete_processor          Delete processor
deprecated                Deprecated functions
disable_processor         Disable processor
dot-onAttach              Run when daiR is attached
draw_blocks               Draw block bounding boxes
draw_entities             Draw entity bounding boxes
draw_lines                Draw line bounding boxes
draw_paragraphs           Draw paragraph bounding boxes
draw_tokens               Draw token bounding boxes
enable_processor          Enable processor
from_labelme              Extract block coordinates from labelme files
get_entities              Get entities
get_processor_info        Get information about processor
get_processors            List created processors
get_processor_versions    List available versions of processor
get_project_id            Get project id
get_tables                Get tables
get_text                  Get text
image_to_pdf              Convert images to PDF
img_to_binbase            Image to base64 tiff
is_colour                 Check that a string is a valid colour representation
is_json                   Check that a file is JSON
is_pdf                    Check that a file is PDF
list_processor_types      List available processor types
make_hocr                 Make hOCR file
merge_shards              Merge shards
pdf_to_binbase            PDF to base64 tiff
reassign_tokens           Assign tokens to new blocks
reassign_tokens2          Assign tokens to a single new block
redraw_blocks             Inspect revised block bounding boxes
split_block               Split a block bounding box
tables_from_dai_file      Get tables from output file
tables_from_dai_response  Get tables from response object
text_from_dai_file        Get text from output file
text_from_dai_response    Get text from HTTP response object

daiR documentation built on April 12, 2025, 1:39 a.m.