degrade_blur: Degrade PDF quality by simulating blurred text

View source: R/degrade.R

Description

Low-resolution, image-based PDFs are particularly susceptible to OCR errors. By simulating blurred text from text-embedded PDFs, we may find patterns of OCR errors that can be used to improve post-OCR correction.

Usage

degrade_blur(file, blur)

Arguments

file

Path to a text-embedded PDF to degrade

blur

Mean and standard deviation (s.d.) of the blur, given in the format 'm'x's', e.g., "2x2"
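
The blur argument packs two numbers into a single value of the form 'm'x's'. As a minimal sketch, assuming the value is passed as a character string, the helpers below build and check such a string before it is handed to degrade_blur. Both make_blur and is_valid_blur are hypothetical names used for illustration; they are not part of ocrerrors.

```r
# Hypothetical helper (not part of ocrerrors): build a blur string such as "2x2"
# from a mean and a standard deviation.
make_blur <- function(mean, sd) {
  stopifnot(
    is.numeric(mean), length(mean) == 1, mean >= 0,
    is.numeric(sd), length(sd) == 1, sd >= 0
  )
  paste0(mean, "x", sd)
}

# Hypothetical helper: check that a value looks like "<number>x<number>".
is_valid_blur <- function(blur) {
  is.character(blur) && length(blur) == 1 &&
    grepl("^[0-9]+(\\.[0-9]+)?x[0-9]+(\\.[0-9]+)?$", blur)
}

blur <- make_blur(2, 2)   # "2x2"
is_valid_blur(blur)       # TRUE
is_valid_blur("2by2")     # FALSE
```

Validating the string up front is only a convenience; how degrade_blur itself reacts to a malformed value is not described here.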

Value

A data.frame with the input and output file names and the exit status
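
Because the result is a data.frame, failed runs can be picked out with ordinary subsetting. The sketch below uses a mock data.frame in place of real output: the column names infile, outfile and status, and the file names themselves, are assumptions made for illustration, and the actual names should be checked with names() on a real result.

```r
# Mock result standing in for degrade_blur() output. The column names
# (infile, outfile, status) and file names are assumptions, not the
# package's documented values.
res <- data.frame(
  infile  = c("good.pdf", "other.pdf"),
  outfile = c("good_blur.pdf", "other_blur.pdf"),
  status  = c(0L, 1L),
  stringsAsFactors = FALSE
)

# By the usual convention, a non-zero exit status signals a failed command.
failed <- res[res$status != 0, , drop = FALSE]
failed$infile
```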

See Also

degrade_density, degrade_fax, degrade_complex, degrade_rotate

Examples

# degrade_blur("good.pdf", "2x2")

jacob-ogre/ocrerrors documentation built on May 18, 2019, 8:01 a.m.