GetSmudges | R Documentation |
This function attempts to finds rows in the bounding box matrix/data.frame that might be smudges/specs from the scanning process. The approach this takes is to consider if they are sufficiently small in both height and width to be less than a character. This is ad hoc to say the least. One can implement additional or alternative approaches and this is just offered as a utility.
GetSmudges(bbox, threshold = 5, charWidth = GetCharWidth(bbox),
charHeight = GetCharHeight(bbox), anywhere = FALSE)
bbox |
the bounding box matrix/data from for the elements under consideration. |
threshold |
currently ignored |
charWidth |
a number for the typical character width on the page |
charHeight |
a number giving the typical character height on the page |
anywhere |
if |
An integer vector giving the indices of any rows in the bounding box matrix/data.frame that are considered specs/smudges by this approach.
Duncan Temple Lang
GetBoxes
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.