pdf2html: Convert a pdf file to html

View source: R/pdf2html.R

pdf2htmlR Documentation

Convert a pdf file to html

Description

A blind user often has difficulty reading the content provided in pdf files. This tool is quite experimental at this stage.

Usage

pdf2html(pdffile, htmlfile=sub(".pdf", ".html", pdffile), HeadingLevels=4, PageTag="h6")

Arguments

pdffile

A pdf file to be converted.

htmlfile

The filename for the resulting html; default is to change the file extension.

HeadingLevels

The depth of heading level to include; tags h1, h2, h3... are used.

PageTag

Which tag to use for replacing page numbers; default is h6, but any tag could be used.

Details

A Python 2.7 module is the basis for the conversion. Some post-processing can be done to further enhance the readability of the resulting html file.

Value

Logical: Has the conversion completed. Note that this does not mean the result is totally useful as this will depend on the quality of the input file.

Author(s)

A. Jonathan R. Godfrey


BrailleR documentation built on July 26, 2023, 5:46 p.m.