parse_html: Convert html to plain text
In ESHackathon/doi2txt: Download Full Text of Scientific Articles for Data Extraction

Strips the HTML markup from a downloaded article by calling the htm2txt package.

parse_html(html = NULL, url = NULL)

html

A single character string (a character vector of length 1) containing the HTML of a journal article

A character vector containing the plain-text version of the input HTML document, with paragraphs on separate lines.
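
The conversion described above can be sketched as follows. This is a minimal illustration, not the package's implementation: it calls htm2txt::htm2txt() directly, since the description says parse_html relies on the htm2txt package. The sample HTML string, the variable names, and the step that splits the result on newlines are assumptions made for the example.

```r
# Hypothetical input: a tiny HTML document standing in for a downloaded article.
html <- paste0(
  "<html><body><h1>Title</h1>",
  "<p>First paragraph.</p><p>Second paragraph.</p>",
  "</body></html>"
)

if (requireNamespace("htm2txt", quietly = TRUE)) {
  # htm2txt() strips the markup and returns plain text.
  txt <- htm2txt::htm2txt(html)

  # Assumption: splitting on newlines and dropping empty entries approximates
  # "paragraphs on separate lines"; parse_html may post-process differently.
  paragraphs <- unlist(strsplit(txt, "\n", fixed = TRUE))
  paragraphs <- paragraphs[nzchar(trimws(paragraphs))]
  print(paragraphs)
} else {
  # htm2txt is not installed; nothing is converted in this sketch.
  paragraphs <- character(0)
}
```

With doi2txt installed, the corresponding call according to the usage line would be parse_html(html = html). Its exact output may differ from this sketch.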

ESHackathon/doi2txt documentation built on Dec. 17, 2021, 5:39 p.m.
