htm2txt: Convert Html into Text

Convert a html document to simple plain texts by removing all html tags. This package utilizes regular expressions to strip off html tags. It also offers gettxt() and browse() function, which enables you to get or browse texts at a certain web page.

Getting started

Package details

AuthorSangchul Park [aut, cre]
MaintainerSangchul Park <[email protected]>
LicenseGPL (>= 2)
Package repositoryView on CRAN
Installation Install the latest version of this package by entering the following in R:

Try the htm2txt package in your browser

Any scripts or data that you put into this service are public.

htm2txt documentation built on Nov. 17, 2017, 7:13 a.m.