scrapeHTML: Function scrapeHTML

scrapeHTML R Documentation

Function scrapeHTML

Description

Extracts information from HTML.

Usage

scrapeHTML(html, short = F, edit = T, save = F, plot = F, filename = NULL)

Arguments

html

A character string containing HTML code.

short

Logical value specifying whether only lines with verbal information or links should be returned. Defaults to F.

edit

Logical value specifying whether the resulting data.frame should be displayed/edited. Defaults to T.

save

Logical value specifying whether the HTML code should be saved to a CSV file. Defaults to F.

plot

Logical value specifying whether to plot the frequency of each HTML tag found in html. Defaults to F.

filename

Character value specifying the filename (used only if save is TRUE). If NULL (the default), as.numeric(Sys.time()) is used.

Details

Extracts information from HTML code (as returned by quantqual::getHTML, for example). Returns a data.frame with three columns: the first contains the HTML code, the second the extracted verbal information, and the third the extracted links.
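
To illustrate the three-column layout described above, here is a minimal base-R sketch. It is not the package's implementation: the helper name sketch_scrape, the column names (code, text, link) and the regular expressions are assumptions made for this example only.

```r
# Hypothetical helper, NOT part of quantqual: builds a data.frame with one row
# per HTML tag, holding the raw code, the visible text and any href/src target.
sketch_scrape <- function(html) {
  # Each piece is one tag plus the text that follows it up to the next tag
  pieces <- regmatches(html, gregexpr("<[^>]*>[^<]*", html))[[1]]

  # Visible text: drop the leading tag and surrounding whitespace
  text <- trimws(sub("^<[^>]*>", "", pieces))

  # Link: value of an href or src attribute, if the piece has one
  has_link <- grepl("(href|src)=\"[^\"]*\"", pieces)
  link <- ifelse(has_link,
                 sub(".*(href|src)=\"([^\"]*)\".*", "\\2", pieces),
                 "")

  # Column names are assumptions for this sketch
  data.frame(code = pieces, text = text, link = link,
             stringsAsFactors = FALSE)
}

out <- sketch_scrape('<p>Hello <a href="https://example.org">world</a></p>')

# Rough analogue of short = T: keep only rows with text or a link
out_short <- out[out$text != "" | out$link != "", ]
```

In this sketch, closing tags such as </a> yield rows with empty text and link fields; those are the rows the filter in the last line removes.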

Examples

scrapeHTML(getHTML())
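
A non-interactive variant may be useful in scripts. The sketch below passes a made-up HTML string instead of calling getHTML() and switches off editing, saving and plotting, using only the arguments listed under Usage. It assumes the quantqual package is installed and that scrapeHTML accepts a literal string as described under Arguments.

```r
# Made-up input for illustration; any character string with HTML code should do
html <- '<p>Hello <a href="https://example.org">world</a></p>'

# Run only if quantqual is available (assumption: installed from GitHub)
if (requireNamespace("quantqual", quietly = TRUE)) {
  res <- quantqual::scrapeHTML(html, short = TRUE, edit = FALSE,
                               save = FALSE, plot = FALSE)
  str(res)  # inspect the returned data.frame
}
```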

AndreasFischer1985/quantqual documentation built on June 20, 2022, 4:55 p.m.