dot-get_site: Get Site Content and Extract HTML Elements

.get_siteR Documentation

Get Site Content and Extract HTML Elements

Description

This function attempts to retrieve the HTML content of a URL, extract specific HTML elements (e.g., paragraphs, headings), and extract publication date information using the extract_date function.

Usage

.get_site(x)

Arguments

x

A URL to extract content and publication date from.

Value

A data frame with columns for the URL, HTML element types, text content, extracted date, and date source.


textpress documentation built on Oct. 14, 2024, 5:08 p.m.