extract_urls_from_webpage: Extracts the urls from a webpage.
In QualitativeDataRepository/archivr: Save URLs to Perma.cc or the Wayback Machine

Description Usage Arguments Value Examples

The function works simply by extracting the 'href“ attribute from all 'a' nodes. It is called internally from 'archiv.fromUrl' but can be useful as a separate function if you want to filter which links you archive.

1	extract_urls_from_webpage(url, except = NULL)

`url`	The url to extract urls.
`except`	A regular expression for URLs to exclude from extraction

a vector of urls.

urlList <- extract_urls_from_webpage(
     "https://www-cs-faculty.stanford.edu/~knuth/retd.html",
     except="validator\\.w3\\.org"
     )

QualitativeDataRepository/archivr documentation built on Feb. 9, 2022, 8:32 p.m.

QualitativeDataRepository/archivr index

README.md

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

QualitativeDataRepository/archivr
Save URLs to Perma.cc or the Wayback Machine

extract_urls_from_webpage: Extracts the urls from a webpage.
In QualitativeDataRepository/archivr: Save URLs to Perma.cc or the Wayback Machine

Description

Usage

Arguments

Value

Examples

Related to extract_urls_from_webpage in QualitativeDataRepository/archivr...

R Package Documentation

Browse R Packages

We want your feedback!

QualitativeDataRepository/archivr Save URLs to Perma.cc or the Wayback Machine

extract_urls_from_webpage: Extracts the urls from a webpage. In QualitativeDataRepository/archivr: Save URLs to Perma.cc or the Wayback Machine

Description

Usage

Arguments

Value

Examples

Related to extract_urls_from_webpage in QualitativeDataRepository/archivr...

R Package Documentation

Browse R Packages

We want your feedback!

QualitativeDataRepository/archivr
Save URLs to Perma.cc or the Wayback Machine

extract_urls_from_webpage: Extracts the urls from a webpage.
In QualitativeDataRepository/archivr: Save URLs to Perma.cc or the Wayback Machine