Description Usage Arguments Value Examples
The function works simply by extracting the 'href“ attribute from all 'a' nodes. It is called internally from 'archiv.fromUrl' but can be useful as a separate function if you want to filter which links you archive.
1 | extract_urls_from_webpage(url, except = NULL)
|
url |
The url to extract urls. |
except |
A regular expression for URLs to exclude from extraction |
a vector of urls.
1 2 3 4 | urlList <- extract_urls_from_webpage(
"https://www-cs-faculty.stanford.edu/~knuth/retd.html",
except="validator\\.w3\\.org"
)
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.