Wayback_Pull: Wayback Machine scraping example for table information


Description

Wayback_Pull takes a vector of URLs for the Wayback Machine archive pages of your URL of interest (see the default url.vector argument for an example) and scrapes, from the URLs you want, a table identified by a minimal CSS selector. The approach is somewhat arbitrary, since it is based on the table structure of this particular page, but it could easily be extended to loop over CSS selectors for individual table columns, etc.

Usage

  Wayback_Pull(url.vector = c("https://web.archive.org/web/20150401000000*/https://www.fxcm.com/forex/forex-pricing/",
  "https://web.archive.org/web/20160901000000*/https://www.fxcm.com/forex/forex-pricing/"),
  css = ".center td", col.names = c("Pair", "Spread"))
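The two default archive URLs above share a visible pattern: a 14-digit timestamp, an asterisk, and then the target page. As a minimal sketch, a url.vector of that shape could be assembled as follows. The helper name build_wayback_urls is hypothetical and is not part of this package; only base R is assumed.

```r
# Hypothetical helper (not part of the package): builds archive-page URLs
# in the same shape as the default url.vector shown in Usage.
build_wayback_urls <- function(timestamps, target) {
  paste0("https://web.archive.org/web/", timestamps, "*/", target)
}

urls <- build_wayback_urls(
  timestamps = c("20150401000000", "20160901000000"),
  target = "https://www.fxcm.com/forex/forex-pricing/"
)
```

The resulting character vector could then be passed as url.vector.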

Arguments

url.vector

Vector of URLs of Wayback Machine archive pages

css

Minimal CSS selector for the table of information

col.names

Vector of length 2 giving the table column names
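To illustrate how css and col.names might fit together, here is a rough sketch in base R. It assumes that a selector such as ".center td" yields one string per table cell, in row order, and that the cells are then folded into a two-column data frame. The function cells_to_table and the cell values are invented for illustration; the package's actual internals may differ.

```r
# Made-up cell text standing in for what a selector like ".center td"
# might return: one string per cell, read row by row.
cells <- c("EUR/USD", "1.3", "GBP/USD", "1.9", "USD/JPY", "1.2")

# Hypothetical helper: fold a flat vector of cells into a 2-column table.
cells_to_table <- function(cells, col.names = c("Pair", "Spread")) {
  stopifnot(length(col.names) == 2, length(cells) %% 2 == 0)
  m <- matrix(cells, ncol = 2, byrow = TRUE)   # every 2 cells form a row
  df <- as.data.frame(m, stringsAsFactors = FALSE)
  names(df) <- col.names
  df
}

tab <- cells_to_table(cells)
```

Filling the matrix by row is what ties the flat cell order to the two names in col.names; a table with more columns would need a different ncol and a longer col.names.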

Value

List of data frames of scraped information
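A returned list of data frames can be stacked into a single table for analysis. The sketch below uses a made-up stand-in for the return value and assumes each list element has the same two columns; the snapshot column and the variable names are illustrative only.

```r
# Made-up stand-in for the returned list (values are not real scraped data).
scraped <- list(
  data.frame(Pair = c("EUR/USD", "GBP/USD"), Spread = c("1.3", "1.9"),
             stringsAsFactors = FALSE),
  data.frame(Pair = "EUR/USD", Spread = "1.4", stringsAsFactors = FALSE)
)

# Tag each data frame with its position in the list, then stack the rows.
tagged <- Map(function(df, i) cbind(snapshot = i, df),
              scraped, seq_along(scraped))
combined <- do.call(rbind, tagged)
```

Keeping the list index as a column preserves which element each row came from once the tables are merged.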


OJWatson/waities documentation built on May 7, 2019, 8:34 p.m.