read_page: A simple HTML page reader with debouncer This is a stupidly...

View source: R/main_scraping_helpers.R

read_pageR Documentation

A simple HTML page reader with debouncer This is a stupidly simple function to get around rate-limit or anti-crawling measures by dynamically waiting between each request for a set time. It's heuristic, and may still return a try-error that must be handled downstream. Ideally I'd use OOP and handle this at the app level instead of the request level.

Description

A simple HTML page reader with debouncer This is a stupidly simple function to get around rate-limit or anti-crawling measures by dynamically waiting between each request for a set time.

It's heuristic, and may still return a try-error that must be handled downstream. Ideally I'd use OOP and handle this at the app level instead of the request level.

Usage

read_page(page_url, max_wait = 30)

Arguments

page_url

(character) A URL we want to read

max_wait

(numeric) Maximum debounce

Note

max_wait


beansrowning/chinacovidR documentation built on April 11, 2022, 2:30 p.m.