Description Getting Started Orchestrate AWS ScrapeBot References See Also
ScrapeBotR allows to easily retrieve (large amounts of) data from a ScrapeBot installation. The package provides easy-to-use functions to read and export instances, recipes, runs, log information, and data. The package also allows to orchestrate instances on your AWS account (caution: may cause real costs). Thereby, the package plugs neatly into the tidyverse as it makes heavy use of tibbles.
Create a credentials file to connect to your database. You may do so using write_scrapebot_credentials()
.
Create a connection object by using connect_scrapebot()
.
List the available recipes through get_recipes()
or the available instances through get_instances()
.
Get information about specific runs through get_runs()
.
Collect data via get_run_data()
.
Bulk-download (and resize) screenshots from S3 through collect_screenshots_from_s3()
.
disconnect()
from your database.
You can use this package to orchestrate your Amazon Web Services (AWS) account. Once set up with IAM user credentials, this package allows you to ...
set up an RDS database and install a new ScrapeBot central database therein
set up S3 storage to upload your screenshots to
set up multiple EC2 instances as, well, ScrapeBot instances, immediately ready to run
vary regions, browser sizes, language, and the like
terminate all of these servers for good
The ScrapeBot (without "R") is a non-R tool for so-called "agent-based testing" to automatically visit, modify, and scrape a defined set of webpages regularly. It was built to automate various web-based tasks and keep track of them in a controllable way for academic research, primarily in the realm of computational social science.
Haim, M. (2020). Agent-based testing: An automated approach toward artificial reactions to human behavior. Journalism Studies, 21(7), 895-911. https://dx.doi.org/10.1080/1461670x.2019.1702892
https://github.com/MarHai/ScrapeBot
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.