tidy_scrap | R Documentation |
This function is used to scrape a tibble from a website.
tidy_scrap(link, nodes, colnames, clean = FALSE, askRobot = FALSE)
link |
the link of the web page to scrape |
nodes |
the vector of HTML or CSS elements to consider, the SelectorGadget tool is highly recommended. |
colnames |
the names of the expected columns. |
clean |
logical. Should the function clean the extracted tibble or not ? Default is FALSE. |
askRobot |
logical. Should the function ask the robots.txt if we're allowed or not to scrape the web page ? Default is FALSE. |
a tidy data frame.
# Extracting imdb movie titles and rating link <- "https://www.imdb.com/chart/top/" my_nodes <- c(".titleColumn a", "strong") names <- c("title", "rating") tidy_scrap(link, my_nodes, names)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.