scraping_rosters_acb | R Documentation |
This function retrieves the basic information of each player, including the birth date, which makes it possible to compute each player's age on the date of each game. The data are collected from https://www.acb.com.
scraping_rosters_acb(pcode, verbose = TRUE, accents = FALSE,
r_user = "guillermo.vinue@uv.es")
pcode |
Code corresponding to the player's website to scrape. |
verbose |
Should R report information on progress? Default TRUE. |
accents |
Should we keep the Spanish accents? The recommended option is to remove them, so default FALSE. |
r_user |
Email address that identifies the user when web scraping. Providing it is a polite practice that shows the user is working as transparently as possible and with a research purpose. |
Some players hold a licence that does not necessarily match their nationality, so that they are not counted as foreign players under the current ACB rules.
Data frame with eight columns:
CombinID: Unique ID to identify the players.
Player: Player's name.
Position: Player's position on the court.
Height: Player's height.
Date_birth: Player's birth date.
Nationality: Player's nationality.
Licence: Player's licence.
Website_player: Website.
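The age computation mentioned in the description can be sketched from the Date_birth column as follows. This is only an illustration: the helper age_on_date is hypothetical (it is not part of the package), and the day/month/year format of the birth date string is an assumption that should be checked against the actual output.

```r
# Hypothetical helper (not part of the package): age in completed years
# on the date of a game, computed with base R only.
age_on_date <- function(birth, game) {
  b <- as.POSIXlt(birth)
  g <- as.POSIXlt(game)
  years <- g$year - b$year
  # Subtract one year if the birthday has not yet occurred in the game's year.
  before_birthday <- g$mon < b$mon | (g$mon == b$mon & g$mday < b$mday)
  years - as.integer(before_birthday)
}

# Assumption: Date_birth is stored as "dd/mm/yyyy"; adjust `format` if not.
birth <- as.Date("15/03/1990", format = "%d/%m/%Y")
games <- as.Date(c("2018-03-14", "2018-03-15"))
ages <- age_on_date(birth, games)
```

Here ages holds one value per game date, so the same call works for a whole column of game dates for a given player.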
In addition to using the email address to stay identifiable, the function also sends two headers with the R platform and version used.
Furthermore, even though the robots.txt file at
https://www.acb.com/robots.txt states no scraping
limitations and allows all robots complete access,
the function also includes the command Sys.sleep(2)
to pause for 2 seconds between requests. In this way, the server is not burdened
with multiple requests in quick succession and the scraping remains friendly.
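The pattern described in this note (identification headers plus a pause between requests) can be sketched as below. This is not the package's internal code: polite_headers and polite_fetch_all are hypothetical helpers, the header names are placeholders since this page does not state them, and the fetch argument stands in for whatever HTTP client is actually used.

```r
# Hypothetical helper: identification headers built from the user's
# e-mail address and the running R session. Header names are assumptions.
polite_headers <- function(r_user) {
  c(From = r_user,
    `X-R-Platform` = R.version$platform,
    `X-R-Version` = R.version$version.string)
}

# Hypothetical helper: apply `fetch(url, headers)` to each URL,
# sleeping `pause` seconds between consecutive requests.
polite_fetch_all <- function(urls, fetch, r_user, pause = 2) {
  headers <- polite_headers(r_user)
  out <- vector("list", length(urls))
  for (i in seq_along(urls)) {
    out[[i]] <- fetch(urls[i], headers)
    # No need to wait after the last request.
    if (i < length(urls)) Sys.sleep(pause)
  }
  out
}

# Stub in place of a real HTTP call, so the sketch runs offline.
fake_fetch <- function(url, headers) paste("fetched", url, "as", headers[["From"]])
pages <- polite_fetch_all(c("page_a", "page_b"), fake_fetch,
                          r_user = "user@example.org", pause = 0)
```

Passing the fetch function as an argument keeps the waiting logic separate from the HTTP client, so the loop can be tried without touching the server.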
Guillermo Vinue
do_scraping_rosters
## Not run:
# Not needed to scrape every time the package is checked, built and installed.
df_bio <- scraping_rosters_acb("56C", verbose = TRUE, accents = FALSE,
r_user = "guillermo.vinue@uv.es")
## End(Not run)