The Guardian API Wrapper
The package provides an interface to the Open Platform's Content API of the Guardian Media Group. It retrieves content from news outlets The Observer, The Guardian, and guardian.co.uk from 1999 to current day. As of version 0.5, GuardianR connects to the Guardian Content API v2, which is much faster, but requires an API key and implements the limit of 12 Calls per second or 5,000 calls per day.
|License:||GPL (>= 2)|
The results of keyword search include news articles that match the search terms and were published after 1999. Function get_guardian accepts date intervals with multiple keywords (format: "keyword1+keyword2") and returns a data frame. Rows refer to news articles and columns to the news item's data. The resulting data frame includes the following 27 columns:
id News article ID
sectionId API news section ID to the news article
sectionName Guardian news section ID to the news article
webPublicationDate News article online publication date
webTitle Online headline
webUrl News article URL
apiUrl News article API URL
newspaperPageNumber Page where the news article was featured in the printed newspaper
trailText Introductions to the news article
headline Print headline
showInRelatedContent Related content (logical)
lastModified Date the news article was last modified (POSIXct)
score Score provided by the API relative to the query and importance of the result
standfirst Short summary to the news article
shortUrl Short URL to the news article
wordcount News article's word count
commentable Comments included (logical)
allowUgc Allows User Generated Content (logical)
isPremoderated Pre-moderated news articles (logical)
byline News article author
publication Source of the published news article (Oberserver, The Guardian, or guardian.co.uk)
newspaperEditionDate Publication date of print news article (POSIXct)
shouldHideAdverts Hide ads (logical)
liveBloggingNow live blogging (logical)
commentCloseDate Date comments closed (POSIXct)
body News article full text
At the time of this writing, Microsoft Windows does not have integration between the OS default CA certificates and OpenSS. Because of that, the function get_guardian will verify the operation system and when necessary download the SLL certificate "cacert.perm" to access https links.
To see how to citate the package, check citation("GuardianR").
Marco Bastos & Cornelius Puschmann
Maintainer: Marco Bastos <email@example.com>
More information on Content API is available on Guardian Item Reference Guide (http://www.guardian.co.uk/open-platform/content-api-item-reference-guide) and the Getting started guide (http://www.guardian.co.uk/open-platform/getting-started).
Want to suggest features or report bugs for rdrr.io? Use the GitHub issue tracker.