WikipediR
tw_search
and related functionstw_search
and related functionstw_get_wikipedia_section_links
when url provided as inputtw_search
tw_get_property_label
usethis
dependency (PR by @olivroy)tw_get_wikipedia_category_members
now works with categories that have more than 1000 member pages and consistently stores data in language-appropriate cache file also when language is derived from urltw_get
now caches Wikidata items that have been deleted, storing the error message as the value of property "error"tw_search
now returns results consistently also when description missing in given language (issue stemming from update in a dependency)tw_search
now checks cache efficiently also when cache settings are passed as parameterspurrr
and tidyselect
tw_get_wikipedia_page_links()
now works more efficiently with cache and deals more graciously when given links to non-existing Wikipedia pagestw_get_wikipedia_category_members()
now works as expected also when caching is disabledtw_get_wikipedia_page_links()
now works consistently when non-standard characters or non-latin alphabets found in urltw_get_wikipedia_page_qid()
now works consistently when url given is not in the language set with tw_set_language()
tw_get_wikipedia_category_members()
NA
values when tw_get_qualifiers()
would return no information on a given item/property combination; this ensures caching works as expected also in such cases.dbplyr
update/incompatibility with the pool
packagetw_get_p_wide()
offers a more efficient way to get wide data frames with a number of properties (with relative labels) for a set of Wikidata id; it also facilitates collapsing values for ease of export to csv.tw_get_p1()
as a shorthand for tw_get_p(only_first = TRUE, preferred = TRUE)
tw_get_property()
or tw_get_label()
, it looks for missing data if not all are present in given data frametw_index_cache_item()
and tw_index_cache_search()
. These should be significantly improved performance on the caching backend, but these are currently introduced as an opt-in as they have not been thoroughly tested with different database drivers. Running them once drastically improve performance with MySql when the cache goes into million of items. See also the vignette on caching.tw_get_p()
tw_get()
and other functions as input, missing items are queriedtw_get_wikipedia()
consistently returns vector of the same length as inputrank
column mistakenly dropped in some cases when using tw_get_property()
tw_check_qid()
has more parameters to deal with different use casestw_check_cache_index()
, tw_index_cache_item()
, and tw_index_cache_search()
). Tested with MySql._single
functions used internally are now not exported to facilitate auto-completetw_get_wikipedia_page_sections()
- and then extract the links from a specific section - tw_get_wikipedia_page_section_links()
tw_get_all_with_p()
tw_search()
now has separate parameters for the search language, and the language in which label and description and returned (previously, these were always in English)tw_get_qualifiers()
when qualifier value is of type quantitytw_get()
now keeps rank by default, facilitating retrieving more relevant resultstw_get_qualifiers()
now includes ranking such as "preferred", "normal", and "deprecated" associated with each property, as well as value type of the output (new format incompatible with previous cache, reset cache with tw_reset_qualifiers_cache()
)tw_get_qualifiers()
not returns correctly value when qualifier value type is a string (not a Wikidata identifier, not a date)tw_get_image()
not respects all parameters consistently. tw_get_image()
and tw_get_image_metadata()
are now briefly described in the README.tw_get_wikipedia_page_links()
now caches results, and provides more consistent results as a data frametw_get_p()
as an alias of tw_get_property_same_length()
for brevitytw_get_p()
to deal with common pattern when only "preferred" or most recent property should be returned, rather than whatever Wikidata has first in the list, and add new section in the readmetw_qid_meps
and tw_qid_airports
include_id_and_p
parameter tw_get_qualifiers()
tw_set_cache_db()
for easier use of alternatives to SQLitetw_get_image()
now returns consistently valid links if format is set to 'embed'; it is now possible to get a direct link to images with a given resolution with the width parametertw_get_image_metadata()
to obtain adequate credits to be included when images are usedtw_get_property()
now consistently returns data frame with properties in the same order as given and does not fail if given invalid valuestw_get_label()
, now actually always returns vector of the same length as inputNA
are now ignored by tw_get()
tw_get_label()
actually returns vector of the same length as inputtw_get_property_same_length()
for easier integration with piped operationstw_get_property_with_details()
to extract additional details such as language or unit of a property that are otherwise discarded with tw_get_property()
. tw_get_property_with_details()
, however, does not (yet) cache results.Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.