View source: R/search_eurostat.R
search_eurostat | R Documentation |
Lists the items in the Eurostat table of contents (TOC) whose titles match a given pattern.
search_eurostat(
  pattern,
  type = "dataset",
  column = "title",
  fixed = TRUE,
  lang = "en"
)
pattern
Text string used to search dataset, folder or table titles, depending on the type argument.
type
Type of items to be searched. Default is "dataset".
column
Column of the TOC in which the search is done. Default is "title".
fixed
Logical. If TRUE (default), pattern is a string to be matched as is. See grep().
lang
2-letter language code, default is "en".
Downloads the list of all datasets available on Eurostat and returns the datasets whose description contains the given pattern, e.g. all datasets related to education or teaching.
If you wish to search fields other than the item title, you can download the Eurostat Table of Contents manually using the get_eurostat_toc() function and apply grep() to it as usual. The data browser on the Eurostat website may also return useful results.
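As a rough illustration of the manual approach, the sketch below runs grep() against a column other than the title. The small data frame is a hypothetical stand-in for the output of get_eurostat_toc(), with made-up titles and codes, so that the example runs without a network connection; the real TOC is far larger and has more columns.

```r
# Hypothetical stand-in for the result of get_eurostat_toc().
# Titles and codes are invented for illustration only.
toc <- data.frame(
  title = c("Pupils and students enrolled",
            "Live births by region",
            "Teachers by age"),
  code  = c("educ_demo", "demo_births", "educ_teach"),
  type  = c("dataset", "dataset", "table"),
  stringsAsFactors = FALSE
)

# Search the code column instead of the title column.
hits <- grep("^educ_", toc$code)
toc_educ <- toc[hits, ]
print(toc_educ)
```

With a real TOC, the same subsetting would apply after replacing the mock data frame with the result of get_eurostat_toc().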
A tibble with nine columns:
Dataset title in English (default)
Each item (dataset, table and folder) of the TOC has a unique code which allows it to be identified in the API. Used in the get_eurostat() and get_eurostat_raw() functions to retrieve datasets.
dataset, folder or table
Date, indicates the last time the dataset/table was updated (format DD.MM.YYYY or %d.%m.%Y)
Date, indicates the last time the dataset/table structure was modified (format DD.MM.YYYY or %d.%m.%Y)
Date of the oldest value included in the dataset (if available) (format usually YYYY or %Y but can also be YYYY-MM, YYYY-MM-DD, YYYY-SN, YYYY-QN etc.)
Date of the most recent value included in the dataset (if available) (format usually YYYY or %Y but can also be YYYY-MM, YYYY-MM-DD, YYYY-SN, YYYY-QN etc.)
Number of actual values included in the dataset
Hierarchy of the data navigation tree, represented in the original txt file by a four-space indentation prefix in the title
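The formats described above can be handled with base R. The following is a minimal sketch on made-up values: it parses update dates given as DD.MM.YYYY and derives a nesting depth from the four-space indentation prefix. The vector names are hypothetical, and the zero-based depth computed here is an assumption that may not match the numbering the package itself uses.

```r
# Made-up update dates in the DD.MM.YYYY format described above.
last_update_raw <- c("15.03.2023", "02.11.2022", NA)
last_update <- as.Date(last_update_raw, format = "%d.%m.%Y")
print(last_update)

# Made-up raw titles with a four-space indentation prefix per level.
raw_titles <- c("Top folder",
                "    Sub folder",
                "        Some dataset")

# Count leading spaces, then divide by four to get a zero-based depth.
leading_spaces <- nchar(raw_titles) - nchar(sub("^ +", "", raw_titles))
depth <- leading_spaces / 4
print(depth)
```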
The Eurostat Table of Contents (TOC) is downloaded from https://ec.europa.eu/eurostat/api/dissemination/catalogue/toc/txt?lang=en (default) or from the French or German language variants:
https://ec.europa.eu/eurostat/api/dissemination/catalogue/toc/txt?lang=fr
https://ec.europa.eu/eurostat/api/dissemination/catalogue/toc/txt?lang=de
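The three addresses differ only in the lang query parameter. The helper below, toc_url(), is a hypothetical function written for illustration and is not part of the eurostat package; it only builds the address and does not download anything.

```r
# Hypothetical helper (not part of the eurostat package): build the TOC
# address for one of the three language variants listed above.
toc_url <- function(lang = "en") {
  # match.arg() stops with an error for any other language code.
  lang <- match.arg(lang, c("en", "fr", "de"))
  paste0(
    "https://ec.europa.eu/eurostat/api/dissemination/catalogue/toc/txt?lang=",
    lang
  )
}

print(toc_url())
print(toc_url("de"))
```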
See Eurostat documentation on TOC items: https://wikis.ec.europa.eu/display/EUROSTATHELP/API+-+Detailed+guidelines+-+Catalogue+API+-+TOC
Przemyslaw Biecek and Leo Lahti <ropengov-forum@googlegroups.com>
See citation("eurostat"):
Kindly cite the eurostat R package as follows:
Lahti L., Huovari J., Kainu M., and Biecek P. (2017). Retrieval and analysis of Eurostat open data with the eurostat package. The R Journal 9(1), pp. 385-392. doi: 10.32614/RJ-2017-019
Lahti, L., Huovari J., Kainu M., Biecek P., Hernangomez D., Antal D., and Kantanen P. (2023). eurostat: Tools for Eurostat Open Data [Computer software]. R package version 4.0.0. https://github.com/rOpenGov/eurostat
To see these entries in BibTeX format, use 'print(<citation>, bibtex=TRUE)', 'toBibtex(.)', or set 'options(citation.bibtex.max=999)'.
When citing data downloaded from Eurostat, see section "Citing Eurostat data" in get_eurostat() documentation.
get_eurostat(), search_eurostat()
tmp <- search_eurostat("education")
head(tmp)
# Use "fixed = TRUE" when pattern has characters that would need escaping.
# Here, parentheses would normally need to be escaped in regex
tmp <- search_eurostat("Live births (total) by NUTS 3 region", fixed = TRUE)
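# The effect of "fixed" can be sketched offline with base R's grepl(),
# assuming the argument behaves like grepl()'s own "fixed" argument.
# The titles below are made up for illustration.

```r
titles <- c("Live births (total) by NUTS 3 region",
            "Live births total by NUTS 3 region")
pattern <- "Live births (total)"

# Literal matching: the parentheses are ordinary characters.
literal_hits <- grepl(pattern, titles, fixed = TRUE)

# Regex matching: the parentheses form a group, so the pattern
# matches "Live births total" rather than the literal text.
regex_hits <- grepl(pattern, titles, fixed = FALSE)

# Escaping the parentheses restores the literal meaning in regex mode.
escaped_hits <- grepl("Live births \\(total\\)", titles, fixed = FALSE)

print(literal_hits)
print(regex_hits)
print(escaped_hits)
```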