title: "Impact from Traditional Metrics" date: "2020-10-04" always_allow_html: yes output: md_document: variant: gfm vignette: > %\VignetteIndexEntry{Vignette Title} %\VignetteEngine{knitr::rmarkdown} %\VignetteEncoding{UTF-8}
Citations are a common metric for determining the impact of research on the wider academic field. Both extract_pmid()
and extract_doi()
will provide data on citations (based on PubMed/Crossref repositories) and journal impact factor (based on Scimago Scientific Journal Rankings).
impact_cite()
The function impact_cite()
can also produce several metrics that can be used for the purposes of assessment of the more traditional research impact of publications. This includes:
Journal-Level: Impact Factor (IF) and Eigenfactor.
Paper-Level: Citations from online repositories (in addition to PubMed):
Google Scholar (cite_gs
): Accessed via the gcite package. (requires the Google Scholar ID for an individual to be supplied to gscholar
- does not support multiple IDs).
CrossRef (cite_cr
): Accessed via the rcrossref package. (requires crossref=TRUE).
Dimentions (cite_dim
): Accessed via the Dimentions API (requires dimentions=TRUE).
Scopus (cite_scopus
): Accessed via the rscopus package (requires scopus=TRUE and valid scopus API set via rscopus::set_api_key()
).
Open Citations (cite_oc
): Accessed via the Open Citations API (requires oc=TRUE). Note: this is felt to be too sparcely populated at present (Feb 2020) to be of practical use.
data_cite <- impactr::impact_cite(data = data, var_id = "pmid",
crossref=TRUE, dimentions=TRUE, scopus=FALSE, oc = TRUE,
gscholar = "Ol5uNSwAAAAJ&hl", metric=TRUE)
Citations (e.g. cite_max
) can be used to either provide summary statistics or visualisations.
Note: Google Scholar is typically being more sensitive but less specific in estimating citation count than Crossref / PubMed repositories.
Only Google Scholar and Open Citations allow exporation of the citations over time. This will be included in the output of impact_cite()
if either of those sources are selected.
This can be used to plot the longitudinal impact of the paper through citations over time.
As journal-level information is also extracted using impact_cite()
(e.g. impact factor / Eigenfactor) this can also allow some assessment of impact of the articles in relation to the journal benchmark.
For example, the plot below demonstrates the ratio of Paper Citations : Journal Impact Factor - any point above the horizontal line (ratio of 1:1) indicates the paper has gathered more citations than typical for that journal.
Citation metrics are author-level measures that attempts to assess both the productivity and citation impact of the publications. These include:
Total citations: Total number of citations that they have received in other publications.
H-Index: Based on the set of the scientist's most cited papers and the number of citations that they have received in other publications.
M-Quotient: A method to facilitate comparisons between academics with different lengths of academic careers. This divides the h-index by the number of years the academic has been active (measured as the number of years since the first published paper)
G-Index: It aims to improve on the h-index by giving more weight to highly-cited articles. The H-Index can "undervalue" highly cited papers as it ignores the number of citations to each individual article beyond what is needed to achieve a certain h-index.
impact_cite()
will produce common citation metrics automatically based on the papers included in the dataframe supplied. Alternatively this can be calculated directly using cite_metric()
.
data_cite$metric; impactr::cite_metric(data_cite$df$cite_max, data_cite$df$year)
## # A tibble: 1 x 4
## total_cite hindex gindex mquotient
## <dbl> <int> <int> <dbl>
## 1 591 13 24 6.5
## # A tibble: 1 x 4
## total_cite hindex gindex mquotient
## <dbl> <int> <int> <dbl>
## 1 591 13 24 6.5
$validation
) The Google Scholar ID is used to derive publications, and so all publications by an author (or authorship group) must be uploaded under one Google Scholar account.
As google scholar does not contain DOI or PMID, papers must be matched by title, and can only be matched if there is a google scholar record for each paper in the supplied dataframe. impact_cite()
provides several features to proactively identify issues via $validation
output.
The outcome will record either matched
, or the following:
noscholar
: If there is no corresponding google scholar record for the publication, then these will be listed here.
unmatch
: If the google scholar records were unable to be matched to the existing dataset (by title), then these will be listed here.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.