jst_get_references: Extract all references

View source: R/references.R

jst_get_referencesR Documentation

Extract all references


This function extracts the content of ref-list from the xml-file.


jst_get_references(file_path, parse_refs = FALSE)



The path to the .xml-file from which references should be extracted.


Should references be parsed, if available?


This content may contain references or endnotes, depending on how the article used citations. Since references are currently not fully supported by DfR, there is no comprehensive documentation on the different variants. jstor therefore extracts the content of ref-list exactly as it appears in the data. Because of this, there might be other content present than references.

In order to get all available information on citation data, you might need to combine jst_get_references() with jst_get_footnotes().

For newer xml-files, there would be the option to extract single elements like authors, title or date of the source, but this is not yet implemented.

In general, the implementation is not as fast as jst_get_article() - articles with many references slow the process down.


A tibble with the following columns:

  • file_name: the identifier for the article the references come from.

  • ref_title: the title of the references sections.

  • ref_authors: a string of authors. Several authors are separated with ⁠;⁠.

  • ref_editors: a string of editors, if available.

  • ref_collab: a field that may contain information on the authors, if authors are not available.

  • ref_item_title: the title of the cited entry. For books this is often empty, with the title being in ref_source.

  • ref_year: a year, often the article's publication year, but not always.

  • ref_source: the source of the cited entry. For books often the title of the book, for articles the publisher of the journal.

  • ref_volume: the volume of the journal article.

  • ref_first_page: the first page of the article/chapter.

  • ref_last_page: the last page of the article/chapter.

  • ref_publisher: For books the publisher, for articles often missing.

  • ref_publication_type: Known types: book, journal, web, other.

  • ref_unparsed: The full references entry in unparsed form.



# import parsed references
  parse_refs = TRUE

jstor documentation built on Aug. 16, 2023, 5:09 p.m.