cora: Cora Data for Entity Resolution

Duplicated publication data (pre-processed and formatted) for entity resolution. This data set contains a total of 1879 records. The following variables are included in the data set: id, title, book title, authors, address, date, year, editor, journal, volume, pages, publisher, institution, type, tech, note. The data set has a respective gold data set that provides information on which records match based on id.

Getting started

Package details

AuthorRebecca Steorts [aut, cre], Andee Kaplan [aut], Srini Sunil [aut]
MaintainerRebecca Steorts <>
Package repositoryView on CRAN
Installation Install the latest version of this package by entering the following in R:

Try the cora package in your browser

Any scripts or data that you put into this service are public.

cora documentation built on Oct. 23, 2020, 7:58 p.m.