kindCodes: A kind codes database to show the type of document for each...

Description Usage Format Details

Description

Patent documents have associated kind codes, which are letter/number code combinations that signify the type of document, such as application, granted patent, utility patent, etc. These kind codes vary by country and are a useful approach to classifying patent document types. Most, however, not all, downloaded data from free services such as sumobrain.com or lens.org includes the kind code at the end of the patent document number.

Usage

1

Format

A data frame.

countryCode

The country code for the originating office where the application was filed.

kindCode

The letter/number code to signify the type of document. Codes may change after a certain date, so pay attention to dateStarted and dateDeprecated

isDeprecated

Logical TRUE/FALSE if the kind code for the country is no longer in use.

dateDeprecated

The date the kind code stopped being in use.

isNew

If the kind code is a replacement for a former kind code, TRUE, else FALSE.

dateStarted

If isNew == TRUE, the date the new kind code began being used.

comment

Additional information explaining the details of the kind code.

docTypeLong

The long version of the document type.

docType

A shorter, standardized version of docTypeLong.

expectDuplicate

A logical TRUE/FALSE to help the analyst understand if the published document is expected to have a duplicate publication. For example, USB2 is a granted patent that has an application that was also published, whereas USB1 has no previous documents published. This helps speed up the deduplication process.

countryAndKindCode

A concatenation of country code and kind code. Used in the deduplication process and to determine the type of document.

Details

For example, from the sumobrain.com download from the acars data set, here are three documents:

  1. US6523912

  2. US20030060197

  3. EP1310400A1

The first two items are missing kind codes. The third item has kind code A1 and the country code is EP.

To clean the data yourself:

temp <- readxl::read_excel(system.file("extdata", "kindCodes.xlsx", package = "patentr"))

temp <- replace(temp, is.na(temp), "NA")

temp$dateDeprecated <- as.numeric(temp$dateDeprecated)

temp$dateDeprecated <- as.Date(temp$dateDeprecated, origin = "1899-12-30")

temp$dateStarted <- as.numeric(temp$dateStarted)

temp$dateStarted <- as.Date(temp$dateStarted, origin = "1899-12-30")

temp$countryAndKindCode <- with(temp,paste0(countryCode, kindCode))

See https://www.r-bloggers.com/date-formats-in-r/ for excel mac/windows and confirm this origin works for you by reviewing the source file.

View the data sources:

  1. USPTO kind codes

  2. CAS list of kind codes

  3. IP Book kind codes

  4. Thomson File Histories

For esp@cenet at the EPO, try the following link: https://worldwide.espacenet.com/help?locale=en_EP&method= handleHelpTopic&topic=kindcodes%5C


kamilien1/patentR documentation built on May 20, 2019, 7:19 a.m.