derwent_extract: Extract sections of Derwent patent full texts to columns


View source: R/derwent_extract.R

Description

Given a data frame containing the full texts of patents, this function provides options to split the texts into number, title, abstract, description or claims. Options are also provided to break out only the title, abstract and claims ("tac") or all fields ("all").

Usage

derwent_extract(x, col, section, remove = TRUE)

Arguments

x

A data.frame containing patent full texts.

col

The column in x that holds the full texts.

section

One of number, title, abstract, description or claims, or "tac" (title, abstract and claims) or "all".

remove

Whether to remove the input column. Default is TRUE.

Value

A data.frame with the extracted sections added as new columns.

Examples

## Not run: 
derwent_extract(five_texts, "tac")
derwent_extract(five_texts, "all")
## End(Not run)
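
The examples above need the package and its five_texts data. To see the general shape of the result without either, the following base R sketch may help. It is not the package's implementation: extract_sketch() is a hypothetical stand-in, and the "Title:", "Abstract:" and "Claims:" labels in the toy texts are assumed purely for illustration and do not reflect the real Derwent layout. It only shows how a section can become a new column and how remove controls whether the input column is kept.

```r
# Hypothetical stand-in for derwent_extract(); NOT the package's code.
# Assumes each section sits on one line as "<Label>: <content>".
extract_sketch <- function(x, col, section, remove = TRUE) {
  text <- x[[col]]
  # Case-insensitive match of "<section>:" and capture the rest of the line
  pattern <- paste0("(?i)", section, ":\\s*([^\\n]*)")
  hits <- regmatches(text, regexec(pattern, text, perl = TRUE))
  # Take the captured group, or NA when the section is absent
  x[[section]] <- vapply(
    hits,
    function(h) if (length(h) >= 2) trimws(h[2]) else NA_character_,
    character(1)
  )
  # Drop the input column when requested
  if (remove) x[[col]] <- NULL
  x
}

# Toy data with an assumed layout
toy <- data.frame(
  id = 1:2,
  text = c("Title: Widget\nAbstract: A widget.\nClaims: 1. A widget.",
           "Title: Gadget\nAbstract: A gadget.\nClaims: 1. A gadget."),
  stringsAsFactors = FALSE
)

out <- extract_sketch(toy, "text", "title")
names(out)   # "id" "title"

# Keep the input column alongside the new one
kept <- extract_sketch(toy, "text", "claims", remove = FALSE)
names(kept)  # "id" "text" "claims"
```

With remove = TRUE the full-text column is replaced by the extracted section, which keeps the result compact; remove = FALSE is useful when several sections are extracted in successive calls.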

poldham/oldhammisc documentation built on May 25, 2019, 11:23 a.m.