nlp_pubtator_read_dataset: PubTator Dataset

nlp_pubtator_read_dataset    R Documentation

PubTator Dataset

Description

The PubTator format stores the titles and abstracts of medical papers together with their tagged chunks (see PubTator Docs and MedMentions Docs for more information). This function creates a Spark DataFrame from a PubTator text file.
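
To make the layout concrete, the following base R sketch parses a tiny hand-written sample without Spark. It is an illustration only, not how nlp_pubtator_read_dataset is implemented: the sample text, the placeholder concept IDs, and the column names are assumptions chosen for this example. It relies on the usual PubTator convention that annotation offsets are zero-based, end-exclusive, and counted over the title and abstract joined by one separator character.

```r
# Hand-written sample in PubTator layout (illustrative, not real data).
# Text lines are "PMID|t|title" and "PMID|a|abstract"; annotation lines are
# tab-separated: PMID, start, end, mention, type, concept ID.
sample_lines <- c(
  "12345|t|Aspirin and headache",
  "12345|a|Aspirin relieves headache in adults.",
  "12345\t0\t7\tAspirin\tChemical\tX0001",   # X0001 is a placeholder ID
  "12345\t12\t20\theadache\tDisease\tX0002"  # X0002 is a placeholder ID
)

# Separate the title/abstract lines from the annotation lines.
is_text <- grepl("^[0-9]+\\|[ta]\\|", sample_lines)
title <- sub("^[0-9]+\\|t\\|", "", grep("\\|t\\|", sample_lines, value = TRUE))
abstract <- sub("^[0-9]+\\|a\\|", "", grep("\\|a\\|", sample_lines, value = TRUE))

# Read the tab-separated annotations; the column names are our own choice.
annotations <- read.delim(
  text = sample_lines[!is_text],
  header = FALSE,
  quote = "",
  stringsAsFactors = FALSE,
  col.names = c("pmid", "start", "end", "mention", "type", "concept_id")
)

# Recover each mention from its offsets: R strings are 1-based and
# end-inclusive, so a zero-based, end-exclusive span maps to (start + 1, end).
full_text <- paste(title, abstract)
recovered <- substring(full_text, annotations$start + 1, annotations$end)
stopifnot(identical(recovered, annotations$mention))
```

The final check confirms that the offsets in the sample point at the tagged mentions, which is a quick sanity test to run on a file before loading it into Spark.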

Usage

nlp_pubtator_read_dataset(sc, path)

Arguments

sc

A Spark connection

path

Path to a PubTator text file

Value

A Spark DataFrame created from the PubTator file
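
Examples

A minimal usage sketch, assuming the sparklyr and sparknlp packages are installed and a local Spark installation is available. The wrapper name read_pubtator_file, its master argument, and the file path in the commented call are hypothetical; only nlp_pubtator_read_dataset(sc, path) comes from the Usage section above.

```r
# Hypothetical helper: opens a Spark connection and reads one PubTator file.
read_pubtator_file <- function(path, master = "local") {
  # Load both namespaces before connecting so that sparknlp can register
  # itself with sparklyr (an assumption about how the extension is attached).
  for (pkg in c("sparklyr", "sparknlp")) {
    if (!requireNamespace(pkg, quietly = TRUE)) {
      stop("Package '", pkg, "' is required for this sketch.")
    }
  }
  sc <- sparklyr::spark_connect(master = master)
  # The connection is deliberately left open: the returned Spark DataFrame
  # is only usable while the connection is alive.
  sparknlp::nlp_pubtator_read_dataset(sc, path)
}

# Placeholder path, replace with a real PubTator file before running:
# pubtator_df <- read_pubtator_file("path/to/corpus_pubtator.txt")
```

When you are finished with the result, close the connection with sparklyr::spark_disconnect_all() or by disconnecting the specific connection you opened.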


r-spark/sparknlp documentation built on Oct. 15, 2022, 10:50 a.m.