Description Usage Format Details
A dataset of 11,124,944 sentences drawn from the texts dataset, intended for use in machine learning with libraries such as spaCy.
data("sentences")
doc_id       character
text         character
places_root  logical
The titles and abstracts from the texts dataset, divided into sentences. For use with spaCy in Python, rename the places_root column to "answer" and replace TRUE with "accept" and FALSE with "reject". Then write the data to a file with jsonlite::stream_out(sentences, file("sentences.jsonl")). The resulting JSONL file can then be loaded directly into spaCy.
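The conversion described above could be sketched roughly as follows. This is a minimal illustration and not the package author's own code: the two-row data frame is a hypothetical stand-in for the packaged sentences data (same column names, invented rows), and the output goes to a temporary directory. Only jsonlite::stream_out() comes from the text above.

```r
library(jsonlite)

# Hypothetical stand-in for the packaged `sentences` data:
# same three columns, invented rows. With the real data, run
# data("sentences") instead of building this data frame.
sentences <- data.frame(
  doc_id = c("doc1", "doc2"),
  text = c("Samples were collected in Kenya.",
           "The method is described below."),
  places_root = c(TRUE, FALSE),
  stringsAsFactors = FALSE
)

# Recode the logical column as "accept"/"reject" in a new
# "answer" column, then drop the original column.
sentences$answer <- ifelse(sentences$places_root, "accept", "reject")
sentences$places_root <- NULL

# Write one JSON object per line (JSON Lines). stream_out()
# opens and closes the file connection itself.
out_path <- file.path(tempdir(), "sentences.jsonl")
stream_out(sentences, file(out_path), verbose = FALSE)
```

Each line of the resulting file is a single JSON object with the fields doc_id, text and answer. Writing with stream_out() rather than building one large JSON string is likely to matter at the full size of the dataset, since it emits the rows in pages.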