A dataset returned from a query to GeoDeepDive (https://geodeepdive.org/) that includes Natural Language Processing elements from the Stanford NLP tools (https://nlp.stanford.edu/).
1 |
A data.frame
with 87,181 rows and 9 columns.
Unique identifier for the article within the GDD database.
Unique sentence index within the article.
Unique index of unique words within the sentences.
Sentence within the article, split by commas.
Parts of Speech from the Stanford tagger, matching the Penn State Treebank tags: https://www.ling.upenn.edu/courses/Fall_2003/ling001/penn_treebank_pos.html
Special classes (numbers, dates, &cetera)
Word types, based on universal dependencies (http://universaldependencies.org/introduction.html).
The word (from the word index) modified by the typed word.
@source https://geodeepdive.org/
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.