fst_dev_coop_2 | R Documentation |
This dataset contains the responses to q11_3 of the Development Cooperation dataset, in CoNLL-U format with NLTK stopwords and punctuation removed.
fst_dev_coop_2
A data frame with 4192 rows and 14 columns:
doc_id: The identifier of the document.
paragraph_id: The identifier of the paragraph.
sentence_id: The identifier of the sentence.
sentence: The text of the sentence that this token is part of.
token_id: Word index, integer starting at 1 for each new sentence; may be a range for multi-word tokens; may be a decimal number for empty nodes.
token: Word form or punctuation symbol.
lemma: Lemma or stem of word form.
upos: Universal part-of-speech tag.
xpos: Language-specific part-of-speech tag; underscore if not available.
feats: List of morphological features from the universal feature inventory or from a defined language-specific extension; underscore if not available.
head_token_id: Head of the current word, which is either a value of token_id or zero (0).
dep_rel: Universal dependency relation to the HEAD (root iff HEAD = 0) or a defined language-specific subtype of one.
deps: Enhanced dependency graph in the form of a list of head-deprel pairs.
misc: Any other annotation.
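As an illustration of how the head and dependency-relation columns fit together, the sketch below resolves each token's head within its sentence. It is a minimal example under assumptions: the column names (`doc_id`, `sentence_id`, `token_id`, `token`, `upos`, `head_token_id`, `dep_rel`) follow the usual udpipe-style layout, and the `toy` data frame is invented for the example rather than taken from fst_dev_coop_2.

```r
# Toy data frame in a udpipe-style layout (assumed column names).
# The values are invented for illustration; they are NOT rows from fst_dev_coop_2.
toy <- data.frame(
  doc_id = "1",
  sentence_id = 1L,
  token_id = c("1", "2", "3"),        # kept as character: may be a range or decimal
  token = c("support", "poor", "countries"),
  upos = c("VERB", "ADJ", "NOUN"),
  head_token_id = c("0", "3", "1"),   # "0" marks the root of the sentence
  dep_rel = c("root", "amod", "obj"),
  stringsAsFactors = FALSE
)

# Find the row of each token's head within the same document and sentence.
head_idx <- match(
  paste(toy$doc_id, toy$sentence_id, toy$head_token_id),
  paste(toy$doc_id, toy$sentence_id, toy$token_id)
)

# Root tokens have no head row, so label them explicitly.
toy$head_token <- ifelse(toy$head_token_id == "0", "ROOT", toy$token[head_idx])

print(toy[, c("token", "dep_rel", "head_token")])
```

If the real data frame uses these column names, the same `match()` lookup should apply to it unchanged, since the key combines document, sentence, and token identifiers.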
<https://urn.fi/urn:nbn:fi:fsd:T-FSD2821>