get_text: Get the text from a tokens data frame

Description Usage Arguments Value

Description

The data is sorted by aid, sentence, and offset, and the words are pasted together to a single character value. The word.column parameter specifies which column to use for the words. If multiple columns arse given (e.g. lemma and POS) they are pasted together with a slash ("Like/IN this/DT")

Usage

1
get_text(tokens, word.column = "word")

Arguments

tokens

a data frame of tokens containing id, parent, and relation columns

word.column

the column(s) containing the words to use.

Value

the text as a single character value


anon-author/clauses documentation built on May 10, 2019, 11:52 a.m.