annotated_hymns | R Documentation |
The hymn
dataset, tokenized and tagged with part-of-speach (POS).
Each line represent a word from a given hymn.
annotated_hymns
A tibble with 153,717 rows and 7 variables:
Official hymn number
Line number in hymn
Token number in line
The original token, i.e. word
The lemmatized, i.e. dictionary form, of token
POS-tag, i.e. part-of-speach, like VERB or PUNCT for punctuation
Number of vowels in token – good for finding alternative words
using the cut_up()
function
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.