Convert the Penn Treebank tags to universal part of speech tags.
as_universal(x, tagset = "en-ptb", dictionary = tagger::universal_pos_map,
  ...)
x | A
tagset | The name of a tagset dictionary to use as a key. Use
dictionary | A data frame that maps the current tagset to a second tagset.
... | Ignored.
Petrov, Das, & McDonald (2011) state that the universal tagset includes:
verbs (all tenses and modes)
nouns (common and proper)
pronouns
adjectives
adverbs
adpositions (prepositions and postpositions)
conjunctions
determiners
cardinal numbers
particles or other function words
other: foreign words, typos, abbreviations
punctuation
For more see: https://github.com/slavpetrov/universal-pos-tags
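The mapping idea can be sketched in base R without the package. This is an illustration only, not the implementation of as_universal: the data frame ptb_to_universal, its column names (ptb, universal), and the helper map_tags are hypothetical, and the real tagger::universal_pos_map may be structured differently. The tag pairs are a small excerpt of the en-ptb mapping published in the repository linked above, and the input tags are written by hand rather than produced by tag_pos. Falling back to "X" for unknown tags is an assumption made for this sketch.

```r
# Hypothetical excerpt of a Penn Treebank -> universal tag mapping.
# Column names are illustrative; tagger::universal_pos_map may differ.
ptb_to_universal <- data.frame(
  ptb       = c("PRP", "VBP", "TO", "VB", "DT", "NN", "JJ", "IN", "CC", "CD", "."),
  universal = c("PRON", "VERB", "PRT", "VERB", "DET", "NOUN", "ADJ", "ADP", "CONJ", "NUM", "."),
  stringsAsFactors = FALSE
)

# Look up each source tag in the dictionary.
# Tags missing from the dictionary fall back to "X" (assumption for this sketch).
map_tags <- function(tags, dictionary) {
  out <- dictionary$universal[match(tags, dictionary$ptb)]
  out[is.na(out)] <- "X"
  out
}

# Hand-written Penn Treebank tags for:
# "They refuse to permit us to obtain the refuse permit"
ptb_tags <- c("PRP", "VBP", "TO", "VB", "PRP", "TO", "VB", "DT", "NN", "NN")
map_tags(ptb_tags, ptb_to_universal)
```

Both readings of "refuse" and "permit" (VBP/VB as verbs, NN as nouns) keep their distinction after mapping, because the universal tagset still separates VERB from NOUN.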
Returns a combined character vector of words and universal tags.
Slav Petrov, Dipanjan Das and Ryan McDonald. (2011). A Universal Part-of-Speech Tagset. http://arxiv.org/abs/1104.2086
(x <- tag_pos("They refuse to permit us to obtain the refuse permit"))
as_universal(x)
(out1 <- tag_pos(sam_i_am))
as_universal(out1)
presidential_debates_2012_pos
as_universal(presidential_debates_2012_pos)