download_twokenizer: Download tagger

Description Usage Arguments Examples

View source: R/download_twokenizer.R

Description

Download Ark-TweetNLP's POS tagger, and optionally, its other models.

Usage

1
download_twokenizer(outputLoc, type = c("base", "penn", "irc"))

Arguments

outputLoc

Location to put the tagger.

type

What model to download. "base" must be downloaded first before the other two (i.e. "penn" and "irc") can be downloaded. The Penn model uses Penn Treebank-style tags, which may facilitate working with other packages, whereas the irc model is, as the ArkTweetNLP website states, "a model trained on the NPSChat IRC corpus, with a PTB-style tagset".

Examples

1
2
3
4
5
#download_twokenizer("~/Documents", "base")

#MUST DOWNLOAD "base" FIRST:
#download_twokenizer("~/Documents", "penn")
#download_twokenizer("~/Documents", "irc")

b05102139/twokenizer documentation built on March 3, 2020, 12:13 a.m.