Nothing
Segmentation of short text sequences - like hashtags - into the separated words sequence, done with the use of dictionary, which may be built on custom corpus of texts. Unigram dictionary is used to find most probable sequence, and n-grams approach is used to determine possible segmentation given the text corpus.
Package details |
|
---|---|
Author | Oskar Kosch [aut, cre] (<https://orcid.org/0000-0003-2697-1393>) |
Maintainer | Oskar Kosch <contact@oskarkosch.com> |
License | GPL (>= 3) |
Version | 0.1.0 |
URL | https://github.com/theogrost/NUSS |
Package repository | View on CRAN |
Installation |
Install the latest version of this package by entering the following in R:
|
Any scripts or data that you put into this service are public.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.