Home

/

CRAN

/

NUSS: Mixed N-Grams and Unigram Sequence Segmentation

Segmentation of short text sequences - like hashtags - into the separated words sequence, done with the use of dictionary, which may be built on custom corpus of texts. Unigram dictionary is used to find most probable sequence, and n-grams approach is used to determine possible segmentation given the text corpus.

README.md

Vignettes Man pages API and functions Files

Package details
Author	Oskar Kosch [aut, cre] (<https://orcid.org/0000-0003-2697-1393>)
Maintainer	Oskar Kosch <contact@oskarkosch.com>
License	GPL (>= 3)
Version	0.1.0
URL	https://github.com/theogrost/NUSS
Package repository	View on CRAN
Installation	Install the latest version of this package by entering the following in R: `install.packages("NUSS")`