mmochtak/sentenceR: Language-agnostic sentence tokenizer with UDPipe back end

sentenceR is a language-agnostic utility for sentence tokenization of raw text. Using the UDPipe POS tagging pipeline, the package automatically extracts sentences together with their indexes. It works with any of the 100+ language models natively provided by the UDPipe package (see https://github.com/bnosac/udpipe).
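
The idea of pulling sentences and their indexes out of a UDPipe annotation can be sketched with the udpipe package directly. This is a minimal illustration and not sentenceR's own API: `extract_sentences` is a hypothetical helper name, and it assumes the token-level data frame that `as.data.frame(udpipe::udpipe_annotate(...))` returns, with the columns `doc_id`, `paragraph_id`, `sentence_id`, and `sentence`.

```r
# Hypothetical helper (not part of sentenceR): collapse a token-level
# udpipe annotation into one row per sentence, keeping its indexes.
# Assumption: `annotation` has the columns doc_id, paragraph_id,
# sentence_id and sentence, as produced by
# as.data.frame(udpipe::udpipe_annotate(model, x = text)).
extract_sentences <- function(annotation) {
  cols <- c("doc_id", "paragraph_id", "sentence_id", "sentence")
  sentences <- unique(annotation[, cols])  # one row per distinct sentence
  rownames(sentences) <- NULL              # reset row names after deduplication
  sentences
}

# Example usage; runs only in an interactive session because it needs the
# udpipe package and a network connection to download a language model.
if (interactive() && requireNamespace("udpipe", quietly = TRUE)) {
  model_info <- udpipe::udpipe_download_model(language = "english")
  model <- udpipe::udpipe_load_model(file = model_info$file_model)
  annotation <- as.data.frame(
    udpipe::udpipe_annotate(model, x = "This is one sentence. Here is another.")
  )
  print(extract_sentences(annotation))
}
```

Because every token row in the annotation repeats the full text of its sentence, deduplicating on the index columns is enough to recover the sentence list. Swapping the `language` argument for another model name supported by udpipe should work the same way.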


Package details

Maintainer
License: `use_mit_license()`
Version: 0.0.1
Package repository: GitHub (mmochtak/sentenceR)
Installation

Install the latest version of this package by entering the following in R:
install.packages("remotes")
remotes::install_github("mmochtak/sentenceR")
mmochtak/sentenceR documentation built on Aug. 25, 2022, 9:31 a.m.