pakjiddat/word-predictor: Develop Text Prediction Models Based on N-Grams

A framework for developing n-gram models for text prediction. It supports data cleaning, data sampling, token extraction, model generation, model evaluation, and word prediction. For background on how n-gram models work, we referred to "Speech and Language Processing" <https://web.archive.org/web/20240919222934/https%3A%2F%2Fweb.stanford.edu%2F~jurafsky%2Fslp3%2F3.pdf>. For optimizing R code and using R6 classes, we referred to "Advanced R" <https://adv-r.hadley.nz/r6.html>. For writing R extensions, we referred to "R Packages" <https://r-pkgs.org/index.html>.
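
To make the workflow above concrete, here is a minimal sketch of a bigram (2-gram) predictor written in base R. It is not the package's API: the function names `tokenize`, `build_bigrams`, and `predict_next` and the toy corpus are hypothetical, chosen only to illustrate the cleaning, token extraction, model generation, and prediction steps. It uses raw counts with no smoothing or backoff.

```r
# Illustrative bigram sketch in base R. All names below are hypothetical
# and are NOT part of the word-predictor package.
corpus <- c(
  "the cat sat on the mat",
  "the cat ate the fish",
  "the dog sat on the rug"
)

# Data cleaning and token extraction: lower-case the line, drop everything
# except letters and spaces, then split on runs of spaces.
tokenize <- function(line) {
  cleaned <- gsub("[^a-z ]", "", tolower(line))
  tokens <- strsplit(cleaned, " +")[[1]]
  tokens[nzchar(tokens)]
}

# Model generation: count every (previous word, next word) pair.
build_bigrams <- function(lines) {
  pairs <- do.call(rbind, lapply(lines, function(line) {
    tokens <- tokenize(line)
    if (length(tokens) < 2) return(NULL)
    data.frame(prev = head(tokens, -1), nxt = tail(tokens, -1),
               stringsAsFactors = FALSE)
  }))
  counts <- as.data.frame(table(prev = pairs$prev, nxt = pairs$nxt),
                          stringsAsFactors = FALSE)
  counts[counts$Freq > 0, ]
}

# Word prediction: return the n most frequent followers of `word`,
# breaking ties alphabetically; empty result for unseen words.
predict_next <- function(model, word, n = 1) {
  rows <- model[model$prev == word, ]
  if (nrow(rows) == 0) return(character(0))
  rows <- rows[order(-rows$Freq, rows$nxt), ]
  head(rows$nxt, n)
}

model <- build_bigrams(corpus)
print(predict_next(model, "the"))  # "cat" follows "the" twice in the toy corpus
print(predict_next(model, "sat"))  # "on" follows "sat" twice
```

A real model would typically use higher-order n-grams, a much larger sampled corpus, and some form of smoothing or backoff so that unseen contexts still yield a prediction.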

Package details

Maintainer
License: MIT + file LICENSE
Version: 0.0.5
URL: https://github.com/pakjiddat/word-predictor, https://pakjiddat.github.io/word-predictor/
Installation

Install the latest version of this package by entering the following in R:
install.packages("remotes")
remotes::install_github("pakjiddat/word-predictor")
pakjiddat/word-predictor documentation built on Oct. 15, 2024, 1:10 p.m.