A framework for developing n-gram models for text prediction. It provides data cleaning, data sampling, extracting tokens from text, model generation, model evaluation and word prediction. For information on how n-gram models work we referred to: "Speech and Language Processing" <https://web.archive.org/web/20240919222934/https%3A%2F%2Fweb.stanford.edu%2F~jurafsky%2Fslp3%2F3.pdf>. For optimizing R code and using R6 classes we referred to "Advanced R" <https://adv-r.hadley.nz/r6.html>. For writing R extensions we referred to "R Packages", <https://r-pkgs.org/index.html>.
Package details |
|
---|---|
Author | Nadir Latif [aut, cre] (<https://orcid.org/0000-0002-7543-7405>) |
Maintainer | Nadir Latif <pakjiddat@gmail.com> |
License | MIT + file LICENSE |
Version | 0.0.5 |
URL | https://github.com/pakjiddat/word-predictor https://pakjiddat.github.io/word-predictor/ |
Package repository | View on CRAN |
Installation |
Install the latest version of this package by entering the following in R:
|
Any scripts or data that you put into this service are public.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.