quanteda: Quantitative Analysis of Textual Data

A fast, flexible, and comprehensive framework for quantitative text analysis in R. Provides functionality for corpus management, creating and manipulating tokens and n-grams, exploring keywords in context, forming and manipulating sparse matrices of documents by features and feature co-occurrences, analyzing keywords, computing feature similarities and distances, applying content dictionaries, applying supervised and unsupervised machine learning, visually representing text and text analyses, and more.

Package overview README.md Quick Start Guide

Vignettes Man pages API and functions Files

Package details
Author	Kenneth Benoit [cre, aut, cph] (ORCID: <https://orcid.org/0000-0002-0797-564X>), Kohei Watanabe [aut] (ORCID: <https://orcid.org/0000-0001-6519-5265>), Haiyan Wang [aut] (ORCID: <https://orcid.org/0000-0003-4992-4311>), Paul Nulty [aut] (ORCID: <https://orcid.org/0000-0002-7214-4666>), Adam Obeng [aut] (ORCID: <https://orcid.org/0000-0002-2906-4775>), Stefan Müller [aut] (ORCID: <https://orcid.org/0000-0002-6315-4125>), Akitaka Matsuo [aut] (ORCID: <https://orcid.org/0000-0002-3323-6330>), William Lowe [aut] (ORCID: <https://orcid.org/0000-0002-1549-6163>), Christian Müller [ctb], Olivier Delmarcelle [ctb] (ORCID: <https://orcid.org/0000-0003-4347-070X>), European Research Council [fnd] (ERC-2011-StG 283794-QUANTESS)
Maintainer	Kenneth Benoit <kbenoit@lse.ac.uk>
License	GPL-3
Version	4.4
URL	https://quanteda.io
Package repository	View on CRAN
Installation	Install the latest version of this package by entering the following in R: `install.packages("quanteda")`