nrauscher/corpus: Toolkit for Corpus Analysis

Tools for corpus analysis using the CWB as an efficient backend for indexing and querying large corpora. The package offers functionality to flexibly create subcorpora ("partitions") and to carry out basic statistical operations (count, cooccurrences etc.). The original full text of documents can be reconstructed and inspected at any time. Beyond that, the package is intended to serve as an interface to packages implementing advanced statistical procedures. Respective data structures (document term matrices, term cooccurrence matrices etc.) can be created based on the indexed corpora.

README.md

Vignettes Man pages API and functions Files

Package details
Author	Andreas Blaette
Maintainer	Andreas Blaette <andreas.blaette@uni-due.de>
License	GPL-3
Version	0.7.1
URL	https://www.github.com/PolMine/polmineR
Package repository	View on GitHub
Installation	Install the latest version of this package by entering the following in R: `install.packages("remotes") remotes::install_github("nrauscher/corpus")`