The corpora package provides a collection of functions for statistical inference from corpus frequency data, as well as some convenience functions and example data sets.

It is a companion package to the open-source course Statistical Inference: a Gentle Introduction for Linguists and similar creatures (SIGIL), developed by Marco Baroni and Stefan Evert. The statistical methods implemented in the package are described and illustrated in the units of this course.


Stefan Evert


The official homepage of the corpora package and the SIGIL course is

