P-values of the binomial test for frequency counts (corpora)
This function computes the p-value of a binomial test for frequency counts. In the two-sided case, a fast approximation is used that may be inaccurate for small samples.
frequency of a type in the corpus (or an integer vector of frequencies)
number of tokens in the corpus, i.e. sample size (or an integer vector specifying the sizes of different samples)
null hypothesis, giving the assumed proportion of this type in the population (or a vector of proportions for different types and/or different populations)
a character string specifying the alternative hypothesis; must be one of two.sided (default), less or greater
For the two.sided alternative, a fast approximation of the two-sided p-value is used (multiplying the appropriate single-sided tail probability by two), which may be inaccurate for small samples. Unlike the exact algorithm of binom.test, this implementation can be applied to large frequencies and samples without a serious impact on performance.
The p-value of a binomial test applied to the given data (or a vector of p-values).
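The doubling approximation described above can be illustrated with base R alone. The following is a minimal sketch, not the package's own code: the helper name approx_two_sided is hypothetical, and both the rule for choosing the tail (upper tail when the observed count is at or above its expectation) and the cap at 1 are assumptions made for illustration. The example data are invented.

```r
# Hypothetical helper, NOT the corpora implementation: doubles the
# single-sided tail probability on the side where the observed count lies.
approx_two_sided <- function(k, n, p = 0.5) {
  lower <- pbinom(k, n, p)                          # P(X <= k)
  upper <- pbinom(k - 1, n, p, lower.tail = FALSE)  # P(X >= k)
  # Assumption: use the upper tail if k is at or above the expected
  # count n * p, otherwise the lower tail; cap the doubled value at 1.
  pmin(1, 2 * ifelse(k >= n * p, upper, lower))
}

# Invented example data: 19 occurrences in a sample of 100 tokens,
# tested against a null proportion of 0.15.
k <- 19; n <- 100; p <- 0.15
approx <- approx_two_sided(k, n, p)
exact  <- binom.test(k, n, p = p)$p.value  # exact two-sided test from stats

# The helper is vectorised, so it accepts vectors of frequencies.
approx_vec <- approx_two_sided(c(1, 2, 3), 10, 0.5)
```

Comparing approx with exact for small n shows how far the doubled tail can drift from the exact two-sided p-value when the null distribution is skewed. For a symmetric null (p = 0.5) the two values coincide.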