corpora: Statistics and data sets for corpus frequency data

Utility functions and data sets for the statistical analysis of corpus frequency data, used in the SIGIL statistics course.

Author: Stefan Evert [http://purl.org/stefan.evert]
Date of publication: 2012-04-04 13:26:36
Maintainer: Stefan Evert <evert@linglit.tu-darmstadt.de>
License: GPL-3
Version: 0.4-3

Man pages

binom_pval           P-values of the binomial test for frequency counts (corpora)
BNCbiber             Biber's (1988) register features for the British National...
BNCcomparison        Comparison of written and spoken frequencies (BNC)
BNCdomains           Distribution of domains in the British National Corpus (BNC)
BNCInChargeOf        Collocations of the phrase "in charge of" (BNC)
BNCmeta              Metadata for the British National Corpus (XML edition)
chisq                Pearson's chi-squared statistic for frequency comparisons...
chisq_pval           P-values of Pearson's chi-squared test for frequency...
cont_table           Build contingency tables for frequency comparison (corpora)
corpora_package      corpora: statistical inference from corpus frequency data
fisher_pval          P-values of Fisher's exact test for frequency comparisons...
prop_cint            Confidence interval for proportion based on frequency counts...
sample_df            Random samples from data frames (corpora)
simulated_census     Simulated census data for examples and illustrations...
simulated_wikipedia  Simulated type and token counts for Wikipedia articles...
VSS                  A small corpus of very short stories with linguistic...
z_score              The z-score statistic for frequency counts (corpora)
z_score_pval         P-values of the z-score test for frequency counts (corpora)

Files in this package

corpora
corpora/MD5
corpora/R
corpora/R/z_score_pval.R
corpora/R/z_score.R
corpora/R/simulated_wikipedia.R
corpora/R/simulated_census.R
corpora/R/sample_df.R
corpora/R/quadratic.R
corpora/R/prop_cint.R
corpora/R/fisher_pval.R
corpora/R/cont_table.R
corpora/R/chisq_pval.R
corpora/R/chisq.R
corpora/R/binom_pval.R
corpora/NAMESPACE
corpora/man
corpora/man/z_score_pval.Rd
corpora/man/z_score.Rd
corpora/man/VSS.Rd
corpora/man/simulated_wikipedia.Rd
corpora/man/simulated_census.Rd
corpora/man/sample_df.Rd
corpora/man/prop_cint.Rd
corpora/man/fisher_pval.Rd
corpora/man/corpora_package.Rd
corpora/man/cont_table.Rd
corpora/man/chisq_pval.Rd
corpora/man/chisq.Rd
corpora/man/BNCmeta.Rd
corpora/man/BNCInChargeOf.Rd
corpora/man/BNCdomains.Rd
corpora/man/BNCcomparison.Rd
corpora/man/BNCbiber.Rd
corpora/man/binom_pval.Rd
corpora/DESCRIPTION
corpora/data
corpora/data/VSS.tab.bz2
corpora/data/datalist
corpora/data/BNCmeta.rda
corpora/data/BNCInChargeOf.tab.bz2
corpora/data/BNCdomains.tab.gz
corpora/data/BNCcomparison.tab.gz
corpora/data/BNCbiber.rda
corpora/COPYING
corpora/CHANGES