View source: R/B04.estimateCorpusSize.R
estimateCorpusSize
Estimates corpus size based upon lexical features
estimateCorpusSize(korpus, sampleSize = 2000, numSamples = 100)
korpus | List containing the corpus metadata
sampleSize | Numeric indicating the sampling unit size
numSamples | Numeric indicating the number of samples to analyze
This function takes the korpus metadata and the POS tags selected for this analysis as its parameters, then returns an estimate of total corpus size based upon the distribution of lexical features per n000-word sample of the text. The analysis is based upon Biber (1993), Representativeness in Corpus Design: https://www.researchgate.net/publication/31460364_Representativeness_in_Corpus_Design
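The idea behind this kind of estimate can be illustrated with a small sketch. It is not the package's implementation: the helper name required_samples, its tolerance and conf_level arguments, and the simulated counts are all assumptions made for illustration. It uses a standard sample-size calculation of the kind discussed in Biber (1993): choose a tolerable error as a fraction of the mean feature count per sample, convert it to a standard error, and solve for the number of samples.

```r
# Hypothetical helper, not part of the package: estimates how many
# fixed-size text samples are needed so that the mean count of one
# lexical feature falls within a tolerable error of the true mean.
required_samples <- function(counts, tolerance = 0.05, conf_level = 0.95) {
  m  <- mean(counts)                        # mean feature count per sample
  s  <- sd(counts)                          # standard deviation across samples
  z  <- qnorm(1 - (1 - conf_level) / 2)     # two-sided normal critical value
  te <- tolerance * m                       # tolerable error (e.g. 5% of the mean)
  se <- te / z                              # standard error implied by that tolerance
  ceiling((s / se)^2)                       # n = s^2 / se^2, rounded up
}

set.seed(1)
# Simulated (made-up) noun counts for 100 samples of 2000 words each.
noun_counts <- rpois(100, lambda = 400)

n_samples <- required_samples(noun_counts)  # samples needed for this feature
n_words   <- n_samples * 2000               # implied corpus size in words
```

Rarer or more unevenly distributed features have a larger standard deviation relative to their mean, so they demand more samples; a corpus-level estimate would typically be driven by the most demanding feature analyzed.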
corpusSize | List containing:
n | Corpus sample size estimate
posAnalysis | POS tag distribution analysis
John James, j2sdatalab@gmail.com
Other sample size estimate functions: estimateRegisterSize, estimateSampleSize, estimateSamplingUnit