Description Usage Arguments Details Value Author(s) See Also
View source: R/B05.estimateRegisterSize.R
estimateRegisterSize
Estimates the size of individual registers
1 2 | estimateRegisterSize(korpus, corpusSize, samplingUnit, sampleSize = 2000,
numSamples = 100)
|
korpus |
List containing the meta data for the corpus |
corpusSize |
List containing teh corpus size estimate from |
samplingUnit |
- List containing the sampling unit estimate from |
sampleSize |
Integer indicating sampling unit size |
numSamples |
Integer indicating the number of samples to analyze |
This function takes as its parameters, the meta data for the korpus,
the corpus sample size estimate from estimateCorpusSize
,
the sampling unit estimate from estimateSamplingUnit
, the
POS tags, sample size, and the number of samples and returns an estimate of
register size for each register based upon the distribution of lexical
features per 2000-word samples of the text. This analysis is based upon
Biber's 1993 Representativeness in Corpus Design.
registerSizes Dataframe containing
RegisterString indicating the register name
BaseNumeric indicating base allocation of samples allocated to all registers
Avg VcNumberic average coefficient of variation across all POS tags
lambdaNumeric factor multiplied by Avg Vc to calculate proportional allocation
ProportionNumeric indicating the proportional allocation of samples to registers
Num SamplesInteger indicating Base + Proportion for each register
Sample LengthInteger indicating length of each sample in tokens
Sample SizeInteger = Num Samples * Sample Length
John James, j2sdatalab@gmail.com
Other sample size estimate functions: estimateCorpusSize
,
estimateSampleSize
,
estimateSamplingUnit
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.