estimateRegisterSize: estimateRegisterSize

Description Usage Arguments Details Value Author(s) See Also

View source: R/B05.estimateRegisterSize.R

Description

estimateRegisterSize Estimates the size of individual registers

Usage

1
2
estimateRegisterSize(korpus, corpusSize, samplingUnit, sampleSize = 2000,
  numSamples = 100)

Arguments

korpus

List containing the meta data for the corpus

corpusSize

List containing teh corpus size estimate from estimateCorpusSize

samplingUnit

- List containing the sampling unit estimate from estimateSamplingUnit

sampleSize

Integer indicating sampling unit size

numSamples

Integer indicating the number of samples to analyze

Details

This function takes as its parameters, the meta data for the korpus, the corpus sample size estimate from estimateCorpusSize, the sampling unit estimate from estimateSamplingUnit, the POS tags, sample size, and the number of samples and returns an estimate of register size for each register based upon the distribution of lexical features per 2000-word samples of the text. This analysis is based upon Biber's 1993 Representativeness in Corpus Design.

Value

registerSizes Dataframe containing

Author(s)

John James, j2sdatalab@gmail.com

See Also

Other sample size estimate functions: estimateCorpusSize, estimateSampleSize, estimateSamplingUnit


DataScienceSalon/predictifyR.3.0 documentation built on May 23, 2019, 8:25 p.m.