verifyLinguistics: verifyLinguistics

View source: R/C04.verifyLinguistics.R

Description

verifyLinguistics compares the distribution of lexical features between two corpora.

Usage

verifyLinguistics(fullCorpus, sampleCorpus, chunks = 100, chunkSize = 2000)

Arguments

fullCorpus

List containing the metadata for the HC Corpus

sampleCorpus

List containing the metadata for the sample corpus

chunks

The number of chunks to sample

chunkSize

The number of sentences in each chunk sampled

Details

This function takes the metadata for the full corpus and the sample corpus as its parameters and returns comparative statistics on the distribution of lexical features in the two corpora.
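
The general idea of comparing chunk-level lexical features can be sketched as follows. This is a minimal illustration only, not the package's implementation: the helper sampleChunkFeature, the choice of mean words per sentence as the lexical feature, the toy corpora, and the use of a Welch t-test are all assumptions made for the example. It also assumes each corpus is already available as a character vector of sentences, whereas verifyLinguistics itself takes metadata lists.

```r
# Hypothetical helper (not part of predictifyR): draws `chunks` random chunks
# of `chunkSize` sentences and returns the mean words per sentence per chunk.
sampleChunkFeature <- function(sentences, chunks = 100, chunkSize = 2000) {
  vapply(seq_len(chunks), function(i) {
    # Sample with replacement so small corpora still yield full-size chunks
    chunk <- sample(sentences, chunkSize, replace = TRUE)
    wordCounts <- lengths(strsplit(chunk, "\\s+"))
    mean(wordCounts)
  }, numeric(1))
}

set.seed(42)

# Toy stand-ins for the two corpora (assumed: character vectors of sentences)
vocab <- c("the", "quick", "brown", "fox", "jumps", "over", "lazy", "dog")
makeSentence <- function() {
  paste(sample(vocab, sample(3:12, 1), replace = TRUE), collapse = " ")
}
fullSentences <- replicate(5000, makeSentence())
sampleSentences <- sample(fullSentences, 500)

# Chunk-level feature values for each corpus (smaller sizes keep the demo fast)
fullFeature <- sampleChunkFeature(fullSentences, chunks = 20, chunkSize = 200)
sampleFeature <- sampleChunkFeature(sampleSentences, chunks = 20, chunkSize = 200)

# Welch two-sample t-test on the chunk-level means
comparison <- t.test(fullFeature, sampleFeature)
print(comparison$p.value)
```

A large p-value here would suggest that the sample corpus is not detectably different from the full corpus on this one feature; a real comparison would likely examine several lexical features rather than a single one.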

Value

features A list containing:

Author(s)

John James


DataScienceSalon/predictifyR.3.0 documentation built on May 23, 2019, 8:25 p.m.