This function crops each document in a corpus to a fixed word count and discards documents that are shorter than that count. The accepted documents are then tokenized, and the type/token ratio of each is calculated and returned.
This function takes a data structure created by this package, using methods such as getFromFolderWF, and returns the type/token ratios.
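As a rough illustration of the quantity being returned, the type/token ratio of one document is the number of unique tokens (types) divided by the total number of tokens. The sketch below uses base R only. The helper name `typeTokenRatio` and the named count vector are assumptions made for illustration; they are not part of this package and do not reflect the actual layout of its word frequency structure.

```r
# Hypothetical helper, not part of the package: type/token ratio computed
# from a named vector of token counts (one entry per unique token).
typeTokenRatio <- function(counts) {
  counts <- counts[counts > 0]            # ignore tokens that never occur
  if (length(counts) == 0) return(NA_real_)
  length(counts) / sum(counts)            # types divided by tokens
}

# Made-up counts for one document: 5 types, 7 tokens in total.
counts <- c(the = 3, cat = 1, sat = 1, on = 1, mat = 1)
typeTokenRatio(counts)
```

Here the ratio is 5 types over 7 tokens. A value near 1 means that almost every token is distinct, while a low value indicates heavy repetition.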
getTypeTokenRatios(wordFrequencyMatrix)
wordFrequencyMatrix | a data structure generated by this package that contains the unique tokens and their counts
path | the path to the folder containing the corpus
minMaxWordCount | documents with fewer tokens than this count are rejected, and documents longer than the specified count are cropped to it; defaults to 300
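The cropping rule described for minMaxWordCount can be sketched in base R as follows. This is only an approximation under stated assumptions, not the package's implementation: the function name `croppedTypeTokenRatios`, the in-memory character vector used instead of a folder path, and the naive tokenizer (lowercasing and splitting on anything other than letters and apostrophes) are all hypothetical.

```r
# Hypothetical sketch, not the package's implementation.
croppedTypeTokenRatios <- function(texts, minMaxWordCount = 300) {
  # Naive tokenization (an assumption): lowercase, split on non-letters.
  tokens <- lapply(texts, function(txt) {
    words <- strsplit(tolower(txt), "[^a-z']+")[[1]]
    words[nzchar(words)]                  # drop empty strings from the split
  })
  # Reject documents with fewer tokens than the required count.
  keep <- vapply(tokens, length, integer(1)) >= minMaxWordCount
  # Crop the remaining documents to the count, then compute types / tokens.
  vapply(tokens[keep], function(words) {
    cropped <- words[seq_len(minMaxWordCount)]
    length(unique(cropped)) / minMaxWordCount
  }, numeric(1))
}

# Made-up corpus: "a" has 6 tokens and is cropped to 5; "b" is rejected.
docs <- c(a = "the cat sat on the mat", b = "too short")
croppedTypeTokenRatios(docs, minMaxWordCount = 5)
```

Because every accepted document is cropped to the same length, the resulting ratios are comparable across documents; uncropped type/token ratios tend to fall as texts get longer.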