A set of tools to analyze texts. Includes, amongst others, functions for automatic language detection, hyphenation,
several indices of lexical diversity (e.g., type token ratio, HD-D/vocd-D, MTLD) and readability (e.g., Flesch, SMOG,
LIX, Dale-Chall). Basic import functions for language corpora are also provided, to enable frequency analyses (supports
Celex and Leipzig Corpora Collection file formats) and measures like tf-idf. Support for additional languages can be
added on-the-fly or by plugin packages. Note: For full functionality a local installation of TreeTagger is recommended.
'koRpus' also includes a plugin for the R GUI and IDE RKWard, providing graphical dialogs for its basic features. The
respective R package 'rkward' cannot be installed directly from a repository, as it is a part of RKWard. To make full
use of this feature, please install RKWard from
|Author||m.eik michalke [aut, cre], Earl Brown [ctb], Alberto Mirisola [ctb], Alexandre Brulet [ctb], Laura Hauser [ctb]|
|Date of publication||2017-04-04 22:04:32 UTC|
|Maintainer||m.eik michalke <[email protected]>|
|License||GPL (>= 3)|
|Package repository||View on CRAN|
Install the latest version of this package by entering the following in R:
Any scripts or data that you put into this service are public.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.