kRp.cluster: Work in (early) progress. Probably don't even look at it....

Description Usage Arguments Details

View source: R/kRp.cluster.R


In some future release, this might evolve into a function to help comparing several texts by features like average sentece length, word length, lexical diversity, and so forth. The idea behind it is to conduct a cluster analysis, to discover which texts out of several are similar to (or very different from) each other. This can be useful, e.g., if you need texts for an experiment which are different in content, but similar regarding syntactic features, like listed above.


kRp.cluster(txts, lang, TT.path, TT.preset)



A character vector with paths to texts to analyze.


A character string with a valid Language identifier.


A character string, path to TreeTagger installation.


A character string naming the TreeTagger preset to use.


It is included in this package not really to be used, but to maybe inspire you, to toy around with the code and help me to come up with something useful in the end...

koRpus documentation built on May 30, 2017, 12:47 a.m.