Man pages for PolMine/ctk
Toolkit for Preparing Corpora

as.xml-method: Convert data.frame to XML.
characterCount: Count characters in files.
consolidate: Consolidate vrt files for CWB import.
CoreNLP: Class for using Stanford CoreNLP.
ctk-package: R-Package 'ctk' (Corpus Toolkit).
dirApply: Apply a function over files in directory.
encode-method: Encode corpus (CWB import).
findAndReplace: Find and replace.
getAttributeValues: Get attribute values.
getFiles: Get files from several directories. Files in the sourceDir...
getNgrams-character-method: Get ngrams.
install.corenlp: Install Stanford CoreNLP.
install.treetagger: Install TreeTagger.
NDJSON: Parse Stanford CoreNLP JSON output.
normalizeGermanDate: Normalize German date.
Pipe: Pipe for corpus preparation.
PipeCoreNLP: Pipe using Stanford CoreNLP.
recode-method: Apply iconv to files in a directory.
regex-character-method: Get matches for regular expression in files in directory.
regexContext: Get context of a regex.
regexPostprocessing: regexPostprocessing
removeEmptyLines: Remove empty lines.
removeWhitespace: Remove whitespace.
sAttributeList-method: Get a list with sAttributes from files.
saxon: Install Saxon XSLT Processor to tools directory.
timePerFile: Time per file.
tokenize: Tokenize files.
treetagger: Use TreeTagger for linguistic annotation.
validate: Validate XML files.
xslt: Perform XSL transformation.
PolMine/ctk documentation built on May 8, 2019, 3:20 a.m.