Man pages for Docma-TU/tmT
Text Mining Tools For News Corpora Analysis

clusterTopicsCluster Analysis
deleteAndRenameDuplicatesDelete And Rename Articles with the same ID
docLDAcreate lda-ready dataset
duplistCreating List of Duplicates
intruderTopicsFunction to fit LDA model
intruderWordsFunction to fit LDA model
LDAstandardFunction to fit LDA model
makeClearSome data preprocessing
makeWordlistCount words in text corpora
mergeLDAPreparation of different LDAs For Clustering
mergeTextmetaMerge textmeta objects
plotScotPlotting Counts of Documents or Words over Time (relative to...
plotTopicPlotting Counts of Topics over Time (relative to Corpus)
plotTopicWordPlotting Counts of Topics-Words-Combination over Time...
plotWordPlotting Counts of specified Wordgroups over Time (relative...
plotWordptPlotting Counts of Topics-Words-Combination over Time...
plotWordSubPlotting Counts/Proportion of Words/Docs in LDA-generated...
readHBWiWoRead the HB WiWo Corpus
readJFArchivRead the Corpus as CSV
readNexisRead preprocessed files from Lexis Nexis
readNexisOnlineRead preprocessed files from Nexis Online
readSPIEGELRead the SPIEGEL Corpus
readSZRead the SZ corpus
readTextmetaRead Corpora as CSV
readWikiRead Pages from Wikipedia
removeXMLRemoves XML tags and umlauts
sedimentPlotPlotting Sediment plot of topics over time
showArticlesExport Readable Article Lists
showMetadataExport Readable Meta-Data of Articles.
subcorpusCountSubcorpus With Count Filter
subcorpusDateSubcorpus With Date Filter
subcorpusWordSubcorpus With Word Filter
topArticlesGet The IDs Of The Most Representive Articles
topicOverTimePlotting Topics Over Time
topicsInTextWord coloring by topics
topicwordsOverTimePlotting Topicwords Over Time
totHeatPlotting Topics over Time relative to Corpus
Docma-TU/tmT documentation built on Dec. 12, 2017, 12:08 p.m.