Man pages for NicolasJBM/lexR
Toolbox to manipulate and analyze texts

clean_ascii         Convert or force into ASCII.
clean_letters       Simplify string
clean_paragraphs    Recompose paragraphs by identifying improper breaks.
clean_replace       Replace words in string.
clean_spaces        Clean spaces
clean_tags          Clean into pure ASCII text
clean_windows       Extract windows of words around a focal pattern.
count_words         Count words
create_bow          Create a bag of words
create_dtm          Create a Document-to-Term Matrix
create_stm          Create a Structural Topic Model
create_syntnet      Create a Syntactic Network
create_syntrel      Create Syntactic Relationships
dat_dictionaries    Dictionaries for lexical analyses
dat_en_lemmas       English lemmatization
dat_symbols         List of symbols to be removed
dat_toascii         Map non ASCII to ASCII
eval_bow            Compute bag of words metrics
eval_dictionaries   Compute scores based on dictionaries' word counts
eval_readability    Assess the readability of a text.
eval_stm            Evaluate Structural Topic Models.
predict_topic       Apply a STM to a new corpus
test_regex          Test Regular Expressions
NicolasJBM/lexR documentation built on Feb. 4, 2021, 6:43 p.m.