Man pages for manuelbickel/textility
Utility functions for text mining

add_shape_circle2               Add additional igraph vertex shape
attr_DT                         attr_DT: Workaround to use data.table attributes that are...
bind_ngrams                     Replace blanks by replacement pattern in known ngrams in a...
cos_sim2_vectors                Calculate the cosine similarity of a set of word vectors to...
create_reference_tcm            Create several reference tcms with different settings for...
get_cooccurrence                Get the co-occurrence of column elements per row entity (e.g....
get_extrema                     Get idxs and values of extreme points in discrete sequence of...
get_loglik_parallel             Get loglikelihood from multiple locally saved models
get_rank                        Wrapper around base rank allowing for dense rank output
get_topic_trends                Model and summarize topic trends based on text2vec LDA
get_wiki_content                Get Wikipedia Text Content from Multiple Pages
gradient_numeric                Get numeric gradient between points
jsPCA_robust                    (Numerically robust) Dimension reduction via Jensen-Shannon...
loess_aicc_optimized            Fitting of AICC optimized loess (groupwise)
patterns_in_topics              Check occurrence and rank of patterns per topic
p_lm                            Get p values of linear model
plot_topic_trends               Generate trend plots for topics modelled with text2vec and...
replace_acronyms_by_words       Replace acronyms by words
scale_normal                    Scale values normal with boundaries zero and one
semantic_coherence_stm_light    Semantic Coherence (as used in stm package, but a light...
sort_topicmodels_LDA_by_lambda  Sort topics of an lda model from topicmodels by lambda
sparse_to_stm                   Convert sparse Matrix to format required by stm for modelling
split_columns                   Split columns on basis of a string and duplicate entries
stri_process                    A wrapper function for various preprocessing options for...
subset_rows_DT                  Subset rows of a data.table by reference
tcm_specs_standard              Initializes the standard specifications for tcms to be used...
top_feature_matrix              Create a top features per entity matrix from a numeric...
topic_cooccurrence              Calculate topic co-occurrence from text2vec model
treetag_parallel                Wrapper around treetag function from koRpus for parallel part...
tripl_to_sparse                 Simple triple matrix to sparse matrix
uncontract_negations            Uncontract negations, e.g., "don't" to "do not"
warp_lda_vary_n_parallel        Fit Warp LDA models for varying n in parallel
manuelbickel/textility documentation built on Nov. 25, 2022, 9:07 p.m.