rlda_general main class methods: LDA wrapper, fit
dtm
document term matrix
idx
index in K for the model that should be treated as the original model
K
numeric or vector, if numeric, number of k to try, if vector, k's to try (will be overwrite with list of k's tried once fit has been run)
threshold,
sim_threshold, threshold for return2, between [0,1]
similarity_measure
string, similarity measure (so far cosine or hellinger). Default: cosine
num_of_clusters
numeric or vector, number of clusters used when performing spectral clustering
beta_list
list of beta matrices in LDA objects of all K tried (ordered same as K)
gamma_list
list of gamma matrices in LDA objects of all K tried (ordered same as K)
terms
list of unique words/token in the vocabulary
model_topic_mat
percentage of documents dominated by the given topic
similarity_mat
maximum similarity (given choice of similarity functions) of a given topic compare to any topics in the original lda model (to give the probability of a user<e2><80><99>s topic shows up in a tried model<e2><80><99>s resulting topics)
sim_matrix_list
list of similarity matrices that gives us similarity between
key_features
top 10 features of a given topic in each model tried
topic_dom_perc_list
percentage of documents dominated by the given topic out of documents originally dominated by similar topic in the original model
dominant_topic_cluster_list
clusters correponding to dominant topics of each document in each model
cluster_center_key_words_list
top 10 keywords for each center found by the cluster algorithn (so far only support spectral clustering)
perc_document_belong_cluster_list
percentage of documents belong to a given cluster in a given model
topic_cluster_assignment
cluster number a given topic belongs to
doc_by_cluster_and_model
a matrix indicating the dominiant cluster of each document according to each topic model
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.