These files contain the analysis code and write-up. All R code except for a few library calls and initial data loading calls are in the files in the code
directory.
tf-idf
Initial loading and munging of data. Computation and plots of TF-IDF and sample sets. train-model.Rmd
Contains all model validation code and plots. Cross-validation of perplexity scores, scoring of models with ldatuning
functions, coherence scores and plots of set distributions over topic probabilities. compare-model-distributions.Rmd
Plots of color distributions over topics, plots of example sets and tables of example sets. final-model.Rmd
Several plots of the final model. background
Some math used in model evaluation.Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.