These files contain the analysis code and write-up. All R code except for a few library calls and initial data loading calls are in the files in the code directory.
tf-idf Initial loading and munging of data. Computation and plots of TF-IDF and sample sets. train-model.Rmd Contains all model validation code and plots. Cross-validation of perplexity scores, scoring of models with ldatuning functions, coherence scores and plots of set distributions over topic probabilities. compare-model-distributions.Rmd Plots of color distributions over topics, plots of example sets and tables of example sets. final-model.Rmd Several plots of the final model. background Some math used in model evaluation.Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.