topic_model_data | R Documentation |
Dataset containing topic model outputs for demonstrating bias correction methods in topic model regressions using CEO diary data.
topic_model_data
A list with 8 components:
Data frame (916 x 11): Control variables
Data frame (916 x 672): Contains outcome ly
and word frequencies
Data frame (2000 x 2): MCMC draws
Data frame (916 x 2): Full sample topic proportions
Data frame (916 x 2): Subsample topic proportions
Data frame (2 x 654): Full sample topic-word distributions
Data frame (2 x 654): Subsample topic-word distributions
Data frame (916 x 2): LDA validation data
CEO diary data from Bandiera et al (2020), Journal of Political Economy
ols_bca_topic
, ols_bcm_topic
data(topic_model_data)
# Basic exploration
Y <- topic_model_data$estimation_data$ly
theta <- as.matrix(topic_model_data$theta_est_full)
cat("Sample size:", length(Y), "\n")
cat("Mean log employment:", round(mean(Y), 2), "\n")
cat("Topic 1 mean:", round(mean(theta[, 1]), 3), "\n")
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.