Description Usage Format Source References
A corpus of 196 speeches from the 2017 UN General Debate. The raw corpus with all speeches since 1970 is available at: https://doi.org/10.7910/DVN/0TJX8Y. The economic data for 2017 (GDP and GDP per capita) are downloaded from the World Bank website.
1 |
The corpus includes the following document variables:
ISO3c country code, e.g. "AFG" for Afghanistan
UN session, a numeric identifier (in this case, 72)
4-digit year (2017).
Country name, in English.
Continent of the country, one of: Africa, Americas, Asia, Europe, Oceania. Note that the speech delivered on behalf of the European Union is coded as "Europe".
GDP in $US for 2017, from the World Bank. Contains missing values for 9 countries.
GDP per capita in $US for 2017, derived from the World Bank. Contains missing values for 9 countries.
Mikhaylov, M., Baturo, A., & Dasandi, N. (2017). United Nations General Debate Corpus. Harvard Dataverse, V4. URL: https://doi.org/10.7910/DVN/0TJX8Y.
Mikhaylov, M., Baturo, A., & Dasandi, N. (2017). United Nations General Debate Corpus. Harvard Dataverse, V4. URL: https://doi.org/10.7910/DVN/0TJX8Y.
Baturo, A., Dasandi, N., & Mikhaylov, S. (2017). Understanding State Preferences With Text As Data: Introducing the UN General Debate Corpus. Research and Politics 4(2): 1–9.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.