data_corpus_ungd2017: UN General Debate speeches, 2017

Description Usage Format Source References

Description

A corpus of 196 speeches from the 2017 UN General Debate. The raw corpus with all speeches since 1970 is available at: https://doi.org/10.7910/DVN/0TJX8Y. The economic data for 2017 (GDP and GDP per capita) are downloaded from the World Bank website.

Usage

1

Format

The corpus includes the following document variables:

country_iso

ISO3c country code, e.g. "AFG" for Afghanistan

un_session

UN session, a numeric identifier (in this case, 72)

year

4-digit year (2017).

country

Country name, in English.

continent

Continent of the country, one of: Africa, Americas, Asia, Europe, Oceania. Note that the speech delivered on behalf of the European Union is coded as "Europe".

gdp

GDP in $US for 2017, from the World Bank. Contains missing values for 9 countries.

gdp_per_capita

GDP per capita in $US for 2017, derived from the World Bank. Contains missing values for 9 countries.

Source

Mikhaylov, M., Baturo, A., & Dasandi, N. (2017). United Nations General Debate Corpus. Harvard Dataverse, V4. URL: https://doi.org/10.7910/DVN/0TJX8Y.

References

Mikhaylov, M., Baturo, A., & Dasandi, N. (2017). United Nations General Debate Corpus. Harvard Dataverse, V4. URL: https://doi.org/10.7910/DVN/0TJX8Y.

Baturo, A., Dasandi, N., & Mikhaylov, S. (2017). Understanding State Preferences With Text As Data: Introducing the UN General Debate Corpus. Research and Politics 4(2): 1–9.


quanteda/quanteda.corpora documentation built on Nov. 16, 2020, 12:45 a.m.