download_extdata: Download Some Extra Data for Peace Science Research

View source: R/download_extdata.R

download_extdataR Documentation

Download Some Extra Data for Peace Science Research

Description

download_extdata() leverages R's inst directory flexibility to allow you to download some extra data and store it in the package.

Usage

download_extdata(overwrite = FALSE)

Arguments

overwrite

logical, defaults to FALSE. If FALSE, the function checks to see if you've already downloaded the data and, if you already have, it does nothing. If TRUE, the function redownloads the data.

Value

download_extdata() downloads some extra data stored on my website (http://svmiller.com) and sticks them in the extdata directory in the package.

A Description of Various Data Sets This Will Download

Running download_extdata() returns the following data that will be stored in the package's extdata directory.

Correlates of War Dyadic Trade Data Set (v. 4.0)

These are directed dyad-year-level data for dyadic trade from the Correlates of War project. The trade values presented here have been rounded to three decimal points to conserve space. The data downloaded by this function are about 4.1 megabytes in size.

COLUMN DESCRIPTION
ccode1 a numeric vector for the Correlates of War state code for the first state
ccode2 a numeric vector for the Correlates of War state code for the second state
year the year
flow1 imports of ccode1 from ccode2, in current million USD
flow2 imports of ccode2 from ccode1, in current million USD
smoothflow1 smoothed flow1 values
smoothflow2 smoothed flow2 values

Directed Leader Dyad-Year Data, 1870-2015 (CoW States)

These are all directed leader dyad-year data from 1870-2015. Data come from the Archigos data (version 4.1). The data are standardized to just those observations where both leaders and states appear in the CoW state system data. The data downloaded by this function are about 2 megabytes in size.

COLUMN DESCRIPTION
year the year
obsid1 the unique Archigos (v. 4.1) observation ID for the first leader
obsid2 the unique Archigos (v. 4.1) observation ID for the second leader
ccode1 a numeric vector for the Correlates of War state code for the first state
ccode2 a numeric vector for the Correlates of War state code for the second state
gender1 the gender of obsid1 ("M" or "F")
gender2 the gender of obsid2 ("M" or "F")
leaderage1 the approximate age (i.e. year - yrborn) for obsid1 in the year
leaderage2 the approximate age (i.e. year - yrborn) for obsid2 in the year
yrinoffice1 a running count for the tenure of obsid1, starting at 1.
yrinoffice2 a running count for the tenure of obsid2, starting at 1.

Directed Leader Dyad-Year Data, 1870-2015 (Gleditsch-Ward States)

These are all directed leader dyad-year data from 1870-2015. Data come from the Archigos data (version 4.1). The data represent every possible dyadic leader-pairing in the Archigos data (which is denominated in the Gleditsch-Ward system), but standardizes leader dyad-years to Gleditsch-Ward state system dates. The data downloaded by this function are about 2.2 megabytes in size.

COLUMN DESCRIPTION
year the year
obsid1 the unique Archigos (v. 4.1) observation ID for the first leader
obsid2 the unique Archigos (v. 4.1) observation ID for the second leader
gwcode1 a numeric vector for the Gleditsch-Ward state code for the first state
gwcode2 a numeric vector for the Gleditsch-Ward state code for the second state
gender1 the gender of obsid1 ("M" or "F")
gender2 the gender of obsid2 ("M" or "F")
leaderage1 the approximate age (i.e. year - yrborn) for obsid1 in the year
leaderage2 the approximate age (i.e. year - yrborn) for obsid2 in the year
yrinoffice1 a running count for the tenure of obsid1, starting at 1.
yrinoffice2 a running count for the tenure of obsid2, starting at 1.

Chance-Corrected Measures of Foreign Policy Similarity (FPSIM, v. 2)

The FPSIM data set provides measures of foreign policy similarity of dyads based on alliance ties (Correlates of War, version 4.1) and UN General Assembly voting (Voeten, version 17) for all members of the Correlates of War state system. The alliance data cover the time period from 1816 to 2012, and the UN voting data from 1946 to 2015. The similarity measures include various versions of Ritter and Signorino's S (weighted/non-weighted by material capabilities; squared/absolute distance metrics) as well as the chance-corrected measures Cohen's (1960) kappa and Scott's (1955) pi. The measures based on alliance data come in two versions: one is based on valued alliance ties and the other is based on binary alliance ties. Data were last updated on December 7, 2017, and this description was effectively plagiarized (with his blessing) from Frank Haege's Dataverse.

These data are directed dyad-years with 17 columns and 1,872,198 observations. They will almost certainly be the largest data set I nudge/ask you to download remotely. The file containing this information is 18.6 MB in size. To reduce size further, these decimal points have also been rounded to three spots.

Haege generated all estimates of dyadic foreign policy similarity, except for the taub column. That was generated separately, by me.

COLUMN DESCRIPTION
year the year
ccode1 the Correlates of War state code for the first state
ccode2 the Correlates of War state code for the second state
taub Tau-b (valued alliance data)
srsvas unweighted S (squared distances, valued alliance data)
srswvas weighted S (squared distances, valued alliance data)
srsvaa unweighted S (absolute distances, valued alliance data)
srswvaa weighted S (absolute distances, valued alliance data)
kappava Kappa (squared distances, valued alliance data)
piva Pi (squared distances, valued alliance data)
srsba Unweighted S (binary alliance data)
srswba Weighted S (binary alliance data)
kappaba Kappa (binary alliance data)
piba Pi denominator (binary alliance data)
srsvvs Unweighted S (squared distances, valued UN voting data)
srsvva Unweighted S (absolute distances, valued UN voting data)
kappavv Kappa (squared distances, valued UN voting data)
pivv Pi (squared distances, valued UN voting data)

(Non-Directed) Dyadic Minimum Distance Data Plus (CoW States)

These are non-directed dyadic minimum distance data from Schvitz et al. (2022) for all Correlates of War states from the start of 1886 to the end of 2019. Note that I call these "data plus", with the idea of informally branding these as a kind of augmentation of what you might otherwise do with the cshapes package. This data set has over 4.4 million rows for each dyadic minimum distance for all available years. Within each year, there is a recorded minimum distance for Jan. 1, June 30, Dec. 31 and, in addition, any day within the year where the composition of the international system (or shape of a state) changed, as recorded in cshapes. Sometimes these changes concern the dyadic minimum distance; sometimes they don't. For example, the League of Nations is responsible for a lot shape changes (i.e. system entry) in the CoW state system data in the year 1920. That obviously won't change the dyadic minimum distance between the U.S. and Canada, which will always be zero. Sometimes the start of the year (Jan. 1), the midpoint of the year (June 30), or the end of the year (Dec. 31) coincides with a system change. Often it doesn't. Note that a referent day (Jan. 1, June 30, Dec. 31) may not appear in a given year for a given dyad if that date exists outside CoW state system membership. For example, Canada doesn't appear as a state system member until Jan. 10, 1920. The goal of this data set is allow you to more quickly generate dyadic minimum distances within peacesciencer's functionality if you are proficient in tidyverse verbs. You could also use it to highlight how often the dyadic minimum distance may vary within a year for a given dyad.

Despite the dimensions of the data set, it's not too big of a download. The data are about 1.7 MB in size.

COLUMN DESCRIPTION
ccode1 the Correlates of War state code for the first state
ccode2 the Correlates of War state code for the second state
year the year
date a date, coinciding with either a system change date or a referent day (i.e. Jan. 1, June 30, Dec. 31)
change_date a date that, when present, indicates the shape of the system changed on that day
mindist the dyadic minimum distance (in kilometers)

(Non-Directed) Dyadic Minimum Distance Data Plus (G-W States)

These are non-directed dyadic minimum distance data from Schvitz et al. (2022) for all Gleditsch-Ward states from the start of 1886 to the end of 2019. Note that I call these "data plus", with the idea of informally branding these as a kind of augmentation of what you might otherwise do with the cshapes package. This data set has over 3.7 million rows for each dyadic minimum distance for all available years. Within each year, there is a recorded minimum distance for Jan. 1, June 30, Dec. 31 and, in addition, any day within the year where the composition of the international system (or shape of a state) changed, as recorded in cshapes. Sometimes these changes concern the dyadic minimum distance; sometimes they don't. For example, the dissolution of the Soviet Union is responsible for a lot shape changes (i.e. system entry) in 1991. That obviously won't change the dyadic minimum distance between the U.S. and Canada, which will always be zero. Sometimes the start of the year (Jan. 1), the midpoint of the year (June 30), or the end of the year (Dec. 31) coincides with a system change. Often it doesn't. Note that a referent day (Jan. 1, June 30, Dec. 31) may not appear in a given year for a given dyad if that date exists outside G-W state system membership. For example, Haiti disappears from the state system on July 4, 1915 and reappears on Aug. 15, 1934. That means there won't be any dyadic minimum distance observations with the U.S., for example, on Dec. 31, 1915 or June 30, 1934. The goal of this data set is allow you to more quickly generate dyadic minimum distances within peacesciencer's functionality if you are proficient in tidyverse verbs. You could also use it to highlight how often the dyadic minimum distance may vary within a year for a given dyad.

Despite the dimensions of the data set, it's not too big of a download. The data are about 1.4 MB in size.

COLUMN DESCRIPTION
gwcode1 the Gleditsch-Ward state code for the first state
gwcode2 the Gleditsch-Ward state code for the second state
year the year
date a date, coinciding with either a system change date or a referent day (i.e. Jan. 1, June 30, Dec. 31)
change_date a date that, when present, indicates the shape of the system changed on that day
mindist the dyadic minimum distance (in kilometers)

Author(s)

Steven V. Miller

References

Barbieri, Katherine, Omar M. G. Keshk, and Brian Pollins. 2009. "TRADING DATA: Evaluating our Assumptions and Coding Rules." Conflict Management and Peace Science. 26(5): 471-491.

Goemans, Henk E., Kristian Skrede Gleditsch, and Giacomo Chiozza. 2009. "Introducing Archigos: A Dataset of Political Leaders" Journal of Peace Research 46(2): 269–83.

Haege, Frank. 2011. "Choice or Circumstance? Adjusting Measures of Foreign Policy Similarity for Chance Agreement." Political Analysis 19(3): 287-305.

Schvitz, Guy, Luc Girardin, Seraina Ruegger, Nils B. Weidmann, Lars-Erik Cederman, and Kristian Skrede Gleditsch. 2022. "Mapping The International System, 1886-2017: The CShapes 2.0 Dataset." Journal of Conflict Resolution. 66(1): 144-161.

Weidmann, Nils B. and Kristian Skrede Gleditsch. 2010. "Mapping and Measuring Country Shapes: The cshapes Package." The R Journal 2(1): 18-24.

Examples


## Not run: 
# Here's where the data are going to be downloaded.
system.file("extdata", package="peacesciencer")
# Now, let's download the data.
download_extdata()

## End(Not run)

peacesciencer documentation built on March 31, 2023, 8:37 p.m.