In conjugateprior/cbn: Tools and replication materials for Caliskan, Bryson, and Narayanan (2017)

knitr::opts_chunk$set(
  collapse = TRUE,
  comment = "#>"
)

WEAT (Table 1)

In the following we set the number of permutations to 1000. This means that, although the point estimates should agree with the table in the paper, the p values will be relatively imprecise. To make them more precise, change 1000 to a larger number and be prepared to wait a little longer. In most cases the p value is less than 0.0001, so the imprecision has no real implications for statistical confidence.
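
To get a feel for how the number of permutations limits precision, the sketch below uses the usual binomial approximation for the Monte Carlo standard error of an estimated p value. The helper perm_p_se is a hypothetical name introduced here, not part of cbn, and the approximation assumes the permutations are drawn independently at random.

```r
# Approximate Monte Carlo standard error of a permutation p value
# estimated from n_perm random permutations (binomial approximation).
# perm_p_se is a hypothetical helper, not a cbn function.
perm_p_se <- function(p, n_perm) sqrt(p * (1 - p) / n_perm)

n_perm <- c(1000, 10000, 100000)
data.frame(
  n_perm = n_perm,
  se_at_p_05 = perm_p_se(0.05, n_perm),  # standard error if the true p were 0.05
  step = 1 / n_perm                      # estimated p values move in steps of about this size
)
```

Multiplying the number of permutations by 100 shrinks the standard error by a factor of 10, which is why extra precision costs noticeably more waiting time.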

First we'll load the package and set up some graphics parameters.

library(cbn)

library(ggplot2)
theme_set(theme_minimal())

Flowers vs Insects

its <- cbn_get_items("WEAT", 1)
summary(its)
vecs <- cbn_get_item_vectors("WEAT", 1)
weat_perm(its, vecs, x_name = "Flowers", y_name = "Insects", 
          a_name = "Pleasant", b_name = "Unpleasant", 1000)
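
For readers who want to see what is being estimated, here is a minimal sketch of the WEAT effect size and one-sided permutation p value as defined in Caliskan, Bryson, and Narayanan (2017), run on made-up random vectors. The helpers cosine, assoc, weat_effect_size and weat_perm_p are hypothetical names written for this illustration; the package's own weat_perm may differ in its details and output.

```r
# Toy WEAT computation on random vectors (rows are word vectors).
# All function names below are hypothetical, not part of cbn.

cosine <- function(u, v) sum(u * v) / sqrt(sum(u^2) * sum(v^2))

# s(w, A, B): mean cosine of w with the rows of A minus that with the rows of B
assoc <- function(w, A, B) {
  mean(apply(A, 1, cosine, v = w)) - mean(apply(B, 1, cosine, v = w))
}

# Difference in mean association of X and Y items, scaled by the
# standard deviation of the association over all items in X and Y
weat_effect_size <- function(X, Y, A, B) {
  s_x <- apply(X, 1, assoc, A = A, B = B)
  s_y <- apply(Y, 1, assoc, A = A, B = B)
  (mean(s_x) - mean(s_y)) / sd(c(s_x, s_y))
}

# One-sided p value: share of random equal-size re-partitions of the
# X and Y items whose test statistic exceeds the observed one
weat_perm_p <- function(X, Y, A, B, n_perm = 1000) {
  s_all <- c(apply(X, 1, assoc, A = A, B = B),
             apply(Y, 1, assoc, A = A, B = B))
  n_x <- nrow(X)
  stat <- function(idx) sum(s_all[idx]) - sum(s_all[-idx])
  observed <- stat(seq_len(n_x))
  permuted <- replicate(n_perm, stat(sample(length(s_all), n_x)))
  mean(permuted > observed)
}

set.seed(1)
dims <- 5
X <- matrix(rnorm(4 * dims), nrow = 4)  # made-up target vectors
Y <- matrix(rnorm(4 * dims), nrow = 4)
A <- matrix(rnorm(3 * dims), nrow = 3)  # made-up attribute vectors
B <- matrix(rnorm(3 * dims), nrow = 3)

weat_effect_size(X, Y, A, B)
weat_perm_p(X, Y, A, B, n_perm = 1000)
```

Because the vectors are random noise, the effect size here carries no meaning; the point is only to show which quantities the point estimate and the p value refer to.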

Instruments vs Weapons

its <- cbn_get_items("WEAT", 2)
summary(its)
vecs <- cbn_get_item_vectors("WEAT", 2)
weat_perm(its, vecs, x_name = "Instruments", y_name = "Weapons", 
          a_name = "Pleasant", b_name = "Unpleasant", 1000)

European-American vs African-American Names (1)

its <- cbn_get_items("WEAT", 3)
summary(its)
vecs <- cbn_get_item_vectors("WEAT", 3)
weat_perm(its, vecs, x_name = "EuropeanAmericanNames", 
          y_name = "AfricanAmericanNames", 
          a_name = "Pleasant", b_name = "Unpleasant", 1000)

European-American vs African-American Names (2)

its <- cbn_get_items("WEAT", 4)
summary(its)
vecs <- cbn_get_item_vectors("WEAT", 4)
weat_perm(its, vecs, x_name = "EuropeanAmericanNames", 
          y_name = "AfricanAmericanNames", 
          a_name = "Pleasant", b_name = "Unpleasant", 1000)

European-American vs African-American Names (3)

its <- cbn_get_items("WEAT", 5)
summary(its)
vecs <- cbn_get_item_vectors("WEAT", 5)
weat_perm(its, vecs, x_name = "EuropeanAmericanNames", 
          y_name = "AfricanAmericanNames", 
          a_name = "Pleasant", b_name = "Unpleasant", 1000)

Male vs Female Names

its <- cbn_get_items("WEAT", 6)
summary(its)
vecs <- cbn_get_item_vectors("WEAT", 6)
weat_perm(its, vecs, x_name = "MaleNames", y_name = "FemaleNames", 
          a_name = "Career", b_name = "Family", 1000)

Math vs Arts

its <- cbn_get_items("WEAT", 7)
summary(its)
vecs <- cbn_get_item_vectors("WEAT", 7)
weat_perm(its, vecs, x_name = "Math", y_name = "Arts",
          a_name = "MaleTerms", b_name = "FemaleTerms", 1000)

Science vs Arts

its <- cbn_get_items("WEAT", 8)
summary(its)
vecs <- cbn_get_item_vectors("WEAT", 8)
weat_perm(its, vecs, x_name = "Science", y_name = "Arts", 
          a_name = "MaleTerms", b_name = "FemaleTerms", 1000)

Mental vs Physical Disease

its <- cbn_get_items("WEAT", 9)
summary(its)
vecs <- cbn_get_item_vectors("WEAT", 9)
weat_perm(its, vecs, x_name = "MentalDisease", y_name = "PhysicalDisease", 
          a_name = "Temporary", b_name = "Permanent", 1000)

Young vs Old People's Names

its <- cbn_get_items("WEAT", 10)
summary(its)
vecs <- cbn_get_item_vectors("WEAT", 10)
weat_perm(its, vecs, x_name = "YoungNames", y_name = "OldNames", 
          a_name = "Pleasant", b_name = "Unpleasant", 1000)

WEFAT (Figure 1)

tba

WEFAT (Figure 2)

its <- cbn_get_items("WEFAT", 2)
its_vecs <- cbn_get_item_vectors("WEFAT", 2)
res <- wefat(its, its_vecs, x_name = "AndrogynousNames",
             a_name = "FemaleAttributes", b_name = "MaleAttributes")
head(res)
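
As a rough guide to what the per-word statistic measures, the sketch below computes the WEFAT association for a single word vector as defined in the paper: the difference in mean cosine similarity to the two attribute sets, divided by the standard deviation of the similarities over both sets. The vectors are random and the helper names cosine and wefat_stat are hypothetical; the package's wefat function may be implemented differently.

```r
# Toy WEFAT statistic for one word vector (rows of A and B are attribute vectors).
# cosine and wefat_stat are hypothetical helpers, not part of cbn.

cosine <- function(u, v) sum(u * v) / sqrt(sum(u^2) * sum(v^2))

wefat_stat <- function(w, A, B) {
  sim_a <- apply(A, 1, cosine, v = w)  # similarity of w to each A attribute
  sim_b <- apply(B, 1, cosine, v = w)  # similarity of w to each B attribute
  (mean(sim_a) - mean(sim_b)) / sd(c(sim_a, sim_b))
}

set.seed(2)
dims <- 5
w <- rnorm(dims)                        # made-up word vector
A <- matrix(rnorm(4 * dims), nrow = 4)  # made-up attribute set A
B <- matrix(rnorm(4 * dims), nrow = 4)  # made-up attribute set B

wefat_stat(w, A, B)
```

With a_name = "FemaleAttributes" and b_name = "MaleAttributes" as in the call above, a positive value under this definition would indicate a stronger association with the female attribute words.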

Next we find the gender proportions for each name from the census. In the paper a gender score is constructed from the population proportions, but it is not clear how this was done, or where the data came from beyond 'the 1990 US census'. The replication materials bundle these as cbn_gender_name_stats_census1990.

data(cbn_gender_name_stats_census1990)
head(cbn_gender_name_stats_census1990)

However, it's not clear how the graph's x values come out of this data set, so we'll instead use the gender package, which queries the US Social Security Administration to get the proportion of stated males and females with any particular first name. A version of this data is bundled with this package.

data(cbn_gender_name_stats)
head(cbn_gender_name_stats)

We join it to res

res <- merge(res, cbn_gender_name_stats, 
             by.x = "Word", by.y = "name")

and plot the statistic against the gender proportions (converted to percentages)

ggplot(res, aes(x = 100 * proportion_female, y = S_wab, color = S_wab)) +
  geom_hline(yintercept = 0, size = 2, col = "grey") + 
  geom_point(size = 5, alpha = 0.9) +
  scale_colour_gradient2(low = "blue", mid = "yellow", high = "red", 
                         guide = "none") +
  xlim(0, 100) +
  ylim(-2, 2) +
  xlab("Percentage of people with name who are women") +
  ylab("Strength of association of name vector with female gender")
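
One thing to keep in mind about the join that precedes the plot: merge() performs an inner join by default, so any name without a match in the gender data is silently dropped. The small example below uses invented names and numbers (not values from cbn) to show the behaviour and one way to check which rows were lost.

```r
# Made-up data to illustrate merge()'s default inner join.
# The names and values here are invented for illustration only.
scores <- data.frame(Word = c("kelly", "taylor", "jordan"),
                     S_wab = c(0.4, 0.1, -0.3))
stats <- data.frame(name = c("kelly", "jordan", "casey"),
                    proportion_female = c(0.85, 0.25, 0.60))

# Only words present in both tables survive the join
joined <- merge(scores, stats, by.x = "Word", by.y = "name")
joined

# Words in scores that have no gender statistics and were dropped
setdiff(scores$Word, stats$name)
```

Passing all.x = TRUE to merge() would keep unmatched words with NA proportions instead, which can make missing matches easier to spot.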

The correlation is

cor.test(res$S_wab, res$proportion_female)

which is a tiny bit stronger than the relationship in the paper.

conjugateprior/cbn documentation built on June 26, 2019, 2:28 p.m.
