A dataset demonstrating Simpson's Paradox. It contains a strongly positively correlated dataset (simpson_1)
and a second dataset with the same positive correlation as
simpson_1, but where individual groups have a
strong negative correlation.
A data frame with 444 rows and 3 variables:
dataset: indicates which of the two datasets the data are from,
Matejka, J., & Fitzmaurice, G. (2017). Same Stats, Different Graphs: Generating Datasets with Varied Appearance and Identical Statistics through Simulated Annealing. CHI 2017 Conference proceedings: ACM SIGCHI Conference on Human Factors in Computing Systems. Retrieved from https://www.autodeskresearch.com/publications/samestats.
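The pattern described above (a positive overall correlation that hides negative correlations within groups) can be sketched in base R. This is only an illustrative sketch on made-up toy data, not the 444-row data frame itself. The helper `cor_by`, the column names `x` and `y`, and the `group` column are all assumptions introduced for the illustration; the text above names only the `dataset` variable.

```r
# Hypothetical helper: Pearson correlation of x and y within each level of `key`.
# The column names x and y are assumed for this sketch.
cor_by <- function(df, key) {
  sapply(split(df, df[[key]]), function(d) cor(d$x, d$y))
}

# Toy stand-in data: two clusters, each trending downward,
# with the second cluster shifted up and to the right of the first.
toy <- data.frame(
  group = rep(c("a", "b"), each = 3),
  x = c(1, 2, 3, 4, 5, 6),
  y = c(3, 2, 1, 6, 5, 4)
)

# Correlation over all points, ignoring the clusters.
overall <- cor(toy$x, toy$y)

# Correlation inside each cluster.
within <- cor_by(toy, "group")

print(overall)
print(within)
```

The overall correlation comes out positive while each cluster's correlation is negative, which is the reversal that Simpson's Paradox refers to. The same helper could be called with `"dataset"` as the key on a data frame shaped like the one documented here, assuming its numeric columns are indeed named `x` and `y`.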