data-raw/README.md

Data cleaning

I explicitly use this package to teach data cleaning, so have refactored my old cleaning code into several scripts. I also include them as compiled Markdown reports. Caveat: these are realistic cleaning scripts! Not the highly polished ones people write with 20/20 hindsight :) I wouldn't necessarily clean it the same way again (and I would download more recent data!), but at this point there is great value in reproducing the data I've been using for ~5 years.

Cleaning history

| r_script | notebook | tsv | |:------------------------------------------------------------------------|:--------------------------------------------------------------------------|:-----------------------------------------------------------------------------------------------------| | 01_extract-from-excel-pop.R | 01_extract-from-excel-pop.md | 01_pop.tsv | | 02_extract-from-excel-lifeExp.R | 02_extract-from-excel-lifeExp.md | 02_lifeExp.tsv | | 03_extract-from-excel-gdpPercap.R | 03_extract-from-excel-gdpPercap.md | 03_gdpPercap.tsv | | 04_merge-pop-lifeExp-gdpPercap.R | 04_merge-pop-lifeExp-gdpPercap.md | 04_gap-merged.tsv | | 05_impute-china-1952-gdpPercap.R | 05_impute-china-1952-gdpPercap.md | 05_gap-merged-with-china-1952.tsv | | 06_smell-test-gap-merged.R | 06_smell-test-gap-merged.md | | | 07_fill-and-fix-continent.R | 07_fill-and-fix-continent.md | 07_gap-merged-with-continent.tsv | | 08_filter-every-five-years.R | 08_filter-every-five-years.md | 08_gap-every-five-years.tsv | | 09_add-data-to-package.R | 09_add-data-to-package.md | | | 40_make-color-scheme.R | 40_make-color-scheme.md | 40_continent-colors.tsv, 40_country-colors.tsv |



YTLogos/gapminder documentation built on May 20, 2019, 1:47 p.m.