I explicitly use this package to teach data cleaning, so have refactored my old cleaning code into several scripts. I also include them as compiled Markdown reports. Caveat: these are realistic cleaning scripts! Not the highly polished ones people write with 20/20 hindsight :) I wouldn't necessarily clean it the same way again (and I would download more recent data!), but at this point there is great value in reproducing the data I've been using for ~5 years.
Cleaning history
gdata
package. It was kind of painful, due to encoding and other issues. See the scripts in this state in v0.1.0.readxl
. This was much less painful. Present day.| r_script | notebook | tsv | |:------------------------------------------------------------------------|:--------------------------------------------------------------------------|:-----------------------------------------------------------------------------------------------------| | 01_extract-from-excel-pop.R | 01_extract-from-excel-pop.md | 01_pop.tsv | | 02_extract-from-excel-lifeExp.R | 02_extract-from-excel-lifeExp.md | 02_lifeExp.tsv | | 03_extract-from-excel-gdpPercap.R | 03_extract-from-excel-gdpPercap.md | 03_gdpPercap.tsv | | 04_merge-pop-lifeExp-gdpPercap.R | 04_merge-pop-lifeExp-gdpPercap.md | 04_gap-merged.tsv | | 05_impute-china-1952-gdpPercap.R | 05_impute-china-1952-gdpPercap.md | 05_gap-merged-with-china-1952.tsv | | 06_smell-test-gap-merged.R | 06_smell-test-gap-merged.md | | | 07_fill-and-fix-continent.R | 07_fill-and-fix-continent.md | 07_gap-merged-with-continent.tsv | | 08_filter-every-five-years.R | 08_filter-every-five-years.md | 08_gap-every-five-years.tsv | | 09_add-data-to-package.R | 09_add-data-to-package.md | | | 40_make-color-scheme.R | 40_make-color-scheme.md | 40_continent-colors.tsv, 40_country-colors.tsv |
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.