datacheck: Tools for Checking Data Consistency

Functions to check variables against a set of data quality rules. A rule file can be accompanied by look-up tables. In addition, some convenience functions are included that may serve as examples for defining clearer 'data rules'. An HTML-based user interface facilitates initial exploration of the functionality.

Install the latest version of this package by entering the following in R:
install.packages("datacheck")
Author: "Simon, Reinhard (CIP)" <R.SIMON@CGIAR.ORG>, Jose Francisco Loff <jfloff at phistat.com>
Date of publication: 2015-04-18 08:56:10
Maintainer: "Simon, Reinhard (CIP)" <R.SIMON@CGIAR.ORG>
License: MIT + file LICENSE
Version: 1.2.2
URL: https://github.com/c5sire/datacheck

Man pages

as_rules: Converts a vector of lines into a rules data frame

as.rules: Converts a vector of lines into a rules data frame

datacheck-package: Check a table against a set of constraints or rules defined...

datadict_profile: Create a data quality profile (main function)

datadict.profile: Create a data quality profile (main function)

has_punct: Tests for presence of most common punctuation characters

has.punct: Tests for presence of most common punctuation characters

has_rule_errors: Quick check if a rule profile on a table has any errors.

has.ruleErrors: Quick check if a rule profile on a table has any errors.

heatmap_quality: Draws a heatmap based on data quality scores

heatmap.quality: Draws a heatmap based on data quality scores

is_datadict_profile: is.datadict.profile

is.datadict.profile: is.datadict.profile

is_one_of: Tests if a string or 'factor level' is one of a pre-defined...

is.oneOf: Tests if a string or 'factor level' is one of a pre-defined...

is_only_lowers: Tests if a string has only lower case letters

is.onlyLowers: Tests if a string has only lower case letters

is_proper_name: Tests if string is like a proper name with initial letter in...

is.properName: Tests if string is like a proper name with initial letter in...

is_within_range: Tests if a numeric value is between a minimal and maximum...

is.withinRange: Tests if a numeric value is between a minimal and maximum...

pkg_version: Get the current version of a package

pkg.version: Get the current version of a package

prep4rep: Prepares a summary table for display in a 'printed' report.

read_rules: Reads a file containing rules in data dictionary format.

read.rules: Reads a file containing rules in data dictionary format.

rule_coverage: Dotchart of rules per variable

ruleCoverage: Dotchart of rules per variable

run_datacheck: Presents the package's graphical user interface

runDatacheck: Presents the package's graphical user interface

score_sum: Line chart of cumulative sum of rule scores.

scoreSum: Line chart of cumulative sum of rule scores.

short_summary: Produces a tabular summary of descriptive statistics using...

shortSummary: Produces a tabular summary of descriptive statistics using...

Files

inst
inst/examples
inst/examples/rules1.R
inst/examples/location.csv
inst/examples/is_onlyLowers.R
inst/examples/rules2.R
inst/examples/is_oneOf.R
inst/examples/person.csv
inst/examples/soil_rules.R
inst/examples/db-err.csv
inst/examples/datadict_profile.R
inst/examples/is_withinRange.R
inst/examples/soilsamples.csv
inst/examples/person_rules.R
inst/examples/db.csv
inst/examples/has_ruleErrors.R
inst/examples/rules.R
inst/examples/is_properName.R
inst/examples/has_punct.R
inst/www
inst/www/ui.R
inst/www/server.R
inst/www/js
inst/www/js/timer.js
inst/doc
inst/doc/index.html
inst/doc/RN_1.0.4.pdf
inst/doc/RN_1.2.2.pdf
inst/doc/index.Rmd
inst/doc/RN_1.1.0.md
inst/doc/RN_1.0.4.md
inst/doc/RN_1.1.0.pdf
inst/doc/RN_1.2.2.md
NAMESPACE
NEWS
R
R/datacheck.R
vignettes
vignettes/index.html
vignettes/index.Rmd
vignettes/custom.css
vignettes/style.R
README.md
MD5
DESCRIPTION
man
man/is_datadict_profile.Rd
man/datadict.profile.Rd
man/run_datacheck.Rd
man/as.rules.Rd
man/is.onlyLowers.Rd
man/is_only_lowers.Rd
man/read_rules.Rd
man/is.withinRange.Rd
man/runDatacheck.Rd
man/read.rules.Rd
man/rule_coverage.Rd
man/is_proper_name.Rd
man/as_rules.Rd
man/prep4rep.Rd
man/heatmap_quality.Rd
man/scoreSum.Rd
man/is.datadict.profile.Rd
man/is.properName.Rd
man/pkg.version.Rd
man/has.punct.Rd
man/pkg_version.Rd
man/score_sum.Rd
man/datacheck-package.Rd
man/has_punct.Rd
man/ruleCoverage.Rd
man/is_one_of.Rd
man/shortSummary.Rd
man/has_rule_errors.Rd
man/heatmap.quality.Rd
man/has.ruleErrors.Rd
man/short_summary.Rd
man/is_within_range.Rd
man/datadict_profile.Rd
man/is.oneOf.Rd
LICENSE
