The guiding principle behind duawranglr is to make it easier for organizations to share data that contain protected elements and/or personally identifiable information (PII) with researchers. There are two key problems this package attempts to solve:
The duawranglr package is designed with the idea that, rather than setting up a new data usage agreement (DUA) for each project in an ongoing collaboration between researchers and data partners, two things will happen instead:
An example crosswalk file (e.g. a CSV file or Excel spreadsheet) might look like this:
|level_i|level_ii|level_iii|
|:------:|:-------:|:--------:|
|sid|sid|sid|
|sname|sname|sname|
|dob|dob| |
|gender| | |
|raceeth| | |
|tid| | |
|tname|tname|tname|
|zip|zip| |
Each column represents a restriction level (`level_i`, `level_ii`, or `level_iii`) along with the corresponding data element names that are restricted at that level. In this crosswalk, like variable names have been aligned so that they are easier to compare, but the elements can be included in whichever way makes most sense to the data administrator.
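To make the column-per-level idea concrete, here is a minimal base R sketch of how a crosswalk like the example could be used to find which columns of a data frame are restricted at a given level. It does not use or reproduce the duawranglr functions: the helpers `restricted_at()` and `find_restricted()`, the toy data frame, and the blank-cell handling are assumptions made for illustration only.

```r
# Crosswalk mirroring the example table, built inline so the sketch is
# self-contained. Empty strings stand in for the blank cells.
cw <- data.frame(
  level_i   = c("sid", "sname", "dob", "gender", "raceeth", "tid", "tname", "zip"),
  level_ii  = c("sid", "sname", "dob", "", "", "", "tname", "zip"),
  level_iii = c("sid", "sname", "", "", "", "", "tname", ""),
  stringsAsFactors = FALSE
)

# Hypothetical helper (not part of duawranglr): element names restricted
# at one level, ignoring blank or missing cells.
restricted_at <- function(crosswalk, level) {
  elements <- crosswalk[[level]]
  elements[!is.na(elements) & nzchar(elements)]
}

# Hypothetical helper: columns of `df` that are restricted at `level`.
find_restricted <- function(df, crosswalk, level) {
  intersect(names(df), restricted_at(crosswalk, level))
}

# Toy data set (invented for this sketch).
df <- data.frame(
  sid   = c(1, 2),
  dob   = c("2001-01-01", "2002-02-02"),
  score = c(90, 85),
  stringsAsFactors = FALSE
)

find_restricted(df, cw, "level_i")    # "sid" "dob"
find_restricted(df, cw, "level_iii")  # "sid"
```

Because blank cells are dropped before matching, the rows do not need to be aligned across columns, which is consistent with the note that elements can be listed in whichever order suits the data administrator.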
The restriction level names are arbitrary as far as the package goes, but in conjunction with a DUA, they have meaning:
The benefit of this level-plus-crosswalk system is two-fold:
Functions in the package do not
There are many packages, such as those in the tidyverse suite, that are already well suited to data wrangling tasks. There is no need to replicate those functions in this package.
It should also go without saying that nothing forces users to call the functions in this package when attempting to secure restricted data; they can simply not use them. What this package offers is a framework and a set of useful functions that, when followed, help users secure data in a clear and replicable manner that allows data administrators to more easily participate in the process.