View source: R/functions_wrapper.R
deidentify_data | R Documentation |
Deidentify a dataset
Two operations are performed on the dataset:
All ID numbers are randomized from the range 1 to n
All columns containing dates will have the year changed
The year change is done by letting the earliest year in the dataset be used as a reference and by maintaining leap years. The reference year will either be 1901, 1902, 1903 or 1904 depending on its distance to the closest preceeding leap year.
deidentify_data(df, id_column = "ID", date_columns = NULL)
df |
(data.frame) A dataset |
id_column |
(str) Name of the id column |
date_columns |
(array(str) (optional)) Names of all date columns |
(data.frame) Deidentified dataset
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.