conceal_all: Conceal all values in a dataframe by replacing them with...

Description Usage Arguments Value

View source: R/mutaters.R

Description

For concealing raw data that shouldn't be shown as it is. Re-use not recommended yet. This is a quick hack that works properly only on certain types of data. In non-id variables, the function preserves only the variable names, range of continuous variables, unique values of discrete variables, and number of rows. Identifiers that are specified in the argument 'id' are replaced with random codes. The same id gets the same code so that dependent observations (e.g. time series of people) don't falsely turn into independent observations for a larger population of people.

Usage

1
conceal_all(dataset, id)

Arguments

dataset

A dataframe to be concealed.

id

A character vector of names of the identifier variables.

Value

A dataframe with the same basic structure (rows, columns, unique values, minimums, maximums) but completely randomized to conceal sensitive data.


eteppo/tvs-project documentation built on Aug. 13, 2019, 8:53 a.m.