maskVal: Mask Values of a Vector

maskValR Documentation

Mask Values of a Vector

Description

Modifies the value of a vector so as to mask the information by generating random data subject to constraints and keeping the length, type, label, and units attributes of the original variable. For a binary numeric or logical variable a random vector with prevalence (by default) of 0.5 replaces the original. For a factor variable, a random multinomial sample is drawn, with equal expected frequencies of all levels. For a numeric variable, the range is preserved but the distribution is uniform over that range, and generated values are rounded by an amount equal to the minimum spacing between distinct values. Character variables are just randomly reordered. In the special case where the input vector contains only one unique non-NA value, the variable is assumed to be the type of variable where NA represents FALSE or "no", and the variable is replaced by a logical vector with the specified prevalence.

Usage

maskVal(x, prev = 0.5, NAs = TRUE)

Arguments

x

an input vector

prev

a numeric scalar specifying the prevalence for binary variables

NAs

if the variable contains NAs, keep the same expected proportion of NAs but distribute them randomly. Otherwise, the new vector will have no missing values.


harrelfe/greport documentation built on Sept. 9, 2023, 8:08 a.m.