camr_deidentified_codebook_entry: Flag De-identified Variables

View source: R/R07-Codebook.R

camr_deidentified_codebook_entryR Documentation

Flag De-identified Variables

Description

Function to flag whether a variable in a data frame is de-identified (i.e., does not risk exposure of patient health information) based on codebook entries.

Usage

camr_deidentified_codebook_entry(dtf)

Arguments

dtf

A data frame.

Details

According to the Health Insurance Portability and Accountability Act of 1996 (HIPAA), under the 'Safe Harbor' method of de-identification, there are 18 types of identifiers that should be removed for a data set to avoid risk of exposing patient health information (PHI):

  • Names

  • All geographic subdivisions smaller than a state

  • All elements of dates (except year) directly related to an individual

  • Telephone numbers

  • Vehicle identifiers and serial numbers

  • Fax numbers

  • Device identifiers and serial numbers

  • Email addresses

  • Web Universal Resource Locators (URLs)

  • Social security numbers

  • Internet Protocol (IP) addresses

  • Medical record numbers

  • Biometric identifiers

  • Health plan beneficiary numbers

  • Full-face photographs and comparable images

  • Account numbers

  • Any other unique identifying number, characteristic, or code

  • Certificate/license numbers

Value

A logical vector matching in length to the number of colunns in dtf, equal to TRUE for any variables with codebook entries indicating the variable is de-identified.

Author(s)

Kevin Potter

Examples

data( "example_CAM_data_set" )


rettopnivek/camrprojects documentation built on March 26, 2024, 9:17 a.m.