util_int_duplicate_ids_segment: Check for duplicated IDs

View source: R/util_int_duplicate_ids_segment.R

util_int_duplicate_ids_segmentR Documentation

Check for duplicated IDs

Description

This function tests for duplicates entries in identifiers. It is possible to check duplicated identifiers by study segments or to consider only selected segments.

Usage

util_int_duplicate_ids_segment(
  level = c("segment"),
  id_vars_list,
  study_segment,
  repetitions,
  study_data,
  meta_data,
  meta_data_segment = "segment_level",
  segment_level
)

Arguments

level

character a character vector indicating whether the assessment should be conducted at the study level (level = "dataframe") or at the segment level (level = "segment").

id_vars_list

list id variable names for each segment or data frame

study_segment

vector the segments or data frame names being assessed

repetitions

vector an integer vector indicating the number of allowed repetitions in the id_vars. Currently, no repetitions are supported.

study_data

data.frame the data frame that contains the measurements, mandatory.

meta_data

data.frame the data frame that contains metadata attributes of the study data, mandatory.

meta_data_segment

data.frame – optional: Segment level metadata

segment_level

data.frame alias for meta_data_segment

Value

a list with

  • SegmentData: data frame with the results of the quality check for duplicated identifiers

  • SegmentTable: data frame with selected duplicated identifiers check results, used for the data quality report.

  • Other: named list with inner lists of unique cases containing each the row indices of duplicated identifiers separated by "|" , if any. outer names are names of the segments. Use prep_get_study_data_segment() to get the data frame the indices refer to.

See Also

Other integrity_indicator_functions: util_int_duplicate_content_dataframe(), util_int_duplicate_content_segment(), util_int_duplicate_ids_dataframe(), util_int_unexp_records_set_dataframe(), util_int_unexp_records_set_segment()


dataquieR documentation built on Jan. 8, 2026, 5:08 p.m.