util_int_duplicate_content_dataframe: Check for duplicated content

View source: R/util_int_duplicate_content_dataframe.R

util_int_duplicate_content_dataframeR Documentation

Check for duplicated content

Description

This function tests for duplicates entries in the data set. It is possible to check duplicated entries by study segments or to consider only selected segments.

Usage

util_int_duplicate_content_dataframe(
  level = c("dataframe"),
  identifier_name_list
)

Arguments

level

character a character vector indicating whether the assessment should be conducted at the study level (level = "dataframe") or at the segment level (level = "segment").

identifier_name_list

vector the vector that contains the name of the identifier to be used in the assessment. For the study level, corresponds to the names of the different data frames. For the segment level, indicates the name of the segments.

Value

a list with

  • SegmentData: data frame with the results of the quality check for duplicated entries

  • SegmentTable: data frame with selected duplicated entries check results, used for the data quality report.

  • Duplicates: vector with row indices of duplicated entries, if any, otherwise NULL.

See Also

Other integrity_indicator_functions: util_int_duplicate_content_segment(), util_int_duplicate_ids_dataframe(), util_int_duplicate_ids_segment(), util_int_unexp_records_set_dataframe(), util_int_unexp_records_set_segment()


dataquieR documentation built on May 29, 2024, 7:18 a.m.