extract_duplicates: Extract all duplicates
In RemPsyc/rempsyc: Convenience Functions for Psychology

R Documentation

Description

Extract all duplicates, for visual inspection. Unlike duplicated() or dplyr::distinct(), the output also includes the first occurrence of each duplicated value, not only the later repeats. It also contains an additional column reporting the number of missing values in each row, to help decide which duplicates to keep.

Usage

extract_duplicates(data, id)

Arguments

`data`	The data frame.
`id`	The ID variable for which to check for duplicates.

Details

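For the easystats equivalent, see: datawizard::data_duplicated(). The related datawizard::data_unique() keeps a single row per ID rather than listing the duplicates.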
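
To illustrate how the output differs from duplicated(), here is a minimal base R sketch of the same idea. It is not the package's implementation, and the column name n_missing is a hypothetical label chosen for this example; the function's actual column name may differ.

```r
df1 <- data.frame(
  id = c(1, 2, 3, 1, 3),
  item1 = c(NA, 1, 1, 2, 3),
  item2 = c(NA, 1, 1, 2, 3),
  item3 = c(NA, 1, 1, 2, 3)
)

# duplicated() alone flags only the second and later occurrences
later_only <- duplicated(df1$id)

# Scanning from both ends also flags the first occurrence
all_dups <- duplicated(df1$id) | duplicated(df1$id, fromLast = TRUE)

# Keep every row whose id appears more than once
dups <- df1[all_dups, ]

# Count missing values per row (n_missing is a hypothetical column name)
dups$n_missing <- rowSums(is.na(dups))

# Group rows with the same id together for easier inspection
dups <- dups[order(dups$id), ]
dups
```

Only the row with id = 2 is left out here, because every other id appears more than once.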

Value

A data frame containing all duplicates.

Examples

df1 <- data.frame(
  id = c(1, 2, 3, 1, 3),
  item1 = c(NA, 1, 1, 2, 3),
  item2 = c(NA, 1, 1, 2, 3),
  item3 = c(NA, 1, 1, 2, 3)
)

extract_duplicates(df1, id = "id")

# Filter to exclude duplicates: drop row 1 (the id = 1 row with
# missing values) and row 5 (the second id = 3 row)
df2 <- df1[-c(1, 5), ]
df2
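
Hard-coded row indices work for a small example, but the missing-value count can also drive the choice. The following base R sketch keeps, for each id, the row with the fewest missing values. It is one possible rule, assumed here for illustration, and not something extract_duplicates does itself.

```r
df1 <- data.frame(
  id = c(1, 2, 3, 1, 3),
  item1 = c(NA, 1, 1, 2, 3),
  item2 = c(NA, 1, 1, 2, 3),
  item3 = c(NA, 1, 1, 2, 3)
)

# Number of missing values in each row
n_missing <- rowSums(is.na(df1))

# Sort by id, then by missing count, so the most complete
# row of each id comes first (ties keep their original order)
sorted <- df1[order(df1$id, n_missing), ]

# Keep only the first row of each id
df2 <- sorted[!duplicated(sorted$id), ]
df2
```

On this data the rule keeps the same three rows as dropping rows 1 and 5 by hand, although the rows come out sorted by id.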

RemPsyc/rempsyc documentation built on June 9, 2025, 7:42 a.m.