cleaninginspectoR: Basic checks that data cleaning ocurred

Description Usage Arguments Value Examples

Find duplicates / non-unique values in a variable

1	find_duplicates(data, duplicate.column.name)

`data`	a dataframe
`duplicate.column.name`	the name of the column the dataframe to be checked for duplicates as a string (in quotes)

A dataframe with one row per potential issue. It has columns for the corresponding row index in the original data; the suspicious value; the variable name in the original dataset in which the suspicious value occured; A description of the issue type.

# a test dataset with 1000 rows; one numeric variable and one id variable
testdf <- data.frame(numeric_var = runif(10), unique_ids = c(1, 2, 3, 4, 5, 6, 7, 8, 1, 3))
# find duplicates in the unique_ids column:
find_duplicates(data, "unique_ids")