View source: R/standard_checks.R
Find duplicates / non-unique values in a variable
find_duplicates(data, duplicate.column.name)
data: a dataframe
duplicate.column.name: the name of the column in the dataframe to be checked for duplicates, as a string (in quotes)
A dataframe with one row per potential issue. It has columns for the corresponding row index in the original data; the suspicious value; the name of the variable in the original dataset in which the suspicious value occurred; and a description of the issue type.
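To make the shape of the return value concrete, here is a minimal sketch in base R. It is not the package's implementation: the function name find_duplicates_sketch, the output column names (index, value, variable, issue_type), and the choice to flag only the second and later occurrences of a value are all assumptions made for illustration.

```r
# Hypothetical re-implementation for illustration only; the package's actual
# code, output column names and flagging rule may differ.
find_duplicates_sketch <- function(data, duplicate.column.name) {
  values <- data[[duplicate.column.name]]
  # duplicated() marks the second and later occurrences of each value
  dup_idx <- which(duplicated(values))
  data.frame(
    index = dup_idx,                                          # row index in the original data
    value = values[dup_idx],                                  # the suspicious value
    variable = rep(duplicate.column.name, length(dup_idx)),   # column that was checked
    issue_type = rep("duplicate value", length(dup_idx)),     # description of the issue
    stringsAsFactors = FALSE
  )
}

# Same test data as in the example: ids 1 and 3 each appear twice
testdf <- data.frame(numeric_var = runif(10),
                     unique_ids = c(1, 2, 3, 4, 5, 6, 7, 8, 1, 3))
issues <- find_duplicates_sketch(testdf, "unique_ids")
print(issues)
```

Under these assumptions the result has one row per repeated occurrence, so the first appearance of each id is not reported. If every occurrence should be flagged instead, combining duplicated(values) with duplicated(values, fromLast = TRUE) would be one way to do it.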
# a test dataset with 10 rows: one numeric variable and one id variable
testdf <- data.frame(numeric_var = runif(10), unique_ids = c(1, 2, 3, 4, 5, 6, 7, 8, 1, 3))
# find duplicates in the unique_ids column:
find_duplicates(testdf, "unique_ids")