duplicates: Tag, report, delete or keep duplicate observations
In myominnoo/mStats_beta: A tool for data management and statistical analysis

Description Usage Arguments Details Author(s) See Also Examples

duplicates displays structure of a data frame

duplicates(data, ..., print.table = TRUE)

keepUnique(data, ..., print.table = TRUE)

keepDup(data, ..., print.table = TRUE)

`data`	dataframe
`...`	any variables within dataframe for unique id
`print.table`	logical value to display formatted outputs

duplicates

tags duplicate observations within dataframe with a new variable called dupID_ and reports statistics. Duplicates are observations with identical values either on all variables if no variable is specified in the optional argument ... or on a specified list of variables.

ANNOTATIONS:

Copies - Number of duplicates

Observations - Number of records per Copies

Surplus - Number of surplus copies

keepUnique

delete all but the first occurrence of each group of duplicated observations.

keepDup keep all but the first occurrence of each group of duplicated observations. This function returns the opposite dataset generated from keepUnique.

Myo Minn Oo (Email: dr.myominnoo@gmail.com | Website: https://myominnoo.github.io/)

keep, lose

## Not run: 
# finding duplicates across all variables
duplicates(iris)

# finding duplicates on variables of interest
duplicates(iris, Sepal.Length, Sepal.Width)
duplicates(iris, Species)
duplicates(iris, Sepal.Length, Sepal.Width, print.table = FALSE)

# Keep Unique records
keepUnique(iris, Sepal.Length)
keepUnique(iris, Species)
keepUnique(infert, case)

# Keep duplicated records (opposite of keep unique records)
keepDup(iris, Sepal.Length)
keepDup(iris, Species)
keepDup(infert, case)

## End(Not run)