mergeduplicates: Clean Duplicates in Dataframes

Description Usage Arguments Value Examples

Description

function to merge duplicate rows in a data.frame based on a set of id fields. other fields are concatenated together using paste(x, collapse="| "). This allows for cleaning based on all available information.

Usage

1
mergeduplicates(df, idfields, sep = "|")

Arguments

df

Data frame containing duplicates

idfields

The fields that should be unique

sep

The separator to pass to paste() when concatenating duplicate fields together

Value

The resulting data frame.

Examples

1
2
3
4
5
6
df <- data.frame(x=c(1,2,3,4,1,2,3,4), 
                 y=c(1,2,3,4,1,2,3,4), 
                 label=c("one", "two", "three", "four", 
                         "five", "six", "seven", "eight"))
df
mergeduplicates(df, idfields=c("x", "y"))

paleolimbot/rfuncs documentation built on May 24, 2019, 6:13 p.m.