deduplicate: Remove duplicate entries from a data frame
In elizagrames/synthesisr: Import, Assemble, and Deduplicate Systematic Review Search Results

Given a data frame and a field to check for duplicates, flags and removes duplicate entries using one of three methods.

deduplicate(df, field, method = c("quick", "similarity", "fuzzy"), language = "English", cutoff_distance = 2)

`df`	the data frame to deduplicate
`field`	the name or index of the column to check for duplicate values
`method`	the method of duplicate detection: `quick` removes exact text duplicates, `similarity` removes entries that fall below the `cutoff_distance` threshold, and `fuzzy` uses fuzzdist matching
`language`	the language to use if method is set to similarity
`cutoff_distance`	the threshold below which articles are marked as duplicates by the similarity method

a deduplicated data frame
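
To make the difference between exact and threshold-based matching concrete, here is a minimal base R sketch. It is not the package's implementation: `quick_dedup()`, `distance_dedup()`, and the `refs` data are hypothetical, and Levenshtein edit distance from `utils::adist()` is an assumed stand-in for whatever measure the `similarity` and `fuzzy` methods actually use.

```r
# Hypothetical example data: four search results, two of them near-identical.
refs <- data.frame(
  id = 1:4,
  title = c(
    "Bird diversity in urban parks",
    "Bird diversity in urban parks",
    "Bird diversity in urban park",
    "Soil carbon in restored wetlands"
  ),
  stringsAsFactors = FALSE
)

# Exact matching (the idea behind method = "quick"):
# keep only the first occurrence of each value in `field`.
quick_dedup <- function(df, field) {
  df[!duplicated(df[[field]]), , drop = FALSE]
}

# Threshold matching (an assumed illustration, not the package's algorithm):
# drop a row when its edit distance to any earlier kept row is below
# `cutoff_distance`.
distance_dedup <- function(df, field, cutoff_distance = 2) {
  values <- tolower(df[[field]])
  d <- utils::adist(values)  # pairwise Levenshtein distances
  keep <- rep(TRUE, nrow(df))
  for (i in seq_len(nrow(df))[-1]) {
    earlier <- which(keep[seq_len(i - 1)])
    if (any(d[i, earlier] < cutoff_distance, na.rm = TRUE)) {
      keep[i] <- FALSE
    }
  }
  df[keep, , drop = FALSE]
}

quick_result <- quick_dedup(refs, "title")
distance_result <- distance_dedup(refs, "title", cutoff_distance = 2)
```

With this data, exact matching removes only the verbatim repeat, while the distance-based version also removes the title that differs by a single character. Raising the cutoff catches more near-duplicates at the risk of merging distinct records.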

elizagrames/synthesisr documentation built on May 26, 2019, 10:34 a.m.

