Description Usage Arguments Value Examples
View source: R/import_and_clean_data.R
Calls the deduplicate function from synthesisr to flag and remove duplicate entries from a data frame
1 | remove_duplicates(df, field, method = c("string_osa", "fuzzdist", "exact"))
|
df |
the data frame to deduplicate |
field |
the name or index of the column to check for duplicate values |
method |
the manner of duplicate detection; exact removes exact text duplicates, stringdist removes duplicates below a similarity threshold, and fuzzy uses fuzzdist matching |
a deduplicated data frame
1 2 | my_df <- data.frame(title=c("Picoides", "picoides", "Seiurus"), id=c("01", "02", "03"))
remove_duplicates(my_df, "title", "exact")
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.