Description Usage Arguments Value Examples
Description
1 2 3 4 5 6 7 8 | dedup_data(
.score,
.source,
.target,
.cols_match,
.min_sim = NULL,
.col_score = c("sms", "smw", "smc", "sss", "ssw", "ssc")
)
|
.score |
Dataframe generated by scores_data() |
.source |
The Source Dataframe. |
.target |
The Target Dataframe. |
.cols_match |
A character vector of columns to perform fuzzy matching. |
.min_sim |
Named vector with minimum similarities |
.col_score |
Score column generated by scores_data().
|
A dataframe
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 | tab_source <- table_source[1:100, ]
tab_target <- table_target[1:999, ]
cols_match <- c("name", "iso3", "city", "address")
cols_exact <- "iso3"
cols_join <- c("name", "iso3")
tab_match <- match_data(
.source = tab_source,
.target = tab_target,
.cols_match = cols_match,
.cols_exact = cols_exact,
.cols_join = cols_join,
.method = "soundex"
)
tab_score <- scores_data(
.matches = tab_match,
.source = tab_source,
.target = tab_target,
.cols_match = cols_match,
.cols_exact = cols_exact
)
dedup_data(
.score = tab_score,
.source = tab_source,
.target = tab_target,
.cols_match = cols_match,
.col_score = "sms"
)
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.