fuzzy_dedupes: Find inexact duplicates in a vector of handtype strings.

Usage Arguments Author(s) Examples

View source: R/fuzzy_dedupes.R

Usage

1
2
3
fuzzy_dedupes(vec)
fuzzy_dedupes(vec,cutoff_value = 0.1)
fuzzy_dedupes(vec,find_cutoff = TRUE)

Arguments

vec

input vector

find_cutoff

Boolean value; if TRUE, the calculation will return a dataframe with possible 'best_fit' suggestions. This allows users to choose a reasonable 'cutoff_distance'. Default: FALSE.

cutoff_distance

Maximum similarity between two strings up to which one will be replaced by the other. Default 0.06.

Author(s)

Richard Vogg

Examples

1
2
3
4
vec <- c("hello","hallo","hello","telephone","telefone","telephone")

fuzzy_dedupes(vec,find_cutoff=TRUE)
fuzzy_dedupes(vec,cutoff=0.15)

richardvogg/fuzzymatch documentation built on May 19, 2021, 8:50 a.m.