Description Usage Arguments Details Value Examples
Finds matches between two or more data sets based on a text variable (code or e-mail) based on Levensthein distances. For a detailed application see the vignette.
1 2 3 4 5 6 |
data |
list of data frames |
... |
further parameters for cmp |
vars |
vector of variables. One for each data frame. |
dmax |
maximal levensthein distance for matching in text variables $l(t_i1,tj2]<dmax$), defaults to |
exclude |
entries to be excluded from the unique values, defaults to |
ignore.case |
if FALSE, the uniques values are case sensitive and if TRUE, case is ignored |
unique.id |
vector of variables which contain a unique ID over all data sets. If not given then |
output |
number of observation to analyse before a progress information is displayed |
cmpfunc |
function for comparison of strings of form |
The result consists of a list with three elements
linea matrix with the line numbers of the matching observations
idna matrix with the common ID ZDV and the original text variables in the data sets
levena matrix with the levenshtein distance between the common ID and the original text variables in the data sets
a list structure with possibly matched observations
a list structure with possibly matched observations
1 2 3 4 5 6 7 8 9 10 |
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.