Description Usage Arguments Details Value Examples
Finds matches between two or more data sets based on a text variable (code or e-mail) based on Levensthein distances. For a detailed application see the vignette.
1 2 3 4 5 6 |
data |
list of data frames |
... |
further parameters for cmp |
vars |
vector of variables. One for each data frame. |
dmax |
maximal levensthein distance for matching in text variables $l(t_i1,tj2]<dmax$), defaults to |
exclude |
entries to be excluded from the unique values, defaults to |
ignore.case |
if FALSE, the uniques values are case sensitive and if TRUE, case is ignored |
unique.id |
vector of variables which contain a unique ID over all data sets. If not given then |
output |
number of observation to analyse before a progress information is displayed |
cmpfunc |
function for comparison of strings of form |
The result consists of a list with three elements
line
a matrix with the line numbers of the matching observations
idn
a matrix with the common ID ZDV
and the original text variables in the data sets
leven
a matrix with the levenshtein distance between the common ID and the original text variables in the data sets
a list structure with possibly matched observations
a list structure with possibly matched observations
1 2 3 4 5 6 7 8 9 10 |
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.