RecLinkData.object | R Documentation |
S3 class representing information about record pairs for Record
Linkage, as returned by functions compare.dedup
and
compare.linkage
.
A list with at least the following elements:
data
(for type = "deduplication"
):Object of class
"data.frame"
. Data frame of original records.
data1
, data2
(for type = "linkage"
):Objects of class
"data.frame"
. Data frames of original records.
pairs
:Object of class "data.frame"
Data frame of data pairs. Each row represents the comparison pattern of two records,
identified by columns id1
and id2
. The other columns contain for each
considered attribute a real number in the range [0..1] representing the degree of
similarity. These columns are named according to the respective columns in
data
. The last column contains the matching status of the pair,
coded as 1 for a match or 0 for a non-match.
frequencies
:Object of class "numeric"
Numeric vector with average frequency of values for each column
included in pairs
(reciprocal of number of distinct values).
type
:Object of class "character"
Identifies whether a linkage
("linkage"
) or a deduplication ("deduplication"
) project is
represented.
.S3class
:Internal slot.
The following elements are optional:
M
:Object of class "numeric"
Vector of m-probabilities as calculated by emWeights
.
U
:Object of class "numeric"
Vector of u-probabilities as calculated by emWeights
.
W
:Object of class "numeric"
Vector of log-likelihood weights as calculated by emWeights
,
corresponding to binary comparison patterns as created by
bincombinations
.
Wdata
:Object of class "numeric"
Vector of log-likelihood weights as calculated by emWeights
,
corresponding to the rows of pairs
.
Andreas Borg, Murat Sariyar
"RecLinkData"
for the S4 representation.
compare.dedup
, which creates objects of this class.
"RLBigData"
, an alternative data structure suitable for
big data sets.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.