getExpectedSize: Estimate number of record pairs.

getExpectedSize    R Documentation

Description

Estimates the total number of record pairs generated by a dataset and specified blocking conditions.

Usage

  getExpectedSize(object, ...)

  ## S4 method for signature 'RLBigDataDedup'
  getExpectedSize(object)

  ## S4 method for signature 'RLBigDataLinkage'
  getExpectedSize(object)

  ## S4 method for signature 'data.frame'
  getExpectedSize(object, blockfld = list())

Arguments

object

Either a record linkage object (of class "RLBigDataDedup" or "RLBigDataLinkage") or a dataset given as a data frame.

blockfld

A blocking definition, as in compare.dedup. Used only by the "data.frame" method.

...

Placeholder for additional arguments.

Details

The "RLBigData*" methods are retained only for backward compatibility. Since version 0.4, all record pairs for such objects are generated and stored in a disk file, so these methods return the actual number of record pairs rather than an estimate.

For the "data.frame" method, the estimate is based on the assumption that agreement or disagreement on one attribute is independent of agreement or disagreement on the other attributes.
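
The role of the independence assumption can be illustrated with a minimal base R sketch. It is not the package's implementation: the helper estimate_pairs, the toy data and the rounding are assumptions made for illustration only. Each blocking definition contributes the share of all pairs that agree on it, and the shares are combined as if they were independent.

```r
# Hypothetical helper, not part of RecordLinkage: estimates how many
# distinct record pairs a set of blocking definitions would produce,
# assuming agreement on one definition is independent of the others.
estimate_pairs <- function(data, blockfld = list()) {
  if (!is.list(blockfld)) blockfld <- list(blockfld)
  n <- nrow(data)
  n_all <- n * (n - 1) / 2            # all unordered pairs without blocking
  if (length(blockfld) == 0) return(n_all)

  # Share of all pairs that agree on every column of one blocking definition
  p_agree <- function(cols) {
    key <- do.call(paste, c(unname(as.list(data[cols])), sep = "\r"))
    key <- key[complete.cases(data[cols])]   # missing values never agree
    counts <- as.numeric(table(key))
    sum(counts * (counts - 1) / 2) / n_all
  }

  p <- vapply(blockfld, p_agree, numeric(1))
  # A pair is kept if at least one blocking definition matches it;
  # under independence, P(no match) is the product of the (1 - p) terms.
  round(n_all * (1 - prod(1 - p)))
}

# Toy data, invented for this example
toy <- data.frame(
  fname = c("ANNA", "ANNA", "PETER", "PETER", "KARL", "KARL"),
  lname = c("MEIER", "MEIER", "MEIER", "BAUER", "BAUER", "BAUER"),
  stringsAsFactors = FALSE
)

estimate_pairs(toy)                          # no blocking: all pairs
estimate_pairs(toy, list("fname"))           # one blocking definition
estimate_pairs(toy, list("fname", "lname"))  # two blocking definitions
```

With a single blocking definition the sketch simply counts the agreeing pairs. With several definitions the result is only an estimate, because pairs that agree on more than one definition are accounted for through the independence assumption rather than counted exactly.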

blockfld is a blocking definition in the same format as the one accepted by RLBigDataDedup.

Value

The expected number of record pairs.

Author(s)

Andreas Borg, Murat Sariyar


RecordLinkage documentation built on Nov. 10, 2022, 5:42 p.m.