BlockAndCompareCombinations: Block and then Compare Unique Combinations of Records

Description Usage Arguments Value

View source: R/comparing.R

Description

Block by substrings of any length of multiple variables from a dataset and then compare using any comparison function on any set of variables

Usage

1
2
BlockAndCompareCombinations(RLdata, var.names, n.chars = NULL, ids = NULL,
  variables.to.match = NULL, string.comparators = NULL)

Arguments

RLdata

Dataset containing records to be matched

var.names

Vector of strings of variables you want to block by

n.chars

Vector of number of the characters you want to compare, respective to var.names

ids

True unique identification vector of length nrow(RLdata)

variables.to.match

Vector of strings containing the variables of interest for this linkage. Default is all variables in RLdata. Can repeat variables to use different comparators on same variable.

string.comparators

Vector of strings containing the comparators wanted for each variable. Default is jarowinkler. Should be same length as variables.to.match.

Value

A list of data and comparisons for each block

block.comparison.lists

list of datasets corresponding to the linkage comparisons for each block of data

block.data

list of blocked datasets


kaylafrisoli/ActiveRL documentation built on May 20, 2019, 7:53 a.m.