gammaCK2par: gammaCK2par

View source: R/gammaCK2par.R

gammaCK2parR Documentation

gammaCK2par

Description

Field comparisons for string variables. Two possible agreement patterns are considered: 0 total disagreement, 2 agreement. The distance between strings is calculated using a Jaro-Winkler distance.

Usage

gammaCK2par(matAp, matBp, n.cores, cut.a, method, w)

Arguments

matAp

vector storing the comparison field in data set 1

matBp

vector storing the comparison field in data set 2

n.cores

Number of cores to parallelize over. Default is NULL.

cut.a

Lower bound for full match, ranging between 0 and 1. Default is 0.92

method

String distance method, options are: "jw" Jaro-Winkler (Default), "dl" Damerau-Levenshtein, "jaro" Jaro, and "lv" Edit

w

Parameter that describes the importance of the first characters of a string (only needed if method = "jw"). Default is .10

Value

gammaCK2par returns a list with the indices corresponding to each matching pattern, which can be fed directly into tableCounts and matchesLink.

Author(s)

Ted Enamorado <ted.enamorado@gmail.com>, Ben Fifield <benfifield@gmail.com>, and Kosuke Imai

Examples

## Not run: 
g1 <- gammaCK2par(dfA$firstname, dfB$lastname)

## End(Not run)

kosukeimai/fastLink documentation built on Nov. 17, 2023, 8:11 p.m.