RecordLinkage: Record Linkage in R

Provides functions for linking and de-duplicating data sets. Methods based on a stochastic approach are implemented as well as classification algorithms from the machine learning domain.

Author: Andreas Borg <borga@uni-mainz.de>, Murat Sariyar <murat.sariyar@charite.de>
Date of publication: 2016-07-27 11:06:56
Maintainer: Andreas Borg <borga@uni-mainz.de>
License: GPL (>= 2)
Version: 0.4-10
URL: https://r-forge.r-project.org/projects/recordlinkage/, http://journal.r-project.org/archive/2010-2/RJournal_2010-2_Sariyar+Borg.pdf

Man pages

append-methods: Concatenate comparison patterns or classification results

classifySupv: Supervised Classification

classifyUnsup: Unsupervised Classification

clone: Serialization of record linkage object.

deleteNULLs: Remove NULL Values

editMatch: Edit Matching Status

emClassify: Weight-based Classification of Data Pairs

emWeights: Calculate weights

epiClassify: Classify record pairs with EpiLink weights

epiWeights: Calculate EpiLink weights

ff_vector-class: Class "ff_vector"

genSamples: Generate Training Set

getExpectedSize: Estimate number of record pairs.

getFrequencies-methods: Get attribute frequencies

getMinimalTrain: Create a minimal training set

getPairsBackend: Backend function for getPairs

getParetoThreshold: Estimate Threshold from Pareto Distribution

getTable-methods: Build contingency table

gpdEst: Estimate Threshold from Pareto Distribution

internals: Internal functions and methods

isFALSE: Check for FALSE

makeBlockingPairs: Create record pairs from blocks of ids.

mrl: Mean Residual Life Plot

mygllm: Generalized Log-Linear Fitting

optimalThreshold: Optimal Threshold for Record Linkage

phonetics: Phonetic Code

RecLinkClassif-class: Class "RecLinkClassif"

RecLinkResult-class: Class "RecLinkResult"

resample: Safe Sampling

RLBigData-class: Class "RLBigData"

RLBigDataDedup-class: Class "RLBigDataDedup"

RLBigDataLinkage-class: Class "RLBigDataLinkage"

RLResult-class: Class "RLResult"

show: Show an RLBigData object

splitData: Split Data

subset: Subset operator for record linkage objects

texSummary: LaTeX Summary of linkage results

trainSupv: Train a Classifier

unorderedPairs: Create Unordered Pairs

Functions

.allows_extensions Man page
%append% Man page
%append%-methods Man page
%append%,RecLinkData,RecLinkData-method Man page
%append%,RecLinkResult,RecLinkResult-method Man page
begin Man page
begin-methods Man page
begin,RLBigData-method Man page
blockfldfun Man page
classifySupv Man page
classifySupv-methods Man page
classifySupv,RecLinkClassif,RecLinkData-method Man page
classifySupv,RecLinkClassif,RLBigData-method Man page
classifyUnsup Man page
clear Man page
clear-methods Man page
clear,RLBigData-method Man page
clone Man page
clone-methods Man page
clone,RLBigData-method Man page
clone,RLResult-method Man page
countpattern Man page
deleteNULLs Man page
editMatch Man page
editMatch-methods Man page
editMatch,RecLinkData-method Man page
editMatch,RLBigData-method Man page
emClassify Man page
emClassify,RecLinkData,ANY,ANY-method Man page
emClassify,RecLinkData,missing,missing-method Man page
emClassify,RLBigData,ANY,ANY-method Man page
emClassify,RLBigData-method Man page
emClassify,RLBigData,missing,missing-method Man page
emWeights Man page
emWeights-methods Man page
emWeights,RecLinkData-method Man page
emWeights,RLBigData-method Man page
epiClassify Man page
epiClassify-methods Man page
epiClassify,RecLinkData-method Man page
epiClassify,RLBigData-method Man page
epiWeights Man page
epiWeights-methods Man page
epiWeights,RecLinkData-method Man page
epiWeights,RLBigData-method Man page
ff_vector-class Man page
genSamples Man page
getColumnNames Man page
getColumnNames-methods Man page
getColumnNames,RLBigDataDedup-method Man page
getColumnNames,RLBigDataLinkage-method Man page
getExpectedSize Man page
getExpectedSize,data.frame-method Man page
getExpectedSize-methods Man page
getExpectedSize,RLBigDataDedup-method Man page
getExpectedSize,RLBigDataLinkage-method Man page
getFrequencies Man page
getFrequencies-methods Man page
getFrequencies,RLBigData-method Man page
getMatchCount Man page
getMatchCount-methods Man page
getMatchCount,RLBigData-method Man page
getMinimalTrain Man page
getMinimalTrain-methods Man page
getMinimalTrain,RecLinkData-method Man page
getMinimalTrain,RLBigData-method Man page
getNACount Man page
getNACount-methods Man page
getNACount,RLBigData-method Man page
getNonMatchCount Man page
getNonMatchCount-methods Man page
getNonMatchCount,RLBigData-method Man page
getPairsBackend Man page
getParetoThreshold Man page
getParetoThreshold-methods Man page
getParetoThreshold,RecLinkData-method Man page
getParetoThreshold,RLBigData-method Man page
getPatternCounts Man page
getPatternCounts-methods Man page
getPatternCounts,RLBigData-method Man page
getSQLStatement Man page
getSQLStatement-methods Man page
getSQLStatement,RLBigData-method Man page
getTable Man page
getTable-methods Man page
getTable,RecLinkResult-method Man page
getTable,RLResult-method Man page
getThresholds Man page
gpdEst Man page
hasWeights Man page
hasWeights-methods Man page
hasWeights,RecLinkData-method Man page
hasWeights,RLBigData-method Man page
init_sqlite_extensions Man page
isFALSE Man page
.lib_path Man page
loadRLObject Man page
makeBlockingPairs Man page
mrl Man page
mygllm Man page
nextPairs Man page
nextPairs-methods Man page
nextPairs,RLBigData-method Man page
optimalThreshold Man page
optimalThreshold-methods Man page
optimalThreshold,RecLinkData-method Man page
optimalThreshold,RLBigData-method Man page
pho_h Man page
phonetics Man page
plotMRL Man page
RecLinkClassif Man page
RecLinkClassif-class Man page
[.RecLinkData Man page
[.RecLinkResult Man page
RecLinkResult-class Man page
resample Man page
[.RLBigData Man page
RLBigData-class Man page
RLBigDataDedup-class Man page
RLBigDataLinkage-class Man page
[.RLResult Man page
RLResult-class Man page
saveRLObject Man page
saveRLObject-methods Man page
saveRLObject,RLBigData-method Man page
saveRLObject,RLResult-method Man page
show Man page
show,RLBigData-method Man page
soundex Man page
splitData Man page
texSummary Man page
trainSupv Man page
unorderedPairs Man page

Files

RecordLinkage
RecordLinkage/inst
RecordLinkage/inst/doc
RecordLinkage/inst/doc/WeightBased.rnw
RecordLinkage/inst/doc/BigData.rnw
RecordLinkage/inst/doc/EVT.rnw
RecordLinkage/inst/doc/WeightBased.R
RecordLinkage/inst/doc/Supervised.pdf
RecordLinkage/inst/doc/BigData.R
RecordLinkage/inst/doc/EVT.R
RecordLinkage/inst/doc/BigData.pdf
RecordLinkage/inst/doc/Supervised.rnw
RecordLinkage/inst/doc/EVT.pdf
RecordLinkage/inst/doc/WeightBased.pdf
RecordLinkage/inst/doc/Supervised.R
RecordLinkage/src
RecordLinkage/src/soundex.h
RecordLinkage/src/ph_ext.h
RecordLinkage/src/jarowinkler.c
RecordLinkage/src/sqlite3.h
RecordLinkage/src/phonet.h
RecordLinkage/src/sqlite3ext.h
RecordLinkage/src/soundex.c
RecordLinkage/src/mygllm.c
RecordLinkage/src/sqlite_extensions.c
RecordLinkage/src/levenshtein.c
RecordLinkage/src/phonet.c
RecordLinkage/src/makeBlockingPairs.c
RecordLinkage/src/pho_h.c
RecordLinkage/NAMESPACE
RecordLinkage/NEWS
RecordLinkage/data
RecordLinkage/data/RLdata10000.rda
RecordLinkage/data/RLdata500.rda
RecordLinkage/R
RecordLinkage/R/accessor-methods.r
RecordLinkage/R/register-S3-classes.r
RecordLinkage/R/classifySupv-methods.r
RecordLinkage/R/RLBigData-classes.r
RecordLinkage/R/getPairs-methods.r
RecordLinkage/R/em.r
RecordLinkage/R/phonetics.r
RecordLinkage/R/classify.r
RecordLinkage/R/epilink-methods.r
RecordLinkage/R/mygllm.r
RecordLinkage/R/stochastic.r
RecordLinkage/R/internals.r
RecordLinkage/R/em-methods.r
RecordLinkage/R/strcmp.r
RecordLinkage/R/getPairs.r
RecordLinkage/R/summary.r
RecordLinkage/R/tools.r
RecordLinkage/R/genSamples.r
RecordLinkage/R/evt.r
RecordLinkage/R/serialization.r
RecordLinkage/R/onAttach.r
RecordLinkage/R/compare.r
RecordLinkage/R/RLResult-class.r
RecordLinkage/vignettes
RecordLinkage/vignettes/WeightBased.rnw
RecordLinkage/vignettes/BigData.rnw
RecordLinkage/vignettes/EVT.rnw
RecordLinkage/vignettes/Supervised.rnw
RecordLinkage/MD5
RecordLinkage/build
RecordLinkage/build/vignette.rds
RecordLinkage/DESCRIPTION
RecordLinkage/man
RecordLinkage/man/stochastic.rd
RecordLinkage/man/RLBigData-class.Rd
RecordLinkage/man/RLBigData-constructors.rd
RecordLinkage/man/RLResult-class.Rd
RecordLinkage/man/getExpectedSize.Rd
RecordLinkage/man/ff_vector-class.Rd
RecordLinkage/man/show.Rd
RecordLinkage/man/splitData.Rd
RecordLinkage/man/classifyUnsup.Rd
RecordLinkage/man/summary.RLResult.rd
RecordLinkage/man/getFrequencies-methods.Rd
RecordLinkage/man/getMinimalTrain.Rd
RecordLinkage/man/RLBigDataLinkage-class.Rd
RecordLinkage/man/epiClassify.Rd
RecordLinkage/man/RecLinkResult-class.Rd
RecordLinkage/man/emWeights.Rd
RecordLinkage/man/phonetics.Rd
RecordLinkage/man/deleteNULLs.Rd
RecordLinkage/man/genSamples.Rd
RecordLinkage/man/append-methods.Rd
RecordLinkage/man/RecLinkData.object.rd
RecordLinkage/man/makeBlockingPairs.Rd
RecordLinkage/man/summary.rd
RecordLinkage/man/trainSupv.Rd
RecordLinkage/man/getParetoThreshold.Rd
RecordLinkage/man/isFALSE.Rd
RecordLinkage/man/clone.Rd
RecordLinkage/man/resample.Rd
RecordLinkage/man/optimalThreshold.Rd
RecordLinkage/man/getPairs-methods.rd
RecordLinkage/man/gpdEst.Rd
RecordLinkage/man/emClassify.Rd
RecordLinkage/man/strcmp.rd
RecordLinkage/man/internals.Rd
RecordLinkage/man/mygllm.Rd
RecordLinkage/man/ffdf-class.rd
RecordLinkage/man/RecLinkClassif-class.Rd
RecordLinkage/man/unorderedPairs.Rd
RecordLinkage/man/RecLinkData-class.rd
RecordLinkage/man/getErrorMeasures-methods.rd
RecordLinkage/man/classifySupv.Rd
RecordLinkage/man/epiWeights.Rd
RecordLinkage/man/RecLinkResult.object.rd
RecordLinkage/man/editMatch.Rd
RecordLinkage/man/summary.RLBigData.rd
RecordLinkage/man/texSummary.Rd
RecordLinkage/man/subset.Rd
RecordLinkage/man/getTable-methods.Rd
RecordLinkage/man/compare.rd
RecordLinkage/man/RLBigDataDedup-class.Rd
RecordLinkage/man/getPairsBackend.Rd
RecordLinkage/man/RLdata.rd
RecordLinkage/man/mrl.Rd
