RecordLinkage: Record Linkage in R

Provides functions for linking and de-duplicating data sets. Methods based on a stochastic approach are implemented as well as classification algorithms from the machine learning domain.

Author: Andreas Borg <borga@uni-mainz.de>, Murat Sariyar <murat.sariyar@charite.de>
Date of publication: 2016-07-27 11:06:56
Maintainer: Andreas Borg <borga@uni-mainz.de>
License: GPL (>= 2)
Version: 0.4-10
URL: https://r-forge.r-project.org/projects/recordlinkage/, http://journal.r-project.org/archive/2010-2/RJournal_2010-2_Sariyar+Borg.pdf

Man pages

append-methods: Concatenate comparison patterns or classification results

classifySupv: Supervised Classification

classifyUnsup: Unsupervised Classification

clone: Serialization of record linkage object.

deleteNULLs: Remove NULL Values

editMatch: Edit Matching Status

emClassify: Weight-based Classification of Data Pairs

emWeights: Calculate weights

epiClassify: Classify record pairs with EpiLink weights

epiWeights: Calculate EpiLink weights

ff_vector-class: Class '"ff_vector"'

genSamples: Generate Training Set

getExpectedSize: Estimate number of record pairs.

getFrequencies-methods: Get attribute frequencies

getMinimalTrain: Create a minimal training set

getPairsBackend: Backend function for getPairs

getParetoThreshold: Estimate Threshold from Pareto Distribution

getTable-methods: Build contingency table

gpdEst: Estimate Threshold from Pareto Distribution

internals: Internal functions and methods

isFALSE: Check for FALSE

makeBlockingPairs: Create record pairs from blocks of ids.

mrl: Mean Residual Life Plot

mygllm: Generalized Log-Linear Fitting

optimalThreshold: Optimal Threshold for Record Linkage

phonetics: Phonetic Code

RecLinkClassif-class: Class "RecLinkClassif"

RecLinkResult-class: Class "RecLinkResult"

resample: Safe Sampling

RLBigData-class: Class "RLBigData"

RLBigDataDedup-class: Class "RLBigDataDedup"

RLBigDataLinkage-class: Class "RLBigDataLinkage"

RLResult-class: Class "RLResult"

show: Show a RLBigData object

splitData: Split Data

subset: Subset operator for record linkage objects

texSummary: LaTeX Summary of linkage results

trainSupv: Train a Classifier

unorderedPairs: Create Unordered Pairs

Files in this package

RecordLinkage
RecordLinkage/inst
RecordLinkage/inst/doc
RecordLinkage/inst/doc/WeightBased.rnw
RecordLinkage/inst/doc/BigData.rnw
RecordLinkage/inst/doc/EVT.rnw
RecordLinkage/inst/doc/WeightBased.R
RecordLinkage/inst/doc/Supervised.pdf
RecordLinkage/inst/doc/BigData.R
RecordLinkage/inst/doc/EVT.R
RecordLinkage/inst/doc/BigData.pdf
RecordLinkage/inst/doc/Supervised.rnw
RecordLinkage/inst/doc/EVT.pdf
RecordLinkage/inst/doc/WeightBased.pdf
RecordLinkage/inst/doc/Supervised.R
RecordLinkage/src
RecordLinkage/src/soundex.h
RecordLinkage/src/ph_ext.h
RecordLinkage/src/jarowinkler.c
RecordLinkage/src/sqlite3.h
RecordLinkage/src/phonet.h
RecordLinkage/src/sqlite3ext.h
RecordLinkage/src/soundex.c
RecordLinkage/src/mygllm.c
RecordLinkage/src/sqlite_extensions.c
RecordLinkage/src/levenshtein.c
RecordLinkage/src/phonet.c
RecordLinkage/src/makeBlockingPairs.c
RecordLinkage/src/pho_h.c
RecordLinkage/NAMESPACE
RecordLinkage/NEWS
RecordLinkage/data
RecordLinkage/data/RLdata10000.rda
RecordLinkage/data/RLdata500.rda
RecordLinkage/R
RecordLinkage/R/accessor-methods.r
RecordLinkage/R/register-S3-classes.r
RecordLinkage/R/classifySupv-methods.r
RecordLinkage/R/RLBigData-classes.r
RecordLinkage/R/getPairs-methods.r
RecordLinkage/R/em.r
RecordLinkage/R/phonetics.r
RecordLinkage/R/classify.r
RecordLinkage/R/epilink-methods.r
RecordLinkage/R/mygllm.r
RecordLinkage/R/stochastic.r
RecordLinkage/R/internals.r
RecordLinkage/R/em-methods.r
RecordLinkage/R/strcmp.r
RecordLinkage/R/getPairs.r
RecordLinkage/R/summary.r
RecordLinkage/R/tools.r
RecordLinkage/R/genSamples.r
RecordLinkage/R/evt.r
RecordLinkage/R/serialization.r
RecordLinkage/R/onAttach.r
RecordLinkage/R/compare.r
RecordLinkage/R/RLResult-class.r
RecordLinkage/vignettes
RecordLinkage/vignettes/WeightBased.rnw
RecordLinkage/vignettes/BigData.rnw
RecordLinkage/vignettes/EVT.rnw
RecordLinkage/vignettes/Supervised.rnw
RecordLinkage/MD5
RecordLinkage/build
RecordLinkage/build/vignette.rds
RecordLinkage/DESCRIPTION
RecordLinkage/man
RecordLinkage/man/stochastic.rd
RecordLinkage/man/RLBigData-class.Rd
RecordLinkage/man/RLBigData-constructors.rd
RecordLinkage/man/RLResult-class.Rd
RecordLinkage/man/getExpectedSize.Rd
RecordLinkage/man/ff_vector-class.Rd
RecordLinkage/man/show.Rd
RecordLinkage/man/splitData.Rd
RecordLinkage/man/classifyUnsup.Rd
RecordLinkage/man/summary.RLResult.rd
RecordLinkage/man/getFrequencies-methods.Rd
RecordLinkage/man/getMinimalTrain.Rd
RecordLinkage/man/RLBigDataLinkage-class.Rd
RecordLinkage/man/epiClassify.Rd
RecordLinkage/man/RecLinkResult-class.Rd
RecordLinkage/man/emWeights.Rd
RecordLinkage/man/phonetics.Rd
RecordLinkage/man/deleteNULLs.Rd
RecordLinkage/man/genSamples.Rd
RecordLinkage/man/append-methods.Rd
RecordLinkage/man/RecLinkData.object.rd
RecordLinkage/man/makeBlockingPairs.Rd
RecordLinkage/man/summary.rd
RecordLinkage/man/trainSupv.Rd
RecordLinkage/man/getParetoThreshold.Rd
RecordLinkage/man/isFALSE.Rd
RecordLinkage/man/clone.Rd
RecordLinkage/man/resample.Rd
RecordLinkage/man/optimalThreshold.Rd
RecordLinkage/man/getPairs-methods.rd
RecordLinkage/man/gpdEst.Rd
RecordLinkage/man/emClassify.Rd
RecordLinkage/man/strcmp.rd
RecordLinkage/man/internals.Rd
RecordLinkage/man/mygllm.Rd
RecordLinkage/man/ffdf-class.rd
RecordLinkage/man/RecLinkClassif-class.Rd
RecordLinkage/man/unorderedPairs.Rd
RecordLinkage/man/RecLinkData-class.rd
RecordLinkage/man/getErrorMeasures-methods.rd
RecordLinkage/man/classifySupv.Rd
RecordLinkage/man/epiWeights.Rd
RecordLinkage/man/RecLinkResult.object.rd
RecordLinkage/man/editMatch.Rd
RecordLinkage/man/summary.RLBigData.rd
RecordLinkage/man/texSummary.Rd
RecordLinkage/man/subset.Rd
RecordLinkage/man/getTable-methods.Rd
RecordLinkage/man/compare.rd
RecordLinkage/man/RLBigDataDedup-class.Rd
RecordLinkage/man/getPairsBackend.Rd
RecordLinkage/man/RLdata.rd
RecordLinkage/man/mrl.Rd

All documentation is copyright its authors; we didn't write any of that.