RecordLinkage: Record Linkage in R

Provides functions for linking and de-duplicating data sets. It implements methods based on a stochastic approach as well as classification algorithms from the machine learning domain.

Author
Andreas Borg <borga@uni-mainz.de>, Murat Sariyar <murat.sariyar@charite.de>
Date of publication
2016-07-27 11:06:56
Maintainer
Andreas Borg <borga@uni-mainz.de>
License
GPL (>= 2)
Version
0.4-10

Man pages

append-methods
Concatenate comparison patterns or classification results
classifySupv
Supervised Classification
classifyUnsup
Unsupervised Classification
clone
Serialization of record linkage object.
deleteNULLs
Remove NULL Values
editMatch
Edit Matching Status
emClassify
Weight-based Classification of Data Pairs
emWeights
Calculate weights
epiClassify
Classify record pairs with EpiLink weights
epiWeights
Calculate EpiLink weights
ff_vector-class
Class '"ff_vector"'
genSamples
Generate Training Set
getExpectedSize
Estimate number of record pairs.
getFrequencies-methods
Get attribute frequencies
getMinimalTrain
Create a minimal training set
getPairsBackend
Backend function for getPairs
getParetoThreshold
Estimate Threshold from Pareto Distribution
getTable-methods
Build contingency table
gpdEst
Estimate Threshold from Pareto Distribution
internals
Internal functions and methods
isFALSE
Check for FALSE
makeBlockingPairs
Create record pairs from blocks of ids.
mrl
Mean Residual Life Plot
mygllm
Generalized Log-Linear Fitting
optimalThreshold
Optimal Threshold for Record Linkage
phonetics
Phonetic Code
RecLinkClassif-class
Class "RecLinkClassif"
RecLinkResult-class
Class "RecLinkResult"
resample
Safe Sampling
RLBigData-class
Class "RLBigData"
RLBigDataDedup-class
Class "RLBigDataDedup"
RLBigDataLinkage-class
Class "RLBigDataLinkage"
RLResult-class
Class "RLResult"
show
Show an RLBigData object
splitData
Split Data
subset
Subset operator for record linkage objects
texSummary
LaTeX Summary of linkage results
trainSupv
Train a Classifier
unorderedPairs
Create Unordered Pairs

Files in this package

RecordLinkage
RecordLinkage/inst
RecordLinkage/inst/doc
RecordLinkage/inst/doc/WeightBased.rnw
RecordLinkage/inst/doc/BigData.rnw
RecordLinkage/inst/doc/EVT.rnw
RecordLinkage/inst/doc/WeightBased.R
RecordLinkage/inst/doc/Supervised.pdf
RecordLinkage/inst/doc/BigData.R
RecordLinkage/inst/doc/EVT.R
RecordLinkage/inst/doc/BigData.pdf
RecordLinkage/inst/doc/Supervised.rnw
RecordLinkage/inst/doc/EVT.pdf
RecordLinkage/inst/doc/WeightBased.pdf
RecordLinkage/inst/doc/Supervised.R
RecordLinkage/src
RecordLinkage/src/soundex.h
RecordLinkage/src/ph_ext.h
RecordLinkage/src/jarowinkler.c
RecordLinkage/src/sqlite3.h
RecordLinkage/src/phonet.h
RecordLinkage/src/sqlite3ext.h
RecordLinkage/src/soundex.c
RecordLinkage/src/mygllm.c
RecordLinkage/src/sqlite_extensions.c
RecordLinkage/src/levenshtein.c
RecordLinkage/src/phonet.c
RecordLinkage/src/makeBlockingPairs.c
RecordLinkage/src/pho_h.c
RecordLinkage/NAMESPACE
RecordLinkage/NEWS
RecordLinkage/data
RecordLinkage/data/RLdata10000.rda
RecordLinkage/data/RLdata500.rda
RecordLinkage/R
RecordLinkage/R/accessor-methods.r
RecordLinkage/R/register-S3-classes.r
RecordLinkage/R/classifySupv-methods.r
RecordLinkage/R/RLBigData-classes.r
RecordLinkage/R/getPairs-methods.r
RecordLinkage/R/em.r
RecordLinkage/R/phonetics.r
RecordLinkage/R/classify.r
RecordLinkage/R/epilink-methods.r
RecordLinkage/R/mygllm.r
RecordLinkage/R/stochastic.r
RecordLinkage/R/internals.r
RecordLinkage/R/em-methods.r
RecordLinkage/R/strcmp.r
RecordLinkage/R/getPairs.r
RecordLinkage/R/summary.r
RecordLinkage/R/tools.r
RecordLinkage/R/genSamples.r
RecordLinkage/R/evt.r
RecordLinkage/R/serialization.r
RecordLinkage/R/onAttach.r
RecordLinkage/R/compare.r
RecordLinkage/R/RLResult-class.r
RecordLinkage/vignettes
RecordLinkage/vignettes/WeightBased.rnw
RecordLinkage/vignettes/BigData.rnw
RecordLinkage/vignettes/EVT.rnw
RecordLinkage/vignettes/Supervised.rnw
RecordLinkage/MD5
RecordLinkage/build
RecordLinkage/build/vignette.rds
RecordLinkage/DESCRIPTION
RecordLinkage/man
RecordLinkage/man/stochastic.rd
RecordLinkage/man/RLBigData-class.Rd
RecordLinkage/man/RLBigData-constructors.rd
RecordLinkage/man/RLResult-class.Rd
RecordLinkage/man/getExpectedSize.Rd
RecordLinkage/man/ff_vector-class.Rd
RecordLinkage/man/show.Rd
RecordLinkage/man/splitData.Rd
RecordLinkage/man/classifyUnsup.Rd
RecordLinkage/man/summary.RLResult.rd
RecordLinkage/man/getFrequencies-methods.Rd
RecordLinkage/man/getMinimalTrain.Rd
RecordLinkage/man/RLBigDataLinkage-class.Rd
RecordLinkage/man/epiClassify.Rd
RecordLinkage/man/RecLinkResult-class.Rd
RecordLinkage/man/emWeights.Rd
RecordLinkage/man/phonetics.Rd
RecordLinkage/man/deleteNULLs.Rd
RecordLinkage/man/genSamples.Rd
RecordLinkage/man/append-methods.Rd
RecordLinkage/man/RecLinkData.object.rd
RecordLinkage/man/makeBlockingPairs.Rd
RecordLinkage/man/summary.rd
RecordLinkage/man/trainSupv.Rd
RecordLinkage/man/getParetoThreshold.Rd
RecordLinkage/man/isFALSE.Rd
RecordLinkage/man/clone.Rd
RecordLinkage/man/resample.Rd
RecordLinkage/man/optimalThreshold.Rd
RecordLinkage/man/getPairs-methods.rd
RecordLinkage/man/gpdEst.Rd
RecordLinkage/man/emClassify.Rd
RecordLinkage/man/strcmp.rd
RecordLinkage/man/internals.Rd
RecordLinkage/man/mygllm.Rd
RecordLinkage/man/ffdf-class.rd
RecordLinkage/man/RecLinkClassif-class.Rd
RecordLinkage/man/unorderedPairs.Rd
RecordLinkage/man/RecLinkData-class.rd
RecordLinkage/man/getErrorMeasures-methods.rd
RecordLinkage/man/classifySupv.Rd
RecordLinkage/man/epiWeights.Rd
RecordLinkage/man/RecLinkResult.object.rd
RecordLinkage/man/editMatch.Rd
RecordLinkage/man/summary.RLBigData.rd
RecordLinkage/man/texSummary.Rd
RecordLinkage/man/subset.Rd
RecordLinkage/man/getTable-methods.Rd
RecordLinkage/man/compare.rd
RecordLinkage/man/RLBigDataDedup-class.Rd
RecordLinkage/man/getPairsBackend.Rd
RecordLinkage/man/RLdata.rd
RecordLinkage/man/mrl.Rd