internals | R Documentation |
These functions and methods are used internally and should usually not be called from outside the package.
begin(x, ...)
nextPairs(x, n = 10000, ...)
clear(x, ...)
blockfldfun(blockfld, phoneticFld, phoneticFun, coln)
getSQLStatement(data1, data2 = data1, con, type, blockFld, excludeFld, strcmpFld, strcmpFun, phoneticFld, phoneticFun)
init_sqlite_extensions(db)
.allows_extensions(db)
.lib_path()
getPatternCounts(x, n = 10000, cutoff = 1, withProgressBar = (sink.number() == 0))
getMatchCount(object)
getNonMatchCount(object)
getNACount(object)
getColumnNames(object, withExcluded = FALSE)
getThresholds(W, M, U, my, ny)
countpattern(x, matching = FALSE)
hasWeights(object)
x, object |
Depends on the function, usually a |
n |
Maximum number of comparison patterns to fetch. |
blockfld |
Blocking definition as in
|
phoneticFld |
Indices of attributes for which phonetic code should be used. |
phoneticFun |
Name of phonetic function as in
|
data1, data2 |
Data frame with records. |
type |
"linkage" or "deduplication", the type of linkage. |
excludeFld |
Indices or names of fields to exclude from comparison. |
strcmpFld |
Indices of attributes for which string comparison should be used. |
strcmpFun |
Name of string comparison function as in
|
con |
A SQLite database connection. |
coln |
Column names of records. |
db |
Database connection. |
cutoff |
Threshold from which string comparison values are rounded to 1. |
withProgressBar |
Whether to display a progress bar. |
withExcluded |
Logical. Whether to output names of excluded columns. |
W,M,U |
Linkage weights, m- and u-probabilities. See |
my,ny |
Error bounds. See |
... |
Optional arguments. |
matching |
If TRUE an additional vector is returned which stores which row belongs to which pattern. |
begin
Sends a query to the database of an "RLBigData" object which creates comparison patterns.
nextPairs
Retrieves the next n comparison patterns.
clear
Closes the result set opened by begin.
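The begin / nextPairs / clear cycle resembles the usual chunked-fetch pattern for database result sets. The sketch below illustrates that pattern with plain DBI and RSQLite calls. It is not the package's implementation: the table name pairs, the columns id1 and id2, and the chunk size are made up for illustration, and it assumes the DBI and RSQLite packages are installed.

```r
library(DBI)

# In-memory SQLite database with a small, made-up table of record pairs
con <- dbConnect(RSQLite::SQLite(), ":memory:")
dbWriteTable(con, "pairs", data.frame(id1 = 1:25, id2 = 26:50))

# Open a result set (the role begin() plays for an "RLBigData" object)
res <- dbSendQuery(con, "SELECT id1, id2 FROM pairs")

total <- 0
while (!dbHasCompleted(res)) {
  # Fetch at most 10 rows per call (the role of nextPairs(x, n = 10))
  chunk <- dbFetch(res, n = 10)
  total <- total + nrow(chunk)
}

# Close the result set (the role of clear())
dbClearResult(res)
dbDisconnect(con)
```

Fetching in chunks keeps memory use bounded when the number of comparison patterns is large.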
blockfldfun
Constructs the part of SQL statement which implements blocking.
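As a rough illustration of how a blocking definition could be turned into an SQL fragment, the sketch below builds a join condition from a list of field vectors. The helper build_blocking_clause and the table aliases t1 and t2 are hypothetical and do not reflect the actual output of blockfldfun. The sketch assumes that all fields within one list element must agree and that list elements are alternatives. Phonetic coding and identifier quoting are left out.

```r
# Hypothetical helper, not the package's blockfldfun()
build_blocking_clause <- function(blockfld, coln) {
  per_criterion <- vapply(blockfld, function(fields) {
    # Accept column indices or column names
    cols <- if (is.numeric(fields)) coln[fields] else fields
    # All fields of one criterion must be equal in both records
    paste(sprintf("t1.%s = t2.%s", cols, cols), collapse = " AND ")
  }, character(1))
  # Alternative criteria are combined with OR
  paste(sprintf("(%s)", per_criterion), collapse = " OR ")
}

clause <- build_blocking_clause(list(1, c(2, 3)), c("fname", "lname", "by"))
print(clause)
```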
getSQLStatement
Constructs SQL statement for retrieving comparison patterns.
init_sqlite_extensions
Load string comparison and phonetic functions into database.
.allows_extensions
Tests whether SQLite engine supports extension functions.
.lib_path
Gets path of shared library of package.
getPatternCounts
Counts binary patterns, implemented through countpattern.
getMatchCount
Gets number of matches from a "RLBigData" object.
getNACount
Gets number of pairs with unknown matching status from a "RLBigData" object.
getDbFile
Returns path of database file for a data or result object.
getColumnNames
Returns column names of a "RLBigData" object as a character vector.
getThresholds
Calculates upper and lower classification thresholds based on error bounds.
countpattern
Modified version of countpattern in package e1071.
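To make the idea of counting binary patterns concrete, here is a minimal base R sketch. It is neither the e1071 code nor this package's modified version. The function name count_binary_patterns is hypothetical, the input is assumed to be a 0/1 matrix without missing values, and the order of the returned patterns follows expand.grid rather than any documented order.

```r
# Hypothetical illustration: count how often each 0/1 row pattern occurs
count_binary_patterns <- function(x, matching = FALSE) {
  x <- as.matrix(x)
  # Encode each row as a string key, e.g. c(1, 0) becomes "10"
  keys <- apply(x, 1, paste, collapse = "")
  # Enumerate all 2^k possible patterns so unseen patterns get a count of 0
  all_patterns <- as.matrix(expand.grid(rep(list(0:1), ncol(x))))
  all_keys <- apply(all_patterns, 1, paste, collapse = "")
  counts <- table(factor(keys, levels = all_keys))
  counts <- setNames(as.integer(counts), all_keys)
  if (matching) {
    # Also report which pattern each input row belongs to
    list(pat = counts, matching = match(keys, all_keys))
  } else {
    counts
  }
}

x <- matrix(c(1, 0,
              1, 0,
              0, 1), ncol = 2, byrow = TRUE)
res <- count_binary_patterns(x, matching = TRUE)
```

With matching = TRUE the second list element gives, for each row of x, the index of its pattern in the count vector, mirroring the matching argument described above.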
hasWeights
Determines whether a data object has weights.
Andreas Borg, Murat Sariyar