Run the entire SNIP deduplication algorithm.
pedigrees |
Pedigree data to deduplicate. |
requestID |
Column that has the ID for the family. |
isProband |
Column that indicates the proband. |
keyVars |
Character vector of column names for the variables in the sort key |
keyVars.male |
Optional character vector of column names for the variables in the sort key that are specific to males |
keyVars.female |
Optional character vector of column names for the variables in the sort key that are specific to females |
keyWt |
Numeric vector of weights assigned to variables in the sort key, corresponding
to |
blockVar |
Vector of column names for the blocking variables, where families in different blocks will not be considered when searching for duplicates. |
repSN |
Number of iterations when sorting neighbors according to the sort key |
windowSN |
Integer representing the size of the sliding window to use during sorted neighbors. |
keyLength |
Numeric vector representing the number of key variables
(out of |
method |
If "intersection", we use the intersection score. If "greedy", we use the greedy match score. If "both", we use both. |
thresh |
Vector of thresholds. If |
priority |
A list of structure (var = 'Varx', min = TRUE) with 'Varx'
being a character value corresponding to a column in rawData.
This parameter determines how to sort the duplicates.
If |
dateFormat |
Character string of the format of the date. This is only used
if the priority variable is a date. The format should match the formats of
class |
printRuntime |
If TRUE, prints the runtime. |
seed |
Seed for the random number generator. |
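The blockVar and windowSN arguments describe a blocked sorted-neighbors search. The following is a minimal base-R sketch of that general technique, not the package's implementation: the toy data, the column names (id, block, key) and the helper sorted_neighbor_pairs are all hypothetical. It assumes a single precomputed sort key per record and pairs each record only with the records that follow it inside the sliding window, never across blocks.

```r
# Toy records; the column names are illustrative, not taken from the package.
records <- data.frame(
  id    = 1:6,
  block = c("A", "A", "A", "B", "B", "B"),
  key   = c("smith1950", "smith1951", "jones1970",
            "smith1950", "brown1982", "brown1983"),
  stringsAsFactors = FALSE
)

# Hypothetical helper: within each block, sort records by key and pair each
# record with the next (window - 1) records in sorted order.
sorted_neighbor_pairs <- function(df, window = 2) {
  pairs <- list()
  for (b in unique(df$block)) {
    in_block <- df$block == b
    ids <- df$id[in_block][order(df$key[in_block])]
    n <- length(ids)
    for (i in seq_len(n)) {
      j_max <- min(i + window - 1, n)
      if (j_max > i) {
        for (j in (i + 1):j_max) {
          pairs[[length(pairs) + 1]] <- c(ids[i], ids[j])
        }
      }
    }
  }
  # One row per candidate pair (NULL if no pairs were generated).
  do.call(rbind, pairs)
}

candidates <- sorted_neighbor_pairs(records, window = 2)
print(candidates)
```

Records 1 and 4 share the same key but sit in different blocks, so they are never compared; this is the trade-off that blocking makes in exchange for fewer comparisons. A larger window produces more candidate pairs per block.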
An object of class Duplicates
containing the duplicate entities
and representatives for each duplicate entity (including singletons without duplicates).
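As an illustration of how a priority of the form list(var = 'Varx', min = TRUE) could select one representative per duplicate entity, here is a small base-R sketch. The internal layout of the Duplicates class is not described on this page, so the data frame, the entity and visitNo columns, and the helper pick_representatives are assumptions made for the example only. The sketch also assumes a numeric priority variable.

```r
# Hypothetical input: `entity` labels which records were judged duplicates.
dups <- data.frame(
  id      = 1:5,
  entity  = c(1, 1, 2, 3, 3),
  visitNo = c(4, 2, 7, 1, 9)
)
priority <- list(var = "visitNo", min = TRUE)

# Hypothetical helper: keep the record with the smallest (min = TRUE) or
# largest (min = FALSE) value of the priority variable within each entity.
pick_representatives <- function(df, priority) {
  value <- df[[priority$var]]
  if (!priority$min) value <- -value   # flip sign so the largest sorts first
  sorted <- df[order(df$entity, value), ]
  sorted[!duplicated(sorted$entity), ]
}

reps <- pick_representatives(dups, priority)
print(reps$id)
```

Entity 2 has a single record, so it is its own representative, which matches the note that singletons without duplicates are included.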