| fuzzy_join | R Documentation |
Joins two tables using approximate string matching on key columns. Optionally blocks by a second column (e.g., genus) for performance — only rows sharing the same blocking key are compared.
fuzzy_join(
  x,
  y,
  by,
  method = "dl",
  max_dist = 0.2,
  block_by = NULL,
  n_threads = 4L,
  suffix = ".y"
)
x |
A |
y |
A |
by |
A named character vector of length 1: |
method |
Character. Distance algorithm: |
max_dist |
Numeric. Maximum normalized distance (0-1) to keep a match. Default 0.2. |
block_by |
Optional named character vector of length 1:
|
n_threads |
Integer. Number of OpenMP threads for parallel distance computation over partitions. Default 4L. |
suffix |
Character. Suffix appended to build-side column names that collide with probe-side names. Default ".y". |
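The max_dist cutoff can be pictured with a short base R sketch. This is not the package's implementation. It assumes that "normalized" means the edit distance divided by the length of the longer string, and it substitutes utils::adist() (plain Levenshtein, no transpositions) for the "dl" method. The helper norm_dist and the sample names are hypothetical.

```r
# Sketch only: adist() is plain Levenshtein, standing in for method = "dl".
norm_dist <- function(a, b) {
  d <- utils::adist(a, b)                     # length(a) x length(b) edit counts
  longest <- outer(nchar(a), nchar(b), pmax)  # longer string length for each pair
  d / pmax(longest, 1)                        # scale to [0, 1]; avoids 0/0 for empty strings
}

probe <- c("Quercus robur", "Fagus sylvatica")
build <- c("Quercus robor", "Fagus silvatica", "Pinus nigra")

d <- norm_dist(probe, build)
keep <- d <= 0.2   # same threshold as the max_dist default
print(round(d, 3))
print(keep)
```

Under this assumed normalization, a single-character typo in a 13-character name gives a distance of 1/13, well inside the default cutoff, while unrelated names fall outside it.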
A vectra_node with all probe columns, all build columns (suffixed
on collision), and a fuzzy_dist column (double).
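The shape of the result (probe columns, suffixed build columns, and a fuzzy_dist column) and the effect of blocking can be sketched in base R on plain data frames. This is an illustration under stated assumptions, not the vectra code path: blocked_fuzzy is a hypothetical helper, both tables are assumed to use the same names for the key and blocking columns, the distance is Levenshtein via utils::adist() rather than "dl", and normalization by the longer string's length is assumed.

```r
# Hypothetical stand-ins for the probe (x) and build (y) tables.
x <- data.frame(genus = c("Quercus", "Fagus"),
                name  = c("Quercus robur", "Fagus sylvatica"),
                stringsAsFactors = FALSE)
y <- data.frame(genus = c("Quercus", "Fagus", "Pinus"),
                name  = c("Quercus robor", "Fagus silvatica", "Pinus nigra"),
                stringsAsFactors = FALSE)

blocked_fuzzy <- function(x, y, key, block, max_dist = 0.2, suffix = ".y") {
  # Rename build-side columns that collide with probe-side names.
  clash <- names(y) %in% names(x)
  names(y)[clash] <- paste0(names(y)[clash], suffix)
  y_key   <- paste0(key, suffix)    # assumes the key name collides
  y_block <- paste0(block, suffix)  # assumes the block name collides

  # Compare only rows that share a blocking value.
  pieces <- lapply(unique(x[[block]]), function(b) {
    xb <- x[x[[block]] == b, , drop = FALSE]
    yb <- y[y[[y_block]] == b, , drop = FALSE]
    if (nrow(yb) == 0L) return(NULL)
    d <- utils::adist(xb[[key]], yb[[y_key]]) /
      pmax(outer(nchar(xb[[key]]), nchar(yb[[y_key]]), pmax), 1)
    hit <- which(d <= max_dist, arr.ind = TRUE)
    if (nrow(hit) == 0L) return(NULL)
    cbind(xb[hit[, "row"], , drop = FALSE],
          yb[hit[, "col"], , drop = FALSE],
          fuzzy_dist = d[hit])
  })
  # Returns NULL when no block yields a match.
  do.call(rbind, Filter(Negate(is.null), pieces))
}

res <- blocked_fuzzy(x, y, key = "name", block = "genus")
print(res)
```

Because the Pinus row shares no genus with any probe row, it is never compared, which is the saving that blocking is meant to provide.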