verify_ids | R Documentation |
Compares demographic information across datasets to determine
if the entity identified with ID x
is the same across all
datasets.
verify_ids(
dat_list,
id_col,
unique_id_col,
file = NULL,
database_col = "database",
variables = NULL,
tolerances = NULL,
extra_metrics = NULL,
extra_cols = NULL,
verbose = TRUE,
...
)
dat_list |
A named list of |
id_col |
The name of the ID, or primary key, column. For consistency, should be the same across datasets. |
unique_id_col |
The name of the row ID, or surrogate key, column. For consistency, should be the same across datasets. |
file |
If not |
database_col |
The column name to store the |
variables |
A character vector of integer or character columns to be used for comparison across datasets. |
tolerances |
If not
|
extra_metrics |
A |
extra_cols |
A character vector of columns to be included in the output verification spreadsheet, mainly for reference and support during manual inspection |
verbose |
Enables logging |
... |
Extra parameters passed to |
A data.frame
in the fix format
if (FALSE) {
anara::verify_ids(
list(
database1 = dat_1,
database2 = dat_2
),
id_col = "participant_id",
unique_id_col = "unique_id",
variables = c("female", "grade", "teacher_name", "form"),
tolerances = list(
form = 0,
teacher_name = 0.05
),
extra_cols = c(
"start", "end",
"incdnt_01", "incdnt_01_o", "incdnt_02", "incdnt_02_o"
),
file = file.path("path", "to", "issues.csv")
)
}
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.