View source: R/executeDqChecks.R
| executeDqChecks | R Documentation |
This function connects to the database, generates the SQL scripts, and runs the data quality checks against the database. By default, results are written to a JSON file as well as to a database table.
executeDqChecks(
  connectionDetails,
  cdmDatabaseSchema,
  resultsDatabaseSchema,
  vocabDatabaseSchema = cdmDatabaseSchema,
  cdmSourceName,
  numThreads = 1,
  sqlOnly = FALSE,
  sqlOnlyUnionCount = 1,
  sqlOnlyIncrementalInsert = FALSE,
  outputFolder,
  outputFile = "",
  verboseMode = FALSE,
  writeToTable = TRUE,
  writeTableName = "dqdashboard_results",
  writeToCsv = FALSE,
  csvFile = "",
  checkLevels = c("TABLE", "FIELD", "CONCEPT"),
  checkNames = c(),
  checkSeverity = c("fatal", "convention", "characterization"),
  cohortDefinitionId = c(),
  cohortDatabaseSchema = resultsDatabaseSchema,
  cohortTableName = "cohort",
  tablesToExclude = c("CONCEPT", "VOCABULARY", "CONCEPT_ANCESTOR",
    "CONCEPT_RELATIONSHIP", "CONCEPT_CLASS", "CONCEPT_SYNONYM", "RELATIONSHIP", "DOMAIN"),
  cdmVersion = "5.3",
  tableCheckThresholdLoc = "default",
  fieldCheckThresholdLoc = "default",
  conceptCheckThresholdLoc = "default"
)
connectionDetails |
A connectionDetails object for connecting to the CDM database |
cdmDatabaseSchema |
The fully qualified database name of the CDM schema |
resultsDatabaseSchema |
The fully qualified database name of the results schema |
vocabDatabaseSchema |
The fully qualified database name of the vocabulary schema (defaults to cdmDatabaseSchema) |
cdmSourceName |
The name of the CDM data source |
numThreads |
The number of concurrent threads to use to execute the queries |
sqlOnly |
Should the SQL statements be executed (FALSE) or just returned (TRUE)? |
sqlOnlyUnionCount |
(OPTIONAL) In sqlOnlyIncrementalInsert mode, the number of SQL commands to union in each query that inserts check results into the results table (can speed up processing when queries are run in parallel). Default is 1. |
sqlOnlyIncrementalInsert |
(OPTIONAL) In sqlOnly mode, Boolean to determine whether to generate SQL queries that insert check results and associated metadata into the results table. Default is FALSE (for backwards compatibility with versions <= v2.2.0). |
outputFolder |
The folder to which logs, SQL files, and the JSON results file are written |
outputFile |
(OPTIONAL) File to which the results JSON object is written |
verboseMode |
Boolean to determine if the console will show all execution steps. Default is FALSE |
writeToTable |
Boolean to indicate if the check results will be written to the results table (named by writeTableName, dqdashboard_results by default) in the resultsDatabaseSchema. Default is TRUE |
writeTableName |
The name of the results table. Defaults to 'dqdashboard_results'. Used when sqlOnly or writeToTable is TRUE. |
writeToCsv |
Boolean to indicate if the check results will be written to a CSV file. Default is FALSE |
csvFile |
(OPTIONAL) CSV file to which results are written |
checkLevels |
Choose which DQ check levels to execute. Default is all 3 (TABLE, FIELD, CONCEPT) |
checkNames |
(OPTIONAL) Choose which check names to execute. Names can be found in inst/csv/OMOP_CDM_v[cdmVersion]_Check_Descriptions.csv. Note that "cdmTable", "cdmField" and "measureValueCompleteness" are always executed. |
checkSeverity |
Choose which DQ check severity levels to execute. Default is all 3 (fatal, convention, characterization) |
cohortDefinitionId |
The cohort definition id for the cohort you wish to run the DQD on. The package assumes a standard OHDSI cohort table with the fields cohort_definition_id and subject_id. |
cohortDatabaseSchema |
The schema where the cohort table is located. |
cohortTableName |
The name of the cohort table. Defaults to 'cohort'. |
tablesToExclude |
(OPTIONAL) Choose which CDM tables to exclude from the execution. |
cdmVersion |
The CDM version to target for the data source. Options are "5.2", "5.3", or "5.4". By default, "5.3" is used. |
tableCheckThresholdLoc |
The location of the threshold file for evaluating the table checks. If not specified, the default thresholds will be applied. |
fieldCheckThresholdLoc |
The location of the threshold file for evaluating the field checks. If not specified, the default thresholds will be applied. |
conceptCheckThresholdLoc |
The location of the threshold file for evaluating the concept checks. If not specified, the default thresholds will be applied. |
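As a rough illustration of how several of the arguments above combine, the sketch below wraps a SQL-only call in a helper. The helper name generateDqdSql, the schema names, the source name, the output folder, and the union count of 100 are all placeholders chosen for this example, not values taken from the package. The call is wrapped in a function so that nothing touches a database until you invoke it with your own connectionDetails.

```r
# Hypothetical helper: generate the check SQL instead of executing it.
# Every literal value below is a placeholder for illustration only.
generateDqdSql <- function(connectionDetails, outputFolder = "dqd_output") {
  DataQualityDashboard::executeDqChecks(
    connectionDetails = connectionDetails,
    cdmDatabaseSchema = "my_db.cdm",          # placeholder schema
    resultsDatabaseSchema = "my_db.results",  # placeholder schema
    cdmSourceName = "My CDM Source",          # placeholder source name
    sqlOnly = TRUE,                           # return SQL rather than run it
    sqlOnlyIncrementalInsert = TRUE,          # SQL that inserts into the results table
    sqlOnlyUnionCount = 100,                  # arbitrary; commands unioned per query
    outputFolder = outputFolder,
    checkLevels = c("TABLE", "FIELD"),        # skip CONCEPT-level checks
    cdmVersion = "5.4"
  )
}
```

Restricting checkLevels, as done here, is one way to keep a first run small; checkNames and checkSeverity narrow the selection further in the same manner.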
A list object of results
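Because the return value is documented only as a list, a cautious first step is to look at its top-level structure before relying on any particular element. The sketch below assumes a hypothetical helper runTableChecks with placeholder schema and source names; it runs the TABLE-level checks only, skips the database results table, and prints the structure of whatever comes back.

```r
# Hypothetical helper: run table-level checks and inspect the returned list.
# Schema names, source name, and folder are placeholders for illustration.
runTableChecks <- function(connectionDetails, outputFolder = "dqd_output") {
  results <- DataQualityDashboard::executeDqChecks(
    connectionDetails = connectionDetails,
    cdmDatabaseSchema = "my_db.cdm",          # placeholder schema
    resultsDatabaseSchema = "my_db.results",  # placeholder schema
    cdmSourceName = "My CDM Source",          # placeholder source name
    outputFolder = outputFolder,
    checkLevels = "TABLE",
    writeToTable = FALSE                      # do not write to the results table
  )
  # Show only the top level of the list; element names are not assumed here.
  utils::str(results, max.level = 1)
  invisible(results)
}

# Example call (not run here; requires a reachable CDM database):
# connectionDetails <- DatabaseConnector::createConnectionDetails(
#   dbms = "postgresql", server = "localhost/ohdsi",
#   user = "user", password = "secret"
# )
# results <- runTableChecks(connectionDetails)
```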