knitr::opts_chunk$set( collapse = TRUE, comment = "#>" ) library(CDMConnector) if (Sys.getenv("EUNOMIA_DATA_FOLDER") == "") Sys.setenv("EUNOMIA_DATA_FOLDER" = tempdir()) if (!dir.exists(Sys.getenv("EUNOMIA_DATA_FOLDER"))) dir.create(Sys.getenv("EUNOMIA_DATA_FOLDER")) if (!eunomiaIsAvailable()) downloadEunomiaData(datasetName = "synpuf-1k", cdmVersion = "5.3")
In this vignette, we are going to present how to run PhenotypeDiagnostics()
. We are going to use the following packages and mock data:
library(CohortConstructor) library(PhenotypeR) library(dplyr) con <- DBI::dbConnect(duckdb::duckdb(), CDMConnector::eunomiaDir("synpuf-1k", "5.3")) cdm <- CDMConnector::cdmFromCon(con = con, cdmName = "Eunomia Synpuf", cdmSchema = "main", writeSchema = "main", achillesSchema = "main") cdm
Note that we have included achilles tables in our cdm reference, which will be used to speed up some of the analyses.
First, we are going to use the package CohortConstructor to generate three cohorts of warfarin, acetaminophen and morphine users.
# Create a codelist codes <- list("warfarin" = c(1310149, 40163554), "acetaminophen" = c(1125315, 1127078, 1127433, 40229134, 40231925, 40162522, 19133768), "morphine" = c(1110410, 35605858, 40169988)) # Instantiate cohorts with CohortConstructor cdm$my_cohort <- conceptCohort(cdm = cdm, conceptSet = codes, exit = "event_end_date", overlap = "merge", name = "my_cohort")
Now we will proceed to run phenotypeDiagnotics()
. This function will run the following analyses:
We can specify which analysis we want to perform by setting to TRUE or FALSE each one of the corresponding arguments:
result <- phenotypeDiagnostics( cohort = cdm$my_cohort, databaseDiagnostics = TRUE, codelistDiagnostics = TRUE, cohortDiagnostics = TRUE, populationDiagnostics = TRUE, populationSample = 1e+06, populationDateRange = as.Date(c(NA, NA)), matchedDiagnostics = TRUE, matchedSample = 1000 ) result |> glimpse()
Notice that we have three additional arguments:
populationSample
: It allows to specify a number of people that randomly will be extracted from the CDM to perform the Population diagnostics analysis. If NULL, all the participants in the CDM will be included. It helps to reduce the computational time.populationDateRange
: We can use it to specify the time period when we want to perform our Population diagnostics analysis.matchedSample
: Similar to populationSample, this arguments subsets a random sample of people to perform the Matching diagnostics.To save the results, we can use exportSummarisedResult function from omopgenerics R Package:
exportSummarisedResult(result, directory = here::here(), minCellCount = 5)
Once we get our Phenotype diagnostics result, we can use shinyDiagnostics
to easily create a shiny app and visualise our results:
result <- shinyDiagnostics(result, directory = tempdir(), minCellCount = 5, open = TRUE)
Notice that we have specified the minimum number of counts (minCellCount
) for suppression to be shown in the shiny app, and also that we want the shiny to be launched in a new R session (open
). You can see the shiny app generated for this example in here.See Shiny diagnostics vignette for a full explanation of the shiny app.
Any scripts or data that you put into this service are public.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.