Author: Jacob S. Zelko
Date: 2022-11-02
Summary: A feasibility assessment evaluating the candidacy of a partner site location
The following packages will be loaded to conduct the feasibility assessment:
```r
library(DatabaseConnector)
library(dplyr)
library(lubridate)
library(readr)
library(SqlRender)
library(tibble)
```
To learn more about these packages, see the Appendix.
Here, we need to set up a connection to the OMOP CDM database we will assess. To do so, we need to define some constants that will be used for the connection:

- `dbms` - the database management system that is used to host your database; common options include `"postgresql"` and `"sql server"` (see the DatabaseConnector documentation for all supported options)
- `server` - the name of the server; could be `localhost`, an address like `123.0.1.5`, etc.
- `user` - your username to access the server
- `password` - the password you use to access the server
- `port` - the port where the database is hosted
- `schema` - the name of the database schema used

These must be defined in this code block:
dbms <- "Fill in here" server <- "Fill in here" user <- "Fill in here" password <- "Fill in here" port <- "Fill in here" schema <- "Fill in here"
An additional step is to download the JDBC driver required to connect to the database. This is accomplished in the following code block (change `eval = FALSE` to `eval = TRUE` once you have set these variables correctly):
```r
pathToDriver <- "/location/that/you/want"
downloadJdbcDrivers(dbms = dbms, pathToDriver = pathToDriver, method = "auto")
```
Once this is done, we can create the connection to the database (again, change `eval = FALSE` to `eval = TRUE` when you have set these variables correctly):
```r
connectionDetails <- createConnectionDetails(dbms = dbms,
                                             server = server,
                                             user = user,
                                             password = password,
                                             port = port,
                                             pathToDriver = pathToDriver)
connection <- connect(connectionDetails)
```
If there were no errors, then we should be able to continue with the analysis!
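As a quick sanity check, you can run a trivial query against the `person` table. This is a minimal sketch, not part of the assessment itself; it assumes your CDM tables live in the `schema` you defined above:

```r
# Hypothetical sanity check: count the rows in the person table.
# render() and querySql() come from SqlRender and DatabaseConnector,
# both loaded earlier.
sql <- render("SELECT COUNT(*) AS person_count FROM @schema.person",
              schema = schema)
sql <- translate(sql, targetDialect = dbms)
querySql(connection, sql)
```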
WARN: As you proceed with this analysis, if you encounter a Java issue like this: "Insufficient java heap memory", please run the following code block:
```r
options(java.parameters = c("-XX:+UseConcMarkSweepGC", "-Xmx8192m"))
```
This is only an emergency workaround and should be removed when a better solution is found.
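Note that `java.parameters` is only read when the Java virtual machine first starts, so the option must be set before DatabaseConnector (which uses rJava) is loaded. A minimal sketch of a fresh session:

```r
# Set JVM options first: they have no effect once the JVM is running.
options(java.parameters = c("-XX:+UseConcMarkSweepGC", "-Xmx8192m"))

# Only then attach the JDBC-backed package.
library(DatabaseConnector)
```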
If any of this was confusing, here is an example of how to fill out the above connection information:
dbms <- "postgresql" server <- "test.data.americus.edu/mimic_omop" user <- "mimic" password <- "omoprocks" port <- 5042 schema <- "mimic.omop" pathToDriver = "utils" downloadJdbcDrivers(dbms = dbms, pathToDriver = pathToDriver, method = "auto") connectionDetails <- createConnectionDetails(dbms=dbms, server=server, user=user, password=password, port=port, pathToDriver=pathToDriver) connection <- connect(connectionDetails)
There are a number of queries to execute and analyses that will take varying amounts of time to run depending on how many patients are in your database and how much data is at your site. Each query has the following two pieces of information:

1. Description - what the query does and how it works.
2. Requirements - the tables and fields the query expects to be available.

If there are concerns about 1, you can view the queries within the `sql` directory to see each query. If the requirements per 2 are not met, then either that query will need to be skipped or the corresponding query within the `sql` directory will need to be modified.
NOTE: For 2, if there is no data available in a certain CDM field, that will not cause the query to fail. The query will fail, however, if a given field does not exist within a database table at all.
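If you would like to verify a table's fields up front, one lightweight approach (a sketch for illustration, not part of the original assessment) is to select zero rows and inspect the column names that come back:

```r
# Hypothetical pre-flight check: fetch zero rows from person and look at
# the column names. Reuses the schema and dbms constants defined above.
sql <- render("SELECT * FROM @schema.person WHERE 1 = 0", schema = schema)
sql <- translate(sql, targetDialect = dbms)
person_columns <- colnames(querySql(connection, sql))
"year_of_birth" %in% tolower(person_columns)
```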
The final step needed before running the following queries is to define some constants we will use in this analysis:
- `site_name` - the name of the site in snake case (e.g. "Georgia Tech Research Institute" becomes "georgia_tech_research_institute")

And here is the code block that needs to be updated:
```r
site_name <- "Fill in here"
site_path <- paste("../data/", site_name, sep = "")
dir.create(path = site_path)
```
Description: A compact query to get all persons by race, gender, and age group.
For the purposes of this analysis, age is calculated by subtracting each person's year of birth from the latest recorded date found in the `observation_period` table (considered by OHDSI experts to be the table with the latest information in a database).
Requirements: This query expects the following tables and their fields to be available:
| Table | Field |
| --------------- | --------------- |
| observation_period | observation_period_end_date |
| person | person_id |
| person | year_of_birth |
| person | race_concept_id |
| person | gender_concept_id |
source("sql/stratified_person.R")
Now we can enforce aggregation and patient count suppression per HITECH standards via filtering:
```r
person_stratified_counts <- person_stratified %>%
  filter(COUNTS > 10)
```
Now, all we need to do is export the data:
```r
# sep = "," ensures a true CSV; write.table defaults to whitespace.
write.table(person_stratified_counts,
            file = paste(site_path, "/person_stratified_breakdown.csv", sep = ""),
            sep = ",",
            row.names = FALSE)
```
Description: Gets counts of distinct care site types present in database.
Requirements: This query expects the following tables and their fields to be available:
| Table | Field |
| --------------- | --------------- |
| care_site | care_site_id |
| care_site | place_of_service_id |
| visit_occurrence | person_id |
| visit_occurrence | care_site_id |
source("sql/care_sites.R")
For this, all we need to do is export the data:
```r
write.table(care_sites,
            file = paste(site_path, "/care_site_breakdown.csv", sep = ""),
            sep = ",",
            row.names = FALSE)
```
Description: Gets counts of persons broken down by state.
Requirements: This query expects the following tables and their fields to be available:
| Table | Field |
| --------------- | --------------- |
| location | location_id |
| location | state |
source("sql/state_person.R")
Here we must enforce HITECH standards for filtering:
```r
location <- location %>%
  filter(COUNT > 10)
```
Now, all we need to do is export the data:
```r
write.table(location,
            file = paste(site_path, "/location_breakdown.csv", sep = ""),
            sep = ",",
            row.names = FALSE)
```
Description: Count of unique visits across types of visits.
Requirements: This query expects the following tables and their fields to be available:
| Table | Field |
| --------------- | --------------- |
| visit_occurrence | person_id |
| visit_occurrence | visit_concept_id |
source("sql/visit_type.R")
For this, all we need to do is export the data:
```r
write.table(visit_types,
            file = paste(site_path, "/visit_type_breakdown.csv", sep = ""),
            sep = ",",
            row.names = FALSE)
```
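Once all extracts are written, it is good practice to release the database connection (not part of the original workflow, but supported by DatabaseConnector):

```r
# Close the JDBC connection now that all queries have run.
disconnect(connection)
```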
With this feasibility analysis done, please share the results back with us at GTRI. To do so, locate the folder that was generated for your site in `../data/feasibility/` (a folder with your site's name containing the CSV extracts) and upload it to the Box link that we provided you over email.
Feel free to contact us at jacob.zelko@gtri.gatech.edu
and we can discuss further steps!
If you completed this entire feasibility assessment, THANK YOU -- you are incredible! You are making this network study possible and my team and I at GTRI fully understand how difficult getting these assessments to run can be! We look forward to collaborating further!
- `renv` - create reproducible environments for R projects
- `dplyr` - a grammar for data manipulation
- `tibble` - improved data.frame functionality
- `SqlRender` - a package for rendering parameterized SQL
- `DatabaseConnector` - a package for connecting to databases using JDBC
- `lubridate` - makes it easier to work with date-times
- `readr` - a fast and friendly way to read rectangular data