validationPackage/EhdenRaPredictionValidation/readme.md

EhdenRaPredictionValidation

Introduction

This package contains code to externally validate models for the prediction of adverse health outcomes in rheumatoid arthritis (RA) developed on the database Optum Clinformatics.

Features

Technology

EhdenRaPredictionValidation is an R package.

System Requirements

Dependencies

Guide

A general guide for running a validation study package is available here: Skeleton Validation Study guide

A1. Installing the package from GitHub

# To install the package from github:
install.packages("devtools")
devtools::install_github("ohdsi-studies/EhdenRaPrediction/validationPackage/EhdenRaPredictionValidation")

A2. Building the package inside RStudio

  1. Open the validation package project file (file ending in .Rproj)
  2. Build the package in RStudio by selecting the 'Build' option in the top right (the tabs contain 'Environment', 'History', 'Connections', 'Build', 'Git') and then clicking on the 'Install and Restart'

B. Getting Started

  1. Make sure to have either: installed (A1) or built (A2) the package
  2. In R, run the code in 'extras/codeToRun.R' (see Skeleton Validation Study guide for guidance)

C. Example Code

library(EhdenRaPredictionValidation)

# add details of your database setting:
databaseName <- 'add a shareable name for the database you are currently validating on'

# add the cdm database schema with the data
cdmDatabaseSchema <- 'your cdm database schema for the validation'

# add the work database schema this requires read/write privileges 
cohortDatabaseSchema <- 'your work database schema'

# if using oracle please set the location of your temp schema
oracleTempSchema <- NULL

# the name of the table that will be created in cohortDatabaseSchema to hold the cohorts
cohortTable <- 'EhdenRaPredictionValidationCohortTable'

# the location to save the prediction models results to:
# NOTE: if you set the outputFolder to the 'Validation' directory in the 
#       prediction study outputFolder then the external validation will be
#       saved in a format that can be used by the shiny app 
outputFolder <- '../Validation'

# add connection details:
options(fftempdir = 'T:/fftemp')
dbms <- "pdw"
user <- NULL
pw <- NULL
server <- Sys.getenv('server')
port <- Sys.getenv('port')
connectionDetails <- DatabaseConnector::createConnectionDetails(dbms = dbms,
                                                                server = server,
                                                                user = user,
                                                                password = pw,
                                                                port = port)

# Now run the study:
EhdenRaPredictionValidation::execute(connectionDetails = connectionDetails,
                 databaseName = databaseName,
                 cdmDatabaseSchema = cdmDatabaseSchema,
                 cohortDatabaseSchema = cohortDatabaseSchema,
                 oracleTempSchema = oracleTempSchema,
                 cohortTable = cohortTable,
                 outputFolder = outputFolder,
                 createCohorts = T,
                 runValidation = T,
                 packageResults = F,
                 minCellCount = 5,
                 sampleSize = NULL)

# If the validation study runs to completion and returns results, package it up ready to share with the study owner (but remove counts less than 10) by running:
EhdenRaPredictionValidation::execute(connectionDetails = connectionDetails,
                 databaseName = databaseName,
                 cdmDatabaseSchema = cdmDatabaseSchema,
                 cohortDatabaseSchema = cohortDatabaseSchema,
                 oracleTempSchema = oracleTempSchema,
                 cohortTable = cohortTable,
                 outputFolder = outputFolder,
                 createCohorts = F,
                 runValidation = F,
                 packageResults = T,
                 minCellCount = 10,
                 sampleSize = NULL)


# If your target cohort is large use the sampleSize setting to sample from the cohort:
EhdenRaPredictionValidation::execute(connectionDetails = connectionDetails,
                 databaseName = databaseName,
                 cdmDatabaseSchema = cdmDatabaseSchema,
                 cohortDatabaseSchema = cohortDatabaseSchema,
                 oracleTempSchema = oracleTempSchema,
                 cohortTable = cohortTable,
                 outputFolder = outputFolder,
                 createCohorts = T,
                 runValidation = T,
                 packageResults = F,
                 minCellCount = 10,
                 sampleSize = 1000000)

License

EhdenRaPredictionValidation is licensed under Apache License 2.0

Development

EhdenRaPredictionValidation is being developed in R Studio.



ohdsi-studies/EhdenRaPrediction documentation built on Sept. 15, 2021, 3:07 a.m.