xliu-stat/SAT: Surrogate Assisted Two-wave Case Boosting Sampling

Electronic health records (EHR) enable investigation of the association between phenotypes and risk factors. However, studies that rely solely on potentially error-prone EHR-derived phenotypes (i.e., surrogates) are subject to bias, and analyses of low-prevalence phenotypes may also suffer from poor efficiency. For analyzing rare diseases, we develop a Surrogate Assisted Two-wave (SAT) sampling method that selects a subsample for outcome validation through manual chart review, subject to budget constraints. A model is then fitted to the validated subsample. The subsample selected with the proposed method contains informative observations that effectively reduce the mean squared error (MSE) of the resulting estimator of the association.
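
To make the two-wave idea concrete, the following is a minimal sketch in base R of a generic surrogate-assisted, case-boosting two-wave design on simulated data. It is not the SAT package's API and not necessarily the authors' sampling rule: the simulated cohort, the budgets n1 and n2, the 50/50 wave-1 split between surrogate strata, the wave-2 score, and the 80/20 mixing with uniform sampling are all assumptions chosen for illustration.

```r
set.seed(1)

# Simulated cohort (every number here is an illustrative assumption).
N <- 20000
x <- rnorm(N)                                  # risk factor
y <- rbinom(N, 1, plogis(-4 + 0.8 * x))        # true rare phenotype (known only after chart review)
s <- rbinom(N, 1, ifelse(y == 1, 0.85, 0.02))  # error-prone EHR surrogate
dat <- data.frame(x = x, y = y, s = s)

n1 <- 300  # hypothetical wave-1 validation budget
n2 <- 700  # hypothetical wave-2 validation budget

# Wave 1: Poisson sampling that spends half the budget on surrogate-positive
# records, which boosts the number of true cases in the pilot sample.
pi1 <- pmin(1, n1 * ifelse(s == 1, 0.5 / sum(s == 1), 0.5 / sum(s == 0)))
dat$in1 <- rbinom(N, 1, pi1) == 1
dat$w1 <- 1 / pi1

# Pilot model: inverse-probability-weighted logistic regression on wave 1.
pilot <- glm(y ~ x, family = quasibinomial(), data = dat,
             weights = w1, subset = in1)
p <- as.numeric(predict(pilot, newdata = dat, type = "response"))

# Validated case rate within each surrogate stratum, estimated from wave 1.
q_hat <- sapply(0:1, function(k) mean(y[dat$in1 & s == k]))
q <- q_hat[s + 1]

# Wave 2: sample records whose expected squared residual, given the surrogate,
# is large; mix with uniform sampling so that no weight becomes extreme.
score <- sqrt(q * (1 - p)^2 + (1 - q) * p^2)
pi2 <- pmin(1, n2 * (0.8 * score / sum(score) + 0.2 / N))
in2 <- !dat$in1 & rbinom(N, 1, pi2) == 1

# Final fit on the union of both waves, weighted by the inverse of the
# (approximate) overall inclusion probability.
dat$sel <- dat$in1 | in2
dat$w <- 1 / (pi1 + (1 - pi1) * pi2)
fit <- glm(y ~ x, family = quasibinomial(), data = dat,
           weights = w, subset = sel)
print(coef(fit))
```

In this sketch the true outcome y is used only for records selected in one of the two waves, mirroring the fact that validated phenotypes exist only for chart-reviewed patients. The combined inclusion probability treats the wave-2 probabilities as fixed given wave 1, which is a simplification.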

Package details

Maintainer
License: GPL-3
Version: 0.1.0
Package repository: GitHub (xliu-stat/SAT)

Installation

Install the latest version of this package by entering the following in R:
install.packages("remotes")
remotes::install_github("xliu-stat/SAT")
xliu-stat/SAT documentation built on Dec. 23, 2021, 7:10 p.m.