Guide for use bcROCsurface"
In bcROCsurface: Bias-Corrected Methods for Estimating the ROC Surface of Continuous Diagnostic Tests

if (!requireNamespace("rmarkdown", quietly = TRUE) ||
      !rmarkdown::pandoc_available("1.14")) {
    warning(call. = FALSE, "These vignettes assume rmarkdown and pandoc version 1.14.  These were not found. Older versions will not work.")
    knitr::knit_exit()
  }

Install and Load the package

To install the package from CRAN, type:

install.packages("bcROCsurface")

Next, load the package.

library(bcROCsurface)

Diagnosis of EOC

To illustrate the use of the package bcROCsurface, we consisder an example, which presents the evaluation process for biomaker CA125 in the diagnosis of epithelial ovarian cancer (EOC).

Load the data set

data(EOC)

The data have 278 observations on the following 6 variables:

head(EOC)

##   D.full V  D        CA125       CA153 Age
## 1      3 1  3  3.304971965  1.42822875  41
## 2      1 0 NA  0.112479444  0.11665310  52
## 3      2 1  2  2.375011262 -0.04096794  50
## 4      1 0 NA -0.001545381  0.32111633  66
## 5      1 0 NA  0.278200345 -0.14283052  52
## 6      2 0 NA  0.167645382  0.81470563  50

In data set, CA125 and CA153 are two biomarkers, Age is the age of the patients. The variable V is the verification status; 1 and 0 indicates verified and non-verified subject, respectively. D.full is disease status, which consist of three classes, says, 1, 2, 3. These levels correspond to benign disease, early stage (I and II) and late stage (III and IV). On the other hand, D is missing disease status.

Obtaining ROC surface and VUS

FULL Estimator

The ROC surface and VUS are only applied when an monotone increasing ordering is of interest. Thus, before estimate ROC and also VUS, we have to be sure that the ordering of disease classes is monotone incresasing (with respect to the diagnostic test values). In order to do that, the function pre_data() is usefull.

dise_full <- pre_data(EOC$D.full, EOC$CA125)

On the other hand, we describe the full disease status dise_full as the binary matrix having three columns, corresponding to three classes of the disease status. Each row corresponds to a trinomial vector, in which, 1 indicates the subject belongs to class j with j = 1,2,3. The function pre_data() also done this work.

head(dise_full$dise)

## [1] 3 1 2 1 1 2
## Levels: 1 2 3

head(dise_full$dise_vec)

##      D1 D2 D3
## [1,]  0  0  1
## [2,]  1  0  0
## [3,]  0  1  0
## [4,]  1  0  0
## [5,]  1  0  0
## [6,]  0  1  0

We construct the ROC surface of full data, and estimate the VUS.

library(knitr)
library(rgl)
knit_hooks$set(webgl = hook_webgl)

dise_vec_full <- dise_full$dise_vec
rocs(method = "full", diag_test =  EOC$CA125, dise_vec = dise_vec_full,
     ncp = 30, ellipsoid = TRUE, cpst = c(-0.56, 2.31))

Here, we consider the full data, so we only need to put the arguments diag_test and dise_vec, and method is full.

The FULL estimator of VUS is obtained by the following command:

vus_mar("full", diag_test = EOC$CA125, dise_vec = dise_vec_full, ci = TRUE)

## Hmm, look likes the full data
## The verification status is not available
## You are working on FULL or Complete Case approach
## Number of observation: 278
## The diagnostic test: CA125 
## Processing .... 
## DONE
## 
## CALL: vus_mar(method = "full", diag_test = EOC$CA125, dise_vec = dise_vec_full, 
##     ci = TRUE)
##  
## Estimate of VUS: 0.5663 
## Standard error: 0.0377 
## 
## Intervals :
## Level        Normal                Logit        
## 95%   ( 0.4924,  0.6402 )   ( 0.4914,  0.6382 ) 
## Estimation of Standard Error and Intervals are based on Jackknife approach
##
## Testing the null hypothesis H0: VUS = 1/6 
##             Test Statistic   P-value    
## Normal-test         10.597 < 2.2e-16 ***
## ---
## Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

FI and MSI Estimator

Now, we compute the FI and MSI estimator with missing data. First, we need to estimate the disease probabilities by using multinomial logistic model. In bcROCsurface package, this work is done by using rho_mlogit() function.

dise_na <- pre_data(EOC$D, EOC$CA125)
dise_vec_na <- dise_na$dise_vec
dise_fact_na <- dise_na$dise
rho_out <- rho_mlogit(dise_fact_na ~ CA125 + CA153 + Age, data = EOC, 
                      test = TRUE)

The following command provides the ROC surface by means of FI esimator:

rocs(method = "fi", diag_test =  EOC$CA125, dise_vec = dise_vec_na, 
     veri_stat = EOC$V, rho_est = rho_out, ncp = 30, ellipsoid = TRUE, 
     cpst = c(-0.56, 2.31))

And, for VUS:

vus_mar(method = "fi", diag_test = EOC$CA125, dise_vec = dise_vec_na, 
        veri_stat = EOC$V, rho_est = rho_out, ci = TRUE)

## Hmm, look likes the incomplete data
## Number of observation: 278
## 64% of the subjects receive disease verification. 
## You required estimate VUS using FI approach 
## The diagnostic test: CA125 
## Processing .... 
## DONE
## 
## CALL: vus_mar(method = "fi", diag_test = EOC$CA125, dise_vec = dise_vec_na, 
##     veri_stat = EOC$V, rho_est = rho_out, ci = TRUE)
##  
## Estimate of VUS: 0.515 
## Standard error: 0.0404 
## 
## Intervals :
## Level        Normal                Logit        
## 95%   ( 0.4357,  0.5942 )   ( 0.4360,  0.5932 ) 
## Estimation of Standard Error and Intervals are based on Asymptotic Theory
##
## Testing the null hypothesis H0: VUS = 1/6 
##             Test Statistic   P-value    
## Normal-test         8.6168 < 2.2e-16 ***
## ---
## Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

For MSI estimator, we could do that to plot ROC surface

rocs(method = "msi", diag_test = EOC$CA125, dise_vec = dise_vec_na, 
     veri_stat = EOC$V, rho_est = rho_out, ncp = 30,
     ellipsoid = TRUE, cpst = c(-0.56, 2.31))

and compute VUS

vus_mar(method = "msi", diag_test = EOC$CA125, dise_vec = dise_vec_na, 
        veri_stat = EOC$V, rho_est = rho_out, ci = TRUE)

## Hmm, look likes the incomplete data
## Number of observation: 278
## 64% of the subjects receive disease verification. 
## You required estimate VUS using MSI approach 
## The diagnostic test: CA125 
## Processing .... 
## DONE
## 
## CALL: vus_mar(method = "msi", diag_test = EOC$CA125, dise_vec = dise_vec_na, 
##     veri_stat = EOC$V, rho_est = rho_out, ci = TRUE)
##  
## Estimate of VUS: 0.5183 
## Standard error: 0.0415 
## 
## Intervals :
## Level        Normal                Logit        
## 95%   ( 0.4368,  0.5997 )   ( 0.4371,  0.5985 ) 
## Estimation of Standard Error and Intervals are based on Asymptotic Theory
##
## Testing the null hypothesis H0: VUS = 1/6 
##             Test Statistic   P-value    
## Normal-test         8.4644 < 2.2e-16 ***
## ---
## Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

IPW and SPE Estimator

The IPW and SPE estimator require the estimation for verification probabilities. In order to do that we can employ some regression model for binary response, i.e. logistic, probit and may be threshold. Here, we consider the implementation of logistic model. In this package, the function psglm() is used to fit the verification model.

pi_out <- psglm(V ~ CA125 + CA153 + Age, data = EOC, model = "logit", 
                test = TRUE, trace = TRUE)

To plot the ROC surface and estimate VUS, we do

rocs(method = "ipw", diag_test = EOC$CA125, dise_vec = dise_vec_na, 
     veri_stat = EOC$V, pi_est = pi_out, ncp = 30,
     ellipsoid = TRUE, cpst = c(-0.56, 2.31))

vus_mar(method = "ipw", diag_test = EOC$CA125, dise_vec = dise_vec_na, 
        veri_stat = EOC$V, pi_est = pi_out, ci = TRUE)

## Hmm, look likes the incomplete data
## Number of observation: 278
## 64% of the subjects receive disease verification. 
## You required estimate VUS using IPW approach 
## The diagnostic test: CA125 
## Processing .... 
## DONE
##
## CALL: vus_mar(method = "ipw", diag_test = EOC$CA125, dise_vec = dise_vec_na, 
##     veri_stat = EOC$V, pi_est = pi_out, ci = TRUE)
## 
## Estimate of VUS: 0.55 
## Standard error: 0.0416 
##
## Intervals :
## Level        Normal                Logit        
## 95%   ( 0.4685,  0.6314 )   ( 0.4679,  0.6294 ) 
## Estimation of Standard Error and Intervals are based on Asymptotic Theory
## 
## Testing the null hypothesis H0: VUS = 1/6 
##             Test Statistic   P-value    
## Normal-test         9.2212 < 2.2e-16 ***
## ---
## Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

For SPE estimator, we type

rocs(method = "spe", diag_test = EOC$CA125, dise_vec = dise_vec_na, 
     veri_stat = EOC$V, rho_est = rho_out,
     pi_est = pi_out, ncp = 30, ellipsoid = TRUE, cpst = c(-0.56, 2.31))

to provide the plot of ROC surface, and

vus_mar(method = "spe", diag_test = EOC$CA125, dise_vec = dise_vec_na,
        veri_stat = EOC$V, rho_est = rho_out,
        pi_est = pi_out, ci = TRUE)

## Hmm, look likes the incomplete data
## Number of observation: 278
## 64% of the subjects receive disease verification. 
## You required estimate VUS using SPE approach 
## The diagnostic test: CA125 
## Processing .... 
## DONE
##
## CALL: vus_mar(method = "spe", diag_test = EOC$CA125, dise_vec = dise_vec_na, 
##     veri_stat = EOC$V, rho_est = rho_out, pi_est = pi_out, ci = TRUE)
##  
## Estimate of VUS: 0.5581 
## Standard error: 0.0443 
## 
## Intervals :
## Level        Normal                Logit        
## 95%   ( 0.4712,  0.6450 )   ( 0.4703,  0.6424 ) 
## Estimation of Standard Error and Intervals are based on Asymptotic Theory
##
## Testing the null hypothesis H0: VUS = 1/6 
##            Test Statistic   P-value    
## Normal-test          8.827 < 2.2e-16 ***
## ---
## Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

for VUS.

Note that, the asymptotic variances of bias-corrected estimators (FI, MSI, IPW, SPE) of the VUS presented in previous parts could be obtained by using the bootstrap process. This is done in a simple way, in fact, the users only need to use the option boot = TRUE and select the number of bootstrap replication n_boot (default 250). In addition, to save the computation time, the option of parallel computing could be allowed by parallel = TRUE. In this case, the users may design the number of cpus by option ncpus, however, if this argument is ignored, then the defaut (a half of available cpus) will be supplied.

KNN estimator

Like the MSI estimator, the KNN approach is based on the estimate of disease probabilities. However, in this framework, we will use K nearest-neighbor estimators.

1NN estimator

x_mat <- cbind(EOC$CA125, EOC$CA153, EOC$Age)
rho_1nn <- rho_knn(x_mat, dise_vec = dise_vec_na, veri_stat = EOC$V, k = 1, 
                   type = "mahala")

rocs("knn", diag_test = EOC$CA125, dise_vec_na, veri_stat = EOC$V, 
     rho_est = rho_1nn, ncp = 30, ellipsoid = TRUE,
     cpst = c(-0.56, 2.31))

vus_mar(method = "knn", diag_test = EOC$CA125, dise_vec = dise_vec_na, 
        veri_stat = EOC$V, rho_est = rho_1nn, ci = TRUE,
        parallel = TRUE)

## Hmm, look likes the incomplete data
## Number of observation: 278
## 64% of the subjects receive disease verification. 
## You required estimate VUS using KNN approach 
## The diagnostic test: CA125 
## Processing .... 
## DONE
##
## CALL: vus_mar(method = "knn", diag_test = EOC$CA125, dise_vec = dise_vec_na, 
##     veri_stat = EOC$V, rho_est = rho_1nn, ci = TRUE, parallel = TRUE)
##  
## Estimate of VUS: 0.5123 
## Standard error: 0.0468 
## 
## Intervals :
## Level        Normal                Logit        
## 95%   ( 0.4205,  0.6040 )   ( 0.4212,  0.6025 )
## Estimation of Standard Error and Intervals are based on Bootstrap with 250 replicates
##
## Testing the null hypothesis H0: VUS = 1/6 
##             Test Statistic   P-value    
## Normal-test         7.3856 7.583e-14 ***
## ---
## Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

To find the good choice for the number of nearest-neighbor, we use the following command:

cv_knn(x_mat, dise_vec_na, EOC$V, type = "mahala", plot = TRUE)

Any scripts or data that you put into this service are public.

bcROCsurface documentation built on Sept. 9, 2023, 9:07 a.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

bcROCsurface
Bias-Corrected Methods for Estimating the ROC Surface of Continuous Diagnostic Tests

Guide for use bcROCsurface"
In bcROCsurface: Bias-Corrected Methods for Estimating the ROC Surface of Continuous Diagnostic Tests

Install and Load the package

Diagnosis of EOC

Load the data set

Obtaining ROC surface and VUS

FULL Estimator

FI and MSI Estimator

IPW and SPE Estimator

KNN estimator

1NN estimator

Try the bcROCsurface package in your browser

R Package Documentation

Browse R Packages

We want your feedback!

bcROCsurface Bias-Corrected Methods for Estimating the ROC Surface of Continuous Diagnostic Tests

Guide for use bcROCsurface" In bcROCsurface: Bias-Corrected Methods for Estimating the ROC Surface of Continuous Diagnostic Tests

Install and Load the package

Diagnosis of EOC

Load the data set

Obtaining ROC surface and VUS

FULL Estimator

FI and MSI Estimator

IPW and SPE Estimator

KNN estimator

1NN estimator

Try the bcROCsurface package in your browser

R Package Documentation

Browse R Packages

We want your feedback!

bcROCsurface
Bias-Corrected Methods for Estimating the ROC Surface of Continuous Diagnostic Tests

Guide for use bcROCsurface"
In bcROCsurface: Bias-Corrected Methods for Estimating the ROC Surface of Continuous Diagnostic Tests