HSROCSummary: Summary statistics for HSROC models.

Description Usage Arguments Details Value References Examples

Description

This function produces summaries for the HSROC model parameters

Usage

1
2
3
4
5
6
7
HSROCSummary(data, burn_in = 0, iter.keep = NULL, Thin = 1, sub_rs=NULL, 
  point_estimate = c("median", "mean"), summary.path = getwd(), 
  chain = getwd(), tv = NULL, digit = 6, print_plot = FALSE, 
  plot.ind.studies = TRUE, cred_region = TRUE, predict_region = TRUE, 
  col.pooled.estimate = "red", col.predict.region = "blue", 
  lty.cred.region = "dotdash", lty.predict.region = "dotted", 
  region_level = 0.95, trunc_low = 0.025, trunc_up = 0.025 ) 

Arguments

data

a matrix with the number of rows equal to the number of studies and 4 columns. Each row consists of the entries of the 2x2 table of the index test (i.e. test under evaluation) vs. the reference test reported in each study. The ordering of the columns is ++, +-, -+, –, where the first entry refers to the result of the test under evaluation and the second entry refers to the result of the reference test.

burn_in

The number of early iterations that are to be dropped. The default value is 0.

iter.keep

Maximum number of iteration we want to keep.

Thin

a single numeric value. It sets the numerical field used to select every thin-th iteration to contribute to the estimates being calculated. The default value is 1.

sub_rs

a list that describes the partition of the reference standard among the studies, if any. See details for further explanations.

point_estimate

a character string indicating which method is to be used to calculate the estimates. One of "median" (default) or "mean", can be used

summary.path

a character string pointing to the directory where the SUMMARY files are to be stored.

chain

A list of character strings pointing to each directory, one for each chain, where the files created during the Gibbs sampler process are stored.

tv

a list of true parameter values. See details for further explanations

digit

integer indicating the number of decimal places to be used. The default value is 6.

print_plot

logical. If TRUE, pdf files of trace, density and summary receiver operating characteristic (SROC) curve plots are saved in the summary.path working directory to help assess convergence of the Gibbs sampler.

plot.ind.studies

logical. If TRUE (default), empty circles representing individual studies are drawn on the SROC plot. The radius of the circles are proportional to the sample size of the studies.

cred_region

logical. If TRUE (default), a credible region curve is drawn on the SROC plot.

predict_region

logical. If TRUE (default), a prediction region curve is drawn on the SROC plot.

col.pooled.estimate

A specification for the default SROC plotting color of the pooled estimate point and credible region curve. Defaults to "red".

col.predict.region

A specification for the default SROC plotting color of the prediction region curve. Defaults to "blue".

lty.cred.region

The credible region line type. Default to "dotdash"

lty.predict.region

The prediction region line type. Default to "dotted"

region_level

The credible (prediction) level required for the credible (prediction) region.

trunc_low

Lower limit truncation entering the creation of the SROC curve. Default to 0.025. See details for further explanations.

trunc_up

Upper limit truncation entering the creation of the SROC curve. Default to 0.025. See details for further explanations.

Details

The first element of the list-object sub_rs corresponds to the number of different reference standards. The default value is 1. The number of additional elements will depend on the value of the first element. There must be as many additional element in sub_rs as there are different reference standards. Assuming the studies are labelled 1, ..., N, each of these additional elements must be a vector (possibly of length one) taking as their values the labelling of the corresponding studies sharing the same reference standard. For example, if we have 2 reference tests, the first one aplied over study 1-10 and the second one applied over study 11-15 then the sub_rs list-argument should be of length 3 with the following elements : 3, 1:10, 11:15

If the argument tv is equal to NULL, the function assumes the data are coming from a real life example. Otherwise, it assumes the data are coming from a simulated dataset in which case the user must provide the “true parameters” that were used to simulate the data for the within-study and between-study parameters and for the reference standards through the tv argument. The within-study parameters must be a matrix-like object with each column being true values for alpha_i, theta_i, sensitivity of test under evaluation S1_i, specificity of test under evaluation C1_i and prevalence pi_i. The between-study parameters must be a vector of the following true values : THETA, sigma_theta, LAMBDA, sigma_alpha and beta. The reference stadard initial values must be a 2 X sub_rs[[1]] matrix-like object. The first row must be the true values of the sensitivity of the reference standard, while the second row must correspond to true values of the specificity of the reference standard. The ordering described above in the within-study, between-study and reference standard true parameters must be preserved otherwise an error message will be displayed.

If the argument print_plot is equal to TRUE, the function will create and save in the summary.path working directory 3 type of plots to help the user judge if the descriptive statistics are reliable. First, a trace plot for each parameter will be created to help evaluate whether the Gibbs sampler has converged. Each trace plot is a scatter plot of the posterior sample of a single parameter vs the iteration number of the Gibbs sampler. Second, a density plot for each parameter will also be created. It plots a smoothed posterior kernel density estimate for each parameter. Finally, a SROC curve plot will also be created by the function. It summarizes the performance of diagnostic tests by plotting the relationship between the true positive rate and the false positive rate of the tests, as the threshold used to distinguish disease cases from noncases varies.

The SROC curve plots Sensitivity () vs Specificity () at the posterior mean values of LAMBDA and beta as the value of T varies over the posterior sample of THETA. The trunc_low and trunc_up arguments truncate the posterior sample of THETA by (trunc_low)*100

For more help on this function, see the tutorial pdf file available at http://www.nandinidendukuri.com/filesonjoomlasite/HSROC_R_Tutorial.pdf

Value

A list of : Point estimates and 95\% highest posterior density (HPD) intervals of the between-study parameters, within-study parameters and reference standard parameters. All estimates are obtained via a Gibbs sampler process.

It also prints in the summary.path working directory a text file with a more complete summary of the results listed above (i.e. the results also include stadard errors and MC errors plus the posterior predictive value for the sensitivity and specificity of a new study that has not yet taken place).

References

Dendukuri, N., Schiller, I., Joseph, L., and Pai, M. (2012) Bayesian meta-analysis of the accuracy of a test for tuberculosis pleuritis in the absence of a gold-standard reference. Biometrics. doi:10.1111/j. 1541-0420.2012.01773.x

Rutter, C. M., and Gatsonis, C. A. (2001) A hierarchical regression approach to meta-analysis of diagnostic accuracy evaluations. Statistics in Medicine, 20(19):2865-2884.

Scheidler J, Hricak H, Yu KK, Subak L, Segal MR. Radiological evaluation of lymph node metastases in patients with cervical cancer : a meta-analysis. Journal of the American Medical Association 1997 ; 278(13):1096-1101.

Pai, M. et al. (2004) Nucleic acid amplification in the diagnosis of tuberculous pleuritis: a systematic review and meta-analysis. BMC Infect Dis 2004, 4:6.

Reitsma, J. B., Glas, A. S., Rutjes, A. W., Scholten, R. J., Bossuyt, P. M., and Zwinderman, A. H. (2005) Bivariate analysis of sensitivity and specificity produces informative summary measures in diagnositic reviews. Journal of Clinical Epidemiology ; 58(10), 982-990.

Harbord, R. M., Deeks, J. J., Egger, M., Whiting, P., and Sterne, J. A. (2006) Unification of models for meta-analysis of diagnostic accuracy studies. Biostatistics ; 8(2), 239-251.

Examples

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
#REAL-LIFE EXAMPLES
#
#PLEASE NOTE THAT BOTH EXAMPLES BELOW ASSUME THE EXISTANCE OF POSTERIOR 
#SAMPLES OBTAINED FROM THE 'HSROC' FUNCTION.  
#IN OTHER WORDS' ONE NEEDS TO RUN THE 'HSROC' FUNCTION BEFORE USING THE 
#'HSROCSUmmary' FUNCTION.
#
#Example 1
#To get descriptive statistics and graphical summaries for the MRI data 
#(Scheidler et al. 1997) after dropping the first 5,000 iterations.

data(MRI)	#load the data
## Not run: 
HSROCSummary(data = MRI, burn_in=5000, print_plot=TRUE )

## End(Not run)


#Example 2
#To get descriptive statistics and graphical summaries for the In.house 
#data (Pai et al. 2004) coming from 2 different chains.  
#We provide the path to each chain's directory, i.e. the directory where 
#all files created during the Gibbs sampler process are stored for 
#each chain.  Let's assume there are two fictional directoies 
#chain_path = list("C:/path_to_chain_1", "C:/path_to_chain_2").
#Let's assume we drop the first 5,000 iterations and we use a thinning 
#interval of 10. 


data(In.house)	#load the data
## Not run: 
HSROCSum1<- HSROCSummary(data = In.house, burn_in=5000, Thin=10, 
		chain=chain_path, print_plot=TRUE,
		sub_rs=REFSTD  )


#For more help on this function, see the tutorial pdf file availalbe
#at http://www.nandinidendukuri.com/filesonjoomlasite/HSROC_R_Tutorial.pdf


## End(Not run)

HSROC documentation built on Sept. 19, 2019, 9:05 a.m.

Related to HSROCSummary in HSROC...