epi.ssdxtest: Sample size to validate a diagnostic test in the absence of a...
In epiR: Tools for the Analysis of Epidemiological Data

epi.ssdxtest

R Documentation

Sample size to validate a diagnostic test in the absence of a gold standard

Description

Sample size to validate a diagnostic test in the absence of a gold standard.

Usage

epi.ssdxtest(pi, se, sp, epsilon.api, epsilon.ase, epsilon.asp, epsilon.asesp, 
    r = 1, nfractional = FALSE, verbose = FALSE, conf.level = 0.95)

Arguments

`pi`	vector of length two, the expected prevalence of the outcome of interest in the two populations (0 to 1), respectively.
`se`	vector of length two, the expected diagnostic sensitivity of the first and second test (0 to 1), respectively.
`sp`	vector of length two, the expected diagnostic specificity of the first and second test (0 to 1), respectively.
`epsilon.api`	vector of length two, the absolute error for the prevalence of the outcome of interest in the first and second study populations.
`epsilon.ase`	vector of length two, the absolute error for the sensitivity estimate of the first and second test.
`epsilon.asp`	vector of length two, the absolute error for the specificity estimate of the first and second test.
`epsilon.asesp`	vector of length two, the absolute error for the difference in the two sensitivity and specificity estimates.
`r`	scalar, the required number in the second population divided by the number in the first population.
`nfractional`	logical, return fractional sample size.
`verbose`	logical, return sample size estimates for `se`, `sp`, and `pi`?
`conf.level`	scalar, defining the level of confidence in the computed result.

Details

Hui and Walter (1980) describe an approach for estimating the sensitivity and specificity of a diagnostic test in the absence of a gold standard. Their method involves testing individuals from two populations with two conditionally independent diagnostic tests (neither of which is a gold standard). With such data, all six parameters of interest (two sensitivities, two specificities and two prevalences) can be estimated since there are six degrees of freedom available. The methodology in this function follows the sample size calculation methods described by Georgiadis et al. (2005).

In their paper Georgiadis et al. (2005) parameterise the uncertainty in the prevalence, sensitivity and specificity estimates in terms of the width of the confidence interval. For consistency with the other sample size calculation functions in epiR the amount of uncertainty in the prevalence, sensitivity and specificity is parameterised in absolute terms. Using this approach, if we set se = c(0.80,0.90) and epsilon.ase = c(0.05,0.10) the number of subjects to return an estimate of se1 that is between 0.75 and 0.85 and se2 that is between 0.80 and 1.0 will be returned.

Value

When verbose = TRUE a data frame listing the required sample size to estimate:

`p1`	the prevalence of the outcome of interest in population 1.
`p2`	the prevalence of the outcome of interest in population 2.
`se1`	the sensitivity of the first diagnostic test.
`se2`	the sensitivity of the second diagnostic test.
`sp1`	the specificity of the first diagnostic test.
`sp2`	the specificity of the second diagnostic test.
`se1.se2`	the difference in the sensitivities of the two diagnostic tests.
`sp1.sp2`	the difference in the specificities of the two diagnostic tests.

When verbose = FALSE a data frame listing the maximum of the sample size estimates listed when verbose = TRUE.

References

Georgiadis M, Johnson W, Gardner I (2005) Sample size determination for estimation of the accuracy of two conditionally independent tests in the absence of a gold standard. Preventive Veterinary Medicine 71, 1 - 10. DOI: 10.1016/j.prevetmed.2005.04.004.

Hui SL, Walter SD (1980) Estimating the error rates of diagnostic tests. Biometrics 36, 167 - 171.

Nielsen SS, Gronbaek C, Agger JF, Houe H (2002) Maximum-likelihood estimation of sensitivity and specificity of ELISAs and faecal culture for diagnosis of paratuberculosis. Preventive Veterinary Medicine 53, 191 - 204. DOI: 10.1016/s0167-5877(01)00280-x.

Examples

## EXAMPLE 1 (from Georgiadis et al. 2005, pp. 5):
## Nielsen et al. (2002) present data from the evaluation of a milk 
## antibody ELISA and faecal culture for the diagnosis of Mycobacterium avium 
## subsp. paratuberculosis infection in cattle. Because the ELISA detects 
## antibodies and culture is based on isolation of the bacterium in faeces 
## we can reasonably assume that the two tests are conditionally independent.

## How many cattle need to be sampled if we wanted to be 95% certain that
## our estimate of diagnostic sensitivity and specificity of the two tests 
## is within 0.05 of the true population value assuming the number sampled 
## in the second population divided by the number sampled in the first 
## population is 0.817? The prevalence of Mycobacterium avium subsp. 
## paratuberculosis is thought to be 0.093 and 0.204, respectively. Assume 
## the sensitivity of the the ELISA and faecal culture is 0.349 and 0.534, 
## respectively. Assume the specificity of the ELISA and faecal culture is 
## 0.995 and 0.894, respectively.

epi.ssdxtest(pi = c(0.093,0.204), se = c(0.349,0.534), sp = c(0.995,0.894), 
   epsilon.api = c(0.05,0.05), epsilon.ase = c(0.05,0.05), 
   epsilon.asp = c(0.05,0.05), epsilon.asesp = c(0.05,0.05), 
    r = 0.817, nfractional = FALSE, verbose = FALSE, conf.level = 0.95)

## A total of 63,887 cattle need to be sampled (35,161 from population 1 and
## 28,726 from population 2) to meet the requirements of the study.

epiR documentation built on Dec. 15, 2025, 9:06 a.m.