Abstract
In clinical research gold standard tests for making a diagnosis are often laborious and sometimes impossible. Instead, simple and non-invasive tests are often used. A problem is that these tests have limited sensitivities and specificities. Levels around 50% means that no more information is given than flipping a coin. Levels substantially higher than 50% are commonly accepted as documented proof, that the diagnostic test is valid. However, sensitivity/specificity are estimates from experimental samples, and scientific rigor recommends that with experimental sampling amounts of uncertainty be included. Although the STARD (Standards for Reporting Diagnostic Accuracy) working party recently advised “to include in the estimates of diagnostic accuracy adequate measures of uncertainty, e.g., 95%-confidence intervals” (Bossuyt et al. 2003), so far uncertainty is virtually never assessed in sensitivity/specificity evaluations of clinical diagnostic tests. This is a pity, because calculated levels of uncertainty can be used for statistically testing whether the sensitivity/specificity is significantly larger than 50%. The present chapter uses examples to describe (1) simple methods for calculating standard errors and 95% confidence intervals, and (2) how they can be employed for statistical testing whether the new test is valid. We do hope that this chapter will stimulate clinical investigators to more often assess the uncertainty of the diagnostic tests they apply.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Anonymous Delta-method. http://en.wikidepia.org/wiki/Delta_method. Accessed 15 Dec 2011
Berger JO, Bernerdo J (1989) Estimating a product of normal means: Bayesian analysis with some priors. J Am Stat Assoc 84:200–207
Bossuyt PM, Reitsma JB, Bruns DE, Gatsonis CA, Glasziou PP, Irwig JG, Moher D, Rennie D, De Vet HC, for the STARD steering group (2003) Education and debate. Towards complete and accurate reporting of studies of diagnostic accuracy: the STARD initiative. BMJ 326:41–44
Author information
Authors and Affiliations
Appendices
Appendix 1
For the calculation of the standard errors (SEs) of sensitivity, specificity and overall-validity we make use of the Gaussian curve assumption in the data.
Definitive diagnosis (n) | |||
---|---|---|---|
Yes | No | ||
Result diagnostic test | Yes | a | b |
No | c | d |
Sensitivity = a/(a + c) = proportion true positives
Specificity = d/(b + d) = proportion true negatives
1-specificity = b/(b + d)
Proportion of patients with a definitive diagnosis = (a + c)/(a + b + c + d)
Overall validity = (a + d)/(a + b + c + d)
In order to make predictions from these estimates of validity their standard deviations/errors are required. The standard deviation/error (SD/SE) of a proportion can be calculated.
where p equals a/(a + c) for the sensitivity. Using the above equations the standard errors can be readily obtained.
Appendix 2
The equation of the SE of the overall-validity is less straightforward, but can be obtained using the Bayes’ rule (Berger and Bernerdo 1989) and the delta method (Anonymous 2011). The calculations are given for the purpose of completeness (Var = variance = square root of the standard error; prevalence = proportion of patients with a definitive diagnosis).
In order to calculate the standard error (SE), we make use of the equation (Var = variance, Cov = covariance)
If X = sensitivity × prevalence, and Y = specificity × (1 − prevalence), then the equations can be combined to obtain an equation for the variance of the overall-validity (sens = sensitivity, spec = specificity, prev = prevalence)
The variance of X + Y may according to the delta-method (Anonymous 2011) be approached from:
By combining the equations we will end up finding:
The delta-method describes the variance of natural logarithm (ln) (X) as Var (ln(x)) = Var(x)/x2. The approach is sufficiently accurate if the standard errors of prevalence, sensitivity and specificity are small, which is true if samples are not too small. We should add that the delta method is very helpful for the statistical assessment of complex functions like those of standard errors. Second derivatives of parabolas with values similar to those of the complex functions are used to find the best fit parabolas (second order polynomes). Parabolas are easy, and produce a good fit of such complex functions. This methodology has developed tremendously, and terms commonly used for it are the quadratic approximation, eigenvectors, and the delta-method.
Rights and permissions
Copyright information
© 2012 Springer Science+Business Media B.V.
About this chapter
Cite this chapter
Cleophas, T.J., Zwinderman, A.H. (2012). Uncertainty of Qualitative Diagnostic Tests. In: Statistics Applied to Clinical Studies. Springer, Dordrecht. https://doi.org/10.1007/978-94-007-2863-9_47
Download citation
DOI: https://doi.org/10.1007/978-94-007-2863-9_47
Published:
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-007-2862-2
Online ISBN: 978-94-007-2863-9
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)