Skip to main content

Biomarker assessment in ROC curve analysis using the length of the curve as an index of diagnostic accuracy: the binormal model framework

Abstract

In receiver operating characteristic (ROC) curve analysis, the area under the curve (AUC) is undoubtedly the most widely used index of diagnostic accuracy for the assessment of the utility of a biomarker or for the comparison of competing biomarkers. Along with the AUC, the maximum of the Youden index, J, is often used both as an index of diagnostic accuracy and as a tool useful for the estimation of an optimal cutoff point that can be used for diagnostic purposes based on the biomarker under consideration. In this work, we study the utility of the length of the binormal model-based ROC curve (LoC) as an index of diagnostic accuracy for biomarker evaluation. Estimation procedures for LoC, described in this article, are based either on normality assumptions or on the same assumptions after a Box–Cox transformation to normality. Two simulation studies are considered. In the first, the estimation procedures for LoC are compared in terms of bias and root mean squared error, while in the second one, the performance of LoC is compared with approaches based on AUC and J, both for the case of the assessment of a single biomarker and for the comparison of two biomarkers, in a parametric framework. We provide an interpretation for the proposed index and illustrate with an application on biomarkers from a colorectal cancer study.

This is a preview of subscription content, access via your institution.

Fig. 1
Fig. 2
Fig. 3

References

  • Alonso, R., Nakas, C.T., Pardo, M.C.: A study of indices useful for the assessment of diagnostic markers in non-parametric ROC curve analysis. Commun. Stat. Simul. Comput. (2018). https://doi.org/10.1080/03610918.2018.1511806

    Article  Google Scholar 

  • Aoki, K., Misumi, J., Kimura, T., Zhao, W., Xie, T.: Evaluation of cutoff levels for screening of gastric cancer using serum pepsinogens and distributions of levels of serum pepsinogen I, II and of PG I/PG II ratios in a gastric cancer case-control study. J. Epidemiol. 7(3), 143–151 (1997)

    Article  Google Scholar 

  • Baker, S.G.: The central role of receiver operating characteristic (ROC) curves in evaluating tests for the early detection of cancer. J. Natl. Cancer Inst. 95, 511–515 (2003)

    Article  Google Scholar 

  • Bamber, D.: The area above the ordinal dominance graph and the area below the receiver operating characteristic graph. J. Math. Psychol. 12(4), 387–415 (1975)

    Article  MathSciNet  Google Scholar 

  • Bantis, L.E., Nakas, C.T., Reiser, B.: Construction of confidence intervals for the maximum of the Youden index and the corresponding cutoff point of a continuous biomarker. Biom. J. 61(1), 138–156 (2019)

    Article  MathSciNet  Google Scholar 

  • Bao, Y., Lu, J., Wang, C., Yang, M., Li, H., Zhang, X., Zhu, J., Lu, H., Jia, W., Xiang, K.: Optimal waist circumference cutoffs for abdominal obesity in Chinese. Atherosclerosis 201, 378–384 (2008)

    Article  Google Scholar 

  • Bloch, D.A.: Comparing two diagnostic tests against the same ”gold standard” in the same sample. Biometrics 53(1), 73–85 (1997)

    Article  Google Scholar 

  • Faraggi, D., Reiser, B.: Estimation of the area under the ROC curve. Stat. Med. 21, 3093–3106 (2002)

    Article  Google Scholar 

  • Fluss, R., Faraggi, D., Reiser, B.: Estimation of the Youden Index and its associate cutoff point. Biom. J. 47(4), 458–472 (2005)

    Article  MathSciNet  Google Scholar 

  • Hanley, J.A.: The use of the ’binormal’ model for parametric ROC analysis of quantitative diagnostic tests. Stat. Med. 15, 1575–1585 (1996)

    Article  Google Scholar 

  • Hanley, J., McNeil, B.: The meaning and use of the area under a receiver operating characteristic (ROC) curve. Diagn. Radiol. 143(1), 29–36 (1982)

    Google Scholar 

  • Krzanowski, W.J., Hand, D.J.: ROC Curves for Continuous Data. Chapman and Hall/CRC, Boca Raton (2009)

    Book  Google Scholar 

  • Krzanowski, W.J., Hand, D.J.: Testing the difference between two Kolmogorov–Smirnov values in the context of receiver operating characteristic curves. J. Appl. Stat. 38(3), 437–450 (2011)

    Article  MathSciNet  Google Scholar 

  • Lee, W.C., Hsiao, C.K.: Alternative summary indices for the receiver operating characteristic curve. Epidemiology 7, 605–611 (1996)

    Article  Google Scholar 

  • Leichtle, A.B., Nuoffer, J.M., Ceglarek, U., Kase, J., Conrad, T., Witzigmann, H., Thiery, J., Fiedler, G.M.: Serum amino acid profiles and their alterations in colorectal cancer. Metabolomics 8, 643–653 (2012)

    Article  Google Scholar 

  • Matoba, Y., Inoguchi, T., Nasu, S., Suzuki, S., Yanase, T., Nawata, H., Takayanagi, R.: Optimal cut points of waist circumference for the clinical diagnosis of metabolic syndrome in the Japanese population. Diabetes Care 31(3), 590–592 (2008)

    Article  Google Scholar 

  • McIntosh, M.W., Pepe, M.S.: Combining several screening tests: optimality of the risk score. Biometrics 58, 657–664 (2002)

    Article  MathSciNet  Google Scholar 

  • Molodianovitch, K., Faraggi, D., Reiser, B.: Comparing the areas under two correlated ROC curves: parametric and non-parametric approaches. Biom. J. 48, 745–757 (2006)

    Article  MathSciNet  Google Scholar 

  • Nakas, C.T.: Performance of the one-sample goodness-of-Fit PP-plot length test. Commun. Stat. Simul. Comput. 35, 1053–1059 (2007)

    Article  Google Scholar 

  • Nakas, C.T., Dalrymple-Alford, J.C., Anderson, T.J., Alonzo, T.A.: Generalization of Youden index for multiple-class classification problems applied to the assessment of externally validated cognition in Parkinson disease screening. Stat. Med. 32, 995–1003 (2013)

    Article  MathSciNet  Google Scholar 

  • Pardo, M.C., Franco-Pereira, A.M.: Non parametric ROC summary statistics. REVSTAT 15(4), 583–600 (2017)

    MathSciNet  MATH  Google Scholar 

  • Pepe, M.S.: The Statistical Evaluation of Medical Tests for Classification and Prediction. Oxford University Press, New York (2003)

    MATH  Google Scholar 

  • Pepe, M.S., Janes, H., Longton, G., Leisenring, W., Newcomb, P.: Limitations of the odds ratio in gauging the performance of a diagnostic, prognostic, or screening marker. Am. J. Epidemiol. 159(9), 882–890 (2003)

    Article  Google Scholar 

  • Schisterman, E.F., Faraggi, D., Reiser, B.: Adjusting the generalized ROC curve for covariates. Stat. Med. 23(21), 3319–3331 (2004)

    Article  Google Scholar 

  • Schisterman, E.F., Perkins, N.J., Liu, A., Bondell, H.: Optimal cut-point and its corresponding Youden Index to discriminate individuals using pooled blood samples. Epidemiology 16(1), 73–81 (2005)

    Article  Google Scholar 

  • Shan, G.: Improved confidence intervals for the Youden Index. PLoS One 10(7), e0127272 (2015)

    Article  Google Scholar 

  • Venkatraman, E.S., Begg, C.B.: A distribution-free procedure for comparing receiver operating characteristic curves from a paired experiment. Biometrika 83, 835–848 (1996)

    Article  MathSciNet  Google Scholar 

  • Youden, W.J.: Index for rating diagnostic tests. Cancer 3, 32–35 (1950)

    Article  Google Scholar 

  • Zhou, H., Qin, G.: Confidence intervals for the difference in paired Youden indices. Pharm. Stat. 12, 17–27 (2013)

    Article  Google Scholar 

  • Zhou, X.H., Obuchowski, N.A., McClish, D.K.: Statistical Methods in Diagnostic Medicine. Wiley, New York (2011)

    Book  Google Scholar 

  • Zou, K.H., Hall, W.J.: Two transformation models for estimating an ROC curve derived from continuous data. J. Appl. Stat. 27(5), 621–631 (2000)

    Article  Google Scholar 

Download references

Acknowledgements

The authors gratefully acknowledge the constructive comments of an associate editor and two anonymous referees, which led to great improvements in the manuscript. The authors also acknowledge support from projects MTM2017-89422-P and MTM2016-75351-R of the Spanish Ministry of Economy and Competitiveness. This work has received financial support from the Xunta de Galicia (Centro singular de investigación de Galicia accreditation 2016-2019) and the European Union (European Regional Development Fund—ERDF).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Alba M. Franco-Pereira.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix

Appendix

1.1 A. Derivation of Eq. (3)

Based on the standard formula for the length of a curve, for \(y=\hbox {ROC}(p)\) the arc length is given by:

$$\begin{aligned} \int _{0}^{1}\sqrt{1+[\hbox {ROC}'(p)]^2}dp \end{aligned}$$

Therefore, Eq. (3) is obtained differentiating the binormal ROC curve, \(\hbox {ROC}(p)=\Phi (a+b\Phi ^{-1}(p))\), which is given by

$$\begin{aligned} \hbox {ROC}'(p)= & {} \phi [a + b \Phi ^{-1}(p)]\times \frac{d}{dp}b \Phi ^{-1}(p) \\= & {} \frac{b\phi [a+b\Phi ^{-1}(p)]}{\phi [\Phi ^{-1}(p)]}=b\frac{e^{-\frac{1}{2}[a+b\Phi ^{-1}(p)]^2}}{e^{-\frac{1}{2}[\Phi ^{-1}(p)]^2}} \end{aligned}$$

where \(\phi\) is the density function of a standard normal, \(a =\left( \mu _{Y}-\mu _{X}\right) /\sigma _{Y}\) and \(b =\dfrac{\sigma _{X}}{\sigma _{Y}}\).

The exponential function is always greater than zero, and hence, the fraction is always greater than zero. By definition, a standard deviation is greater than zero; therefore, \(b=\sigma _1/\sigma _2 >0\). The product of two components that are always greater than zero must itself be greater than zero. So the slope of the binormal ROC curve is always positive for p in the interval (0, 1).

1.2 B. Estimation of L without the assumption of normality for the underlying distributions

For lognormal and gamma distributions, we obtain the ROC derivative from (5). However, for \(X^{-1/3}\sim N\left( \mu _{X},\sigma _{X}^{2}\right)\) and \(Y^{-1/3}\sim N\left( \mu _{Y},\sigma _{Y}^{2}\right)\), we have obtained the explicit formula:

Taking into account that

$$\begin{aligned} F_{X}(x)=\Phi \left( \frac{x^{-1/3}-\mu _{X}}{\sigma _{X}}\right) \end{aligned}$$

then

$$\begin{aligned} F_{X}^{-1}(1-t)=\left( \mu _{X}+\sigma _{X}\Phi ^{-1}\left( t\right) \right) ^{-3}. \end{aligned}$$

Substituting this last expression in (5), we have

$$\begin{aligned} \hbox {ROC}^{\prime }\left( t\right)= & {} \frac{f_{Y}\left( \left( \mu _{X}+\sigma _{X}\Phi ^{-1}\left( t\right) \right) ^{-3}\right) }{f_{X}\left( \left( \mu _{X}+\sigma _{X}\Phi ^{-1}\left( t\right) \right) ^{-3}\right) }= \\= & {} \frac{\phi \left( \dfrac{\left( \left( \mu _{X}+\sigma _{X}\Phi ^{-1}\left( t\right) \right) ^{-3}\right) ^{-1/3}-\mu _{Y}}{\sigma _{Y}} \right) }{3\sigma _{Y}\left( \left( \mu _{X}+\sigma _{X}\Phi ^{-1}\left( t\right) \right) ^{-3}\right) ^{4/3}}\\&\quad \frac{3\sigma _{X}\left( \left( \mu _{X}+\sigma _{X}\Phi ^{-1}\left( t\right) \right) ^{-3}\right) ^{4/3}}{\phi \left( \dfrac{\left( \left( \mu _{X}+\sigma _{X}\Phi ^{-1}\left( t\right) \right) ^{-3}\right) ^{-1/3}-\mu _{X}}{\sigma _{X}}\right) } \\= & {} b \frac{\phi \left( -a +b \Phi ^{-1}\left( t\right) \right) }{ \phi \left( \Phi ^{-1}\left( t\right) \right) }. \end{aligned}$$

1.3 B. R-code for the basic calculation of L

figure a

Rights and permissions

Reprints and Permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Franco-Pereira, A.M., Nakas, C.T. & Pardo, M.C. Biomarker assessment in ROC curve analysis using the length of the curve as an index of diagnostic accuracy: the binormal model framework. AStA Adv Stat Anal 104, 625–647 (2020). https://doi.org/10.1007/s10182-020-00371-8

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10182-020-00371-8

Keywords

  • Area under the ROC curve (AUC)
  • Length of the ROC curve (LoC)
  • Binormal ROC curve
  • Maximum of the Youden index (J)
  • Diagnostic likelihood ratio (DLR)