Abstract
The accuracy of fit of a mathematical predictive model is the degree to which the predicted values coincide with the observed outcome. When the outcome variable is dichotomous and predictions are stated as probabilities that an event will occur, models can be checked for good discrimination and calibration. In case of the multiple logistic regression model for binary outcomes (event, non-event), the area under the ROC (Receiver Operating Characteristic) curve is the most used measure of model discrimination. The area under the ROC curve is identical to the Mann-Whitney statistic. We consider shift models for the distributions of predicted probabilities for event and non-event. From the interval estimates of the shift parameter, we calculate the confidence intervals for the area under the ROC curve. Also, we present the development of a general description of an overall discrimination index C (overall C) which we can extend to a survival time model such as the the Cox regression model. The general theory of rank correlation is applied in developing the overall C. The overall C is a linear combination of three independent components: event vs. non-event, event vs. event and event vs. censored. By showing that these three components are asymptotically normally distributed, the overall C is shown to be asymptotically normally distributed. The expected value and the variance of the overall C are presented.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bamber, D. (1975). The area above the ordinal dominance graph and the area below the receiver operating graph, Journal of Mathematical Psychology, 12, 387–415.
Birnbaum, Z. W. and McCarthy, R. C. (1958). A distribution-free upper bounds for PrY < X, based on independent samples of X and Y, Annals of Mathematical Statistics, 29, 558–562.
Govindarajulu, Z. (1991). Distribution-free confidence bounds for P(X < Y), Methods of Information in Medicine, 30, 96–101.
Hanley, J. A. and McNeil, B. J. (1982). The meaning and use of the area under a receiver operating characteristic (ROC) curve, Radiology, 143, 29–36.
Hilgers, R. A. (1991). Distribution-free confidence bounds for ROC curves, Methods of Information in Medicine, 30, 96–101.
Kendall, M. G. (1970). Rank Correlation Methods, London: Griffin.
Lehmann, E. L. (1975). Nonparametrics (Statistical Methods based on Ranks), California: Holden-Day.
Linnet, K. (1987). Comparison of quantitative diagnostic tests: type I error, power, and sample size, Statistics in Medicine, 6, 147–158.
Schafter, H. (1994). Efficient confidence bounds for ROC curves, Statistics in Medicine, 13, 1551–1561.
Sen, P. K. (1967). A note on asymptotically distribution-free confidence bounds for PX < Y, based on two independent samples, Sankhyá, Series A, 29, 95–102
Ury, H. K. (1972). On distribution-free confidence bounds for PrY < X, Technometrics, 14, 577–581.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer Science+Business Media New York
About this chapter
Cite this chapter
Nam, BH., D’Agostino, R.B. (2002). Discrimination Index, the Area Under the ROC Curve. In: Huber-Carol, C., Balakrishnan, N., Nikulin, M.S., Mesbah, M. (eds) Goodness-of-Fit Tests and Model Validity. Statistics for Industry and Technology. Birkhäuser, Boston, MA. https://doi.org/10.1007/978-1-4612-0103-8_20
Download citation
DOI: https://doi.org/10.1007/978-1-4612-0103-8_20
Publisher Name: Birkhäuser, Boston, MA
Print ISBN: 978-1-4612-6613-6
Online ISBN: 978-1-4612-0103-8
eBook Packages: Springer Book Archive