Lifetime Data Analysis

, Volume 19, Issue 2, pp 202–218

Understanding increments in model performance metrics

Authors

    • Department of BiostatisticsHarvard Clinical Research Institute, Boston University
  • Ralph B. D’Agostino
    • Department of Mathematics and StatisticsHarvard Clinical Research Institute, Boston University
  • Joseph M. Massaro
    • Department of BiostatisticsHarvard Clinical Research Institute, Boston University
Article

DOI: 10.1007/s10985-012-9238-0

Cite this article as:
Pencina, M.J., D’Agostino, R.B. & Massaro, J.M. Lifetime Data Anal (2013) 19: 202. doi:10.1007/s10985-012-9238-0

Abstract

The area under the receiver operating characteristic curve (AUC) is the most commonly reported measure of discrimination for prediction models with binary outcomes. However, recently it has been criticized for its inability to increase when important risk factors are added to a baseline model with good discrimination. This has led to the claim that the reliance on the AUC as a measure of discrimination may miss important improvements in clinical performance of risk prediction rules derived from a baseline model. In this paper we investigate this claim by relating the AUC to measures of clinical performance based on sensitivity and specificity under the assumption of multivariate normality. The behavior of the AUC is contrasted with that of discrimination slope. We show that unless rules with very good specificity are desired, the change in the AUC does an adequate job as a predictor of the change in measures of clinical performance. However, stronger or more numerous predictors are needed to achieve the same increment in the AUC for baseline models with good versus poor discrimination. When excellent specificity is desired, our results suggest that the discrimination slope might be a better measure of model improvement than AUC. The theoretical results are illustrated using a Framingham Heart Study example of a model for predicting the 10-year incidence of atrial fibrillation.

Keywords

Risk predictionDiscriminationAUCIDIYouden indexrelative utility

Copyright information

© Springer Science+Business Media New York 2012