Restrictions on candidate predictors

  • E.W. Steyerberg
Part of the Statistics for Biology and Health book series (SBH)


A major problem in predictive modelling is that we often have many candidate predictors available for the analysis, while the data set available for analysis is relatively small. A small sample size leads to problems as discussed in Chap. 5, such as limited power to test main effects of potential predictors, and too extreme predictions when predictions are based on the standard regression coefficients (overfitting). We discuss some procedures to increase the robustness and validity of a predictive model, including restriction of the number of candidate predictors, considering distributions of predictors, combining similar variables, and averaging the effects of similar variables. We provide a detailed description of a case study of modelling similar effects of aspects of family history for robust prediction of the presence of a genetic mutation.


Endometrial Cancer Lynch Syndrome Outcome Relationship Subject Knowledge Mass Size 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Copyright information

© Springer Science+Business Media, LLC 2009

Authors and Affiliations

  • E.W. Steyerberg
    • 1
  1. 1.Department of Public HealthErasmus MCRotterdamThe Netherlands

Personalised recommendations