Summmary
A major problem in predictive modelling is that we often have many candidate predictors available for the analysis, while the data set available for analysis is relatively small. A small sample size leads to problems as discussed in Chap. 5, such as limited power to test main effects of potential predictors, and too extreme predictions when predictions are based on the standard regression coefficients (overfitting). We discuss some procedures to increase the robustness and validity of a predictive model, including restriction of the number of candidate predictors, considering distributions of predictors, combining similar variables, and averaging the effects of similar variables. We provide a detailed description of a case study of modelling similar effects of aspects of family history for robust prediction of the presence of a genetic mutation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer Science+Business Media, LLC
About this chapter
Cite this chapter
Steyerberg, E. (2009). Restrictions on candidate predictors. In: Clinical Prediction Models. Statistics for Biology and Health. Springer, New York, NY. https://doi.org/10.1007/978-0-387-77244-8_10
Download citation
DOI: https://doi.org/10.1007/978-0-387-77244-8_10
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-0-387-77243-1
Online ISBN: 978-0-387-77244-8
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)