Abstract
Walter et al. (2001) developed a model to identify older adults at high risk of death in the first year after hospitalization, using data collected for 2,922 patients discharged from two hospitals in Ohio. Potential predictors included demographics, activities of daily living (ADLs), the APACHE-II illness-severity score, and information about the index hospitalization. A “backward” selection procedure with a restrictive inclusion criterion was used to choose a multipredictor model, using data from one of the two hospitals. The model was then validated using data from the other hospital. The goal was to select a model that best predicted future events, with a view toward identifying patients in need of more vigorous monitoring and intervention.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
5.7 Further Notes and References
Breiman, L. (2001). Statistical modeling: the two cultures. Statistical Science, 16(3), 199–231.
Buckland, S. T., Burnham, K. P. and Augustin, N. H. (1997). Model selection: an integral part of inference. Biometrics, 53, 603–618.
Chatfield, C. (1995). Model uncertainty, data mining and statistical inference. Journal of the Royal Statistical Society, Series A, 158, 419–466.
Greenland, S. and Brumback, B. (2002). An overview of relations among causal modeling methods. International Journal of Epidemiology, 31(5), 1030–1037.
Greenland, S., Pearl, J. and Robins, J. M. (1999). Causal diagrams for epidemiologic research. Epidemiology, 10, 37–48.
Harrell, F., Lee, K. and Mark, D. (1996). Multivariable prognostic models: issues in developing models, evaluation assumptions, and adequacy, and measuring and reducing errors. Statistics in Medicine, 15, 361–387.
Hoerl, A. E. and Kennard, R. W. (1970). Ridge regression: biased estimates for nonorthogonal problems. Technometrics, 12, 55–67.
Hosmer, D. W. and Lemeshow, S. (2000). Applied Logistic Regression. John Wiley & Sons, New York, Chichester.
Le Cessie, S. and Van Houwelingen, J. C. (1992). Ridge estimators in logistic regression. Applied Statistics, 41, 191–201.
Jewell, N. P. (2004). Statistics for Epidemiology. Chapman & Hall/CRC, Boca Raton, FL.
Linhart, H. and Zucchini, W. (1986). Model Selection. John Wiley & Sons, New York, Chichester.
Miller, A. J. (1990). Subset Selection in Regression. Chapman & Hall Ltd, London, New York.
Pearl, J. (1995). Causal diagrams for empirical research. Biometrika, 82, 669–688.
Rothman, K. J. and Greenland, S. (1998). Modern Epidemiology. Lippincott Williams & Wilkins Publishers, Philadelphi, PA, 2nd ed.
Steyerberg, E. W., Eijkemans, M. J. C., Harrell, F. E. and Habbema, J. D. F. (2000). Prognostic modelling with logistic regression analysis: a comparison of selection and estimation methods in small datasets. Statistics in Medicine, 19, 1059–1079.
Tibshirani, R. (1997). The lasso method for variable selection in the Cox model. Statistics in Medicine, 16, 385–395.
Verweij, P. J. M. and Van Houwelingen, H. C. (1994). Penalized likelihood in Cox regression. Statistics in Medicine, 13, 2427–2436.
Weisberg, S. (1985). Applied Linear Regression. John Wiley & Sons, New York, Chichester.
Rights and permissions
Copyright information
© 2005 Springer Science+Business Media, Inc.
About this chapter
Cite this chapter
(2005). Predictor Selection. In: Regression Methods in Biostatistics. Statistics for Biology and Health. Springer, New York, NY. https://doi.org/10.1007/0-387-27255-0_5
Download citation
DOI: https://doi.org/10.1007/0-387-27255-0_5
Publisher Name: Springer, New York, NY
Print ISBN: 978-0-387-20275-4
Online ISBN: 978-0-387-27255-9
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)