Prediction and Internal Statistical Cross Validation

Dinov, Ivo D.

doi:10.1007/978-3-319-72347-1_21

Abstract

Cross-validation is a statistical approach for validating predictive methods, classification models, and clustering techniques. It assesses the reliability and stability of the results of the corresponding statistical analyses (e.g., predictions, classifications, forecasts) using independent datasets. For prediction of trend, association, clustering, and classification, a model is usually trained on one dataset (training data) and subsequently tested on new data (testing or validation data). Internal statistical cross-validation uses iterative bootstrapping to define test datasets, evaluates the model's predictive performance, and assesses its ability to avoid overfitting. Overfitting occurs when a predictive or classification model describes random error, i.e., fits the noise components of the observations, instead of the actual underlying relationships and salient features in the data.
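
The train-then-test idea in the abstract can be illustrated with a minimal sketch. This is not the chapter's own code: it uses Python with NumPy, synthetic data (a linear signal plus Gaussian noise), and plain k-fold partitioning in place of the bootstrap resampling the abstract mentions. The helper names `kfold_indices` and `cv_mse` are hypothetical and introduced only for illustration.

```python
import numpy as np

def kfold_indices(n, k, rng):
    """Shuffle the indices 0..n-1 and split them into k nearly equal folds."""
    return np.array_split(rng.permutation(n), k)

def cv_mse(x, y, degree, k=5, seed=0):
    """Return (mean training MSE, mean held-out MSE) for a polynomial fit.

    Each fold is held out once; the model is fit on the remaining folds.
    """
    rng = np.random.default_rng(seed)
    folds = kfold_indices(len(x), k, rng)
    train_err, test_err = [], []
    for i, test_idx in enumerate(folds):
        train_idx = np.concatenate([f for j, f in enumerate(folds) if j != i])
        # Fit on the training folds only
        coef = np.polyfit(x[train_idx], y[train_idx], degree)
        # Error on the data the model has seen
        train_err.append(np.mean((np.polyval(coef, x[train_idx]) - y[train_idx]) ** 2))
        # Error on the held-out fold
        test_err.append(np.mean((np.polyval(coef, x[test_idx]) - y[test_idx]) ** 2))
    return float(np.mean(train_err)), float(np.mean(test_err))

# Synthetic data (an assumption for illustration): linear signal plus noise
data_rng = np.random.default_rng(42)
x = np.linspace(-1.0, 1.0, 30)
y = 2.0 * x + data_rng.normal(scale=0.3, size=x.size)

for degree in (1, 10):
    train_mse, heldout_mse = cv_mse(x, y, degree)
    print(f"degree={degree:2d}  train MSE={train_mse:.4f}  held-out MSE={heldout_mse:.4f}")
```

Because both degrees are evaluated on the same folds, the degree-10 fit cannot have a larger training error than the degree-1 fit. Comparing the held-out errors is what indicates whether the extra flexibility captured structure or only noise, which is the overfitting check the abstract describes.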

Author information

Authors and Affiliations

University of Michigan–Ann Arbor, Ann Arbor, Michigan, USA
Ivo D. Dinov

About this chapter

Cite this chapter

Dinov, I.D. (2018). Prediction and Internal Statistical Cross Validation. In: Data Science and Predictive Analytics. Springer, Cham. https://doi.org/10.1007/978-3-319-72347-1_21

DOI: https://doi.org/10.1007/978-3-319-72347-1_21
Published: 28 August 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-72346-4
Online ISBN: 978-3-319-72347-1
eBook Packages: Computer Science, Computer Science (R0)
