Skip to main content

Prediction and Internal Statistical Cross Validation

  • Chapter
  • First Online:
Data Science and Predictive Analytics

Abstract

Cross-validation is a statistical approach for validating predictive methods, classification models, and clustering techniques. It assesses the reliability and stability of the results of the corresponding statistical analyses (e.g., predictions, classifications, forecasts) based on independent datasets. For prediction of trend, association, clustering, and classification, a model is usually trained on one dataset (training data) and subsequently tested on new data (testing or validation data). Statistical internal cross-validation uses iterative bootstrapping to define test datasets, evaluates the model predictive performance, and assesses its power to avoid overfitting. Overfitting is the process of computing a predictive or classification model that describes random error, i.e., fits to the noise components of the observations, instead of the actual underlying relationships and salient features in the data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 49.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 64.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Ivo D. Dinov

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Dinov, I.D. (2018). Prediction and Internal Statistical Cross Validation. In: Data Science and Predictive Analytics. Springer, Cham. https://doi.org/10.1007/978-3-319-72347-1_21

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-72347-1_21

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-72346-4

  • Online ISBN: 978-3-319-72347-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics