Statistical Learning Process for the Reduction of Sample Collection Assuring a Desired Level of Confidence

  • Chapter
Applications of Machine Learning

Part of the book series: Algorithms for Intelligent Systems ((AIS))

Abstract

Machine learning applications now offer a wide range of techniques, such as clustering and data mining, for characterizing a population from collected samples. These techniques are adapted to each industrial or scientific field in order to achieve reliable results. As a fundamental element of statistical learning, this chapter explains in simple terms the use of Student's t-distribution, clarifying the concepts of sampling error and of a convergence criterion based on an iterative process for calculating the optimal number of samples. Using inference based on Student's t-distribution, the chapter assesses a procedure for deciding whether to accept or discard a sampling protocol, which can serve as a starting point until more reliable data become available. In other words, for problem-solving and planning purposes, and starting from a preliminary situation in which simplifications are made, the aim is to estimate the distortions introduced by the measurements so that a reasonable number of samples can be obtained for different values of sampling error. The convergence criterion of the algorithm is to determine the minimum number of characterizations, which reduces cost and effort while meeting the desired confidence level given the measurement error. To this end, the chapter first introduces concepts from probabilistic models and methods and then proposes a mathematical sampling protocol. The new protocol is then validated by simulation in several case studies. Finally, the chapter discusses possible future research in this field and presents some conclusions.
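
The iterative calculation described in the abstract can be illustrated with a minimal sketch. It assumes the standard textbook relation n >= (t * s / E)^2, where t is the two-sided Student's t quantile with n - 1 degrees of freedom, s is a standard deviation estimated from a pilot sample, and E is the tolerated sampling error. The chapter's own protocol is not visible in this preview, so this is not presented as the authors' method. All function and parameter names (`t_quantile`, `required_samples`, `pilot_std`, `max_error`) are hypothetical, and the quantile is computed numerically with the standard library only.

```python
import math


def t_pdf(x, df):
    """Density of Student's t-distribution with df degrees of freedom."""
    log_c = (math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
             - 0.5 * math.log(df * math.pi))
    return math.exp(log_c - (df + 1) / 2 * math.log1p(x * x / df))


def t_cdf(x, df, steps=1000):
    """CDF for x >= 0, using composite Simpson's rule on [0, x]."""
    if x == 0:
        return 0.5
    h = x / steps  # steps must be even for Simpson's rule
    total = t_pdf(0.0, df) + t_pdf(x, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    return 0.5 + total * h / 3


def t_quantile(p, df):
    """Quantile for 0.5 < p < 1, found by bisection on t_cdf."""
    lo, hi = 0.0, 1.0
    while t_cdf(hi, df) < p:  # expand the bracket until it contains p
        hi *= 2
    for _ in range(60):
        mid = (lo + hi) / 2
        if t_cdf(mid, df) < p:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


def required_samples(pilot_std, max_error, confidence=0.95, max_iter=50):
    """Iterate n -> ceil((t * s / E)^2) until n stops changing.

    Assumption: pilot_std (s) comes from a preliminary sample and is
    held fixed; max_error (E) is the tolerated half-width of the
    confidence interval, in the same units as the measurements.
    """
    if pilot_std <= 0 or max_error <= 0 or not 0 < confidence < 1:
        raise ValueError("invalid input")
    p = 1 - (1 - confidence) / 2  # two-sided interval
    n = prev = 2                  # smallest n with a defined t quantile
    for _ in range(max_iter):
        t = t_quantile(p, n - 1)
        n_new = max(2, math.ceil((t * pilot_std / max_error) ** 2))
        if n_new == n:            # convergence criterion: fixed point
            return n
        prev, n = n, n_new
    # If the iteration alternates between two values, keep the larger
    # (more conservative) one.
    return max(prev, n)


if __name__ == "__main__":
    print(required_samples(pilot_std=2.0, max_error=1.0, confidence=0.95))
```

With a pilot standard deviation of 2.0 and a tolerated error of 1.0 at 95% confidence, the iteration settles at 18 samples. Tightening `max_error` or raising `confidence` increases the result, which is the trade-off between sampling effort and confidence that the abstract describes.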

Author information

Corresponding author

Correspondence to Vicente González-Prida.

Copyright information

© 2020 Springer Nature Singapore Pte Ltd.

About this chapter

Cite this chapter

González-Prida, V., Zamora, J., Crespo, A., Moreu, P. (2020). Statistical Learning Process for the Reduction of Sample Collection Assuring a Desired Level of Confidence. In: Johri, P., Verma, J., Paul, S. (eds) Applications of Machine Learning. Algorithms for Intelligent Systems. Springer, Singapore. https://doi.org/10.1007/978-981-15-3357-0_1

  • DOI: https://doi.org/10.1007/978-981-15-3357-0_1

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-15-3356-3

  • Online ISBN: 978-981-15-3357-0

  • eBook Packages: Engineering, Engineering (R0)
