Statistical Learning Process for the Reduction of Sample Collection Assuring a Desired Level of Confidence

González-Prida, Vicente; Zamora, Jesús; Crespo, Adolfo; Moreu, Pedro

doi:10.1007/978-981-15-3357-0_1

Vicente González-Prida^7,8,
Jesús Zamora⁷,
Adolfo Crespo⁸ &
…
Pedro Moreu⁸

Part of the book series: Algorithms for Intelligent Systems ((AIS))

1888 Accesses

Abstract

In the process of characterizing a given population collecting samples, there are machine learning applications today that provide a wide range of possibilities regarding, e.g., clustering and data mining topics. These possibilities consist of industrial and scientific application techniques that are adapted to each particular field for the successful achievement of results. As a fundamental element in statistical learning, this paper aims to understand in a simple way the use of the t-Student statistical distribution, clarifying the concepts of sampling error and convergence criterion based on an iterative process for the calculation of the optimal number of samples. With this reasoning and inference application of the t-Student distribution, this paper is intended to find the convenience of a procedure that can be used to discard or not sampling protocols, serving as a starting point till more reliable data can be available. In other words, regarding problem-solving and planning issues, and at the beginning from a preliminary situation where simplifications are made, it is intended here to estimate the distortions introduced by the measurements, so that according to different values of sampling error, a reasonable number of samples can be obtained. As a criterion of convergence of the algorithm for calculating the number of samples, the objective here will be to determine a minimum number of characterizations that will reduce costs and efforts, while adjusting to the desired confidence level considering the error of the measurements. With this purpose, this chapter begins introducing concepts from probabilistic models and methods, in order to propose after that a sampling mathematical protocol. Then, the new protocol is validated by simulation in some study cases. Finally, this chapter ends discussing possible future researches on this field and with some conclusions.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 139.00; Price excludes VAT (USA)

Softcover Book: USD 179.99; Price excludes VAT (USA)

Hardcover Book: USD 179.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

De Neufville R (2004) Uncertainty management for engineering systems planning and design, MIT Engineering Systems Monograph. http://esd.mit.edu/symposium/pdfs/monograph/uncertainty.pdf
Gonzalez-Prida V, Zamora J et al (2019) A risk indicator in asset management to optimize maintenance periods. In: WCEAM (World Congress on Engineering Asset Management), Stavanger, Norway, 24–26 Sep 2018
Google Scholar
Helton JC, Oberkampf W (eds) (2004) Alternative representations of epistemic uncertainty. Spec Issue Reliab Eng Syst Saf 85(1–3)
Google Scholar
de Rocquigny E, Devictor N, Tarantola S (2008) Uncertainty in industrial practice: a guide to quantitative uncertainty management. Wiley
Google Scholar
Price C, Walker M (2019) Improving the accessibility of foundation statistics for undergraduate business and management students. Studies in Higher Education. Taylor and Francis Online. https://doi.org/10.1080/03075079.2019.1628204
ASTM D4687—95(2006) Standard guide for general planning of waste sampling ASTM. Test methods for evaluating solid waste, physical/chemical methods. SW-846. EPA. Publication. USEPA
Google Scholar
Ramsey FP (2016) Truth and probability. In: Arló-Costa H, Hendricks V, van Benthem J (eds) Readings in formal epistemology. Springer Graduate Texts in Philosophy, vol 1. Springer, Cham
Google Scholar
Crespo A, González-Prida V, Gómez J (eds) (2018) Advanced maintenance modelling for asset management. Techniques and methods for complex industrial systems. Springer International Publishing. ISBN 978-3-319-58045-6
Google Scholar
Crespo Márquez A, Macchi M, Parlikad AJ (eds) (2019) Value based and intelligent asset. Mastering the asset management transformation in industrial plants and infrastructures. Springer International Publishing. ISBN 978-3-030-20703-8
Google Scholar
Aven T (2003) Foundations of risk analysis. Wiley, Chichester
Book Google Scholar
Helton JC, Cooke RM, McKay MD, Saltelli A (eds) (2006) Sensitivity analysis of model output: SAMO 2004. Spec Issue Reliab Eng Syst Saf 91(10–11)
Google Scholar
Nilsen T, Aven T (2003) Models and model uncertainty in the context of risk analysis. Reliab Eng Syst Saf 79(309–317)
Google Scholar
Gonzalez-Prida V, Zamora J (eds) (2019) Handbook of research on industrial advancement in scientific knowledge. IGI Global, Hershey, PA, pp 1–442. ISBN: 9781522571520
Google Scholar

Download references

Author information

Authors and Affiliations

UNED, Madrid, Spain
Vicente González-Prida & Jesús Zamora
University of Seville, Seville, Spain
Vicente González-Prida, Adolfo Crespo & Pedro Moreu

Authors

Vicente González-Prida
View author publications
You can also search for this author in PubMed Google Scholar
Jesús Zamora
View author publications
You can also search for this author in PubMed Google Scholar
Adolfo Crespo
View author publications
You can also search for this author in PubMed Google Scholar
Pedro Moreu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Vicente González-Prida .

Editor information

Editors and Affiliations

School of Computer Science and Engineering, Galgotias University, Greater Noida, Uttar Pradesh, India
Prashant Johri
Department of Computer Science and Engineering, Amity School of Engineering and Technology, Amity University Haryana, Gurugram (Manesar), Haryana, India
Jitendra Kumar Verma
Department of Biomedical Engineering, North-Eastern Hill University, Shillong, Meghalaya, India
Sudip Paul

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

González-Prida, V., Zamora, J., Crespo, A., Moreu, P. (2020). Statistical Learning Process for the Reduction of Sample Collection Assuring a Desired Level of Confidence. In: Johri, P., Verma, J., Paul, S. (eds) Applications of Machine Learning. Algorithms for Intelligent Systems. Springer, Singapore. https://doi.org/10.1007/978-981-15-3357-0_1

Download citation

DOI: https://doi.org/10.1007/978-981-15-3357-0_1
Published: 05 May 2020
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-3356-3
Online ISBN: 978-981-15-3357-0
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics