Skip to main content

Advertisement

SpringerLink
Book cover

Machine Learning for Cyber Physical Systems pp 26–35Cite as

A Random Forest Based Classifier for Error Prediction of Highly Individualized Products

A Random Forest Based Classifier for Error Prediction of Highly Individualized Products

  • Gerd Gröner5 
  • Conference paper
  • Open Access
  • First Online: 18 December 2018
  • 9168 Accesses

  • 2 Citations

Part of the Technologien für die intelligente Automation book series (TIA,volume 9)

Abstract

This paper presents an application of a random forest based classifier that aims at recognizing flawed products in a highly automated production environment. Within the course of this paper, some data set and application features are highlighted that make the underlying classification problem rather complex and hinders the usage of machine learning algorithms straight out-of-the-box. The findings regarding these features and how to treat the concluded challenges are highlighted in a abstracted and generalized manner.

Keywords

  • random forest classifier
  • imbalanced data
  • complex treebased models
  • high peculiarity of data

Download conference paper PDF

References

  1. Henmi, T., Deng, M., Yoshinaga, S.: Early Detection of Plant Faults by Using Machine Learning. Int. Conf. on Advanced Mechatronic Systems (ICAMechS), 2016

    Google Scholar 

  2. Zidek, K., Maxim, V.: Diagnostics of Product Defects by Clustering and Machine Learning Classification Algorithm. Journal of Automation and Control, vol.3, 2015

    Google Scholar 

  3. Meshram, A., Haas, C.: Anomaly Detection in Industrial Networks using Machine Learning: A Roadmap. Machine Learning for Cyber Physical Systems: Selected papers from the International Conference ML4CPS 2016. Ed.: J. Beyerer, Springer, pp. 65–72, 2017

    Google Scholar 

  4. Gèron, A.: Hands-On Machine Learning with Scikit-Learn & TensorFLow. O’Reilly, 2017

    Google Scholar 

  5. Breiman, L.: Random Forests. Machine Learning, pp. 5–32, vol. 7, Kluwer Academic Publishers, 2001

    Google Scholar 

  6. Rashid, T., Neuronale Netze selbst programmieren. O’Reilly, 2017

    Google Scholar 

  7. Lemaître, G., Nogueira, F., Aridas, C.K.: Imbalanced-learn: A Python Toolbox to Tackle the Curse of Imbalanced Datasets in Machine Learning. Journal of Machine Learning Research, vol. 18, pp. 1-5, 2017

    Google Scholar 

  8. Batista, G.E.A.P.A., Prati, R.C., Monard, M.C.: A Study of the Behavior of Several Methods for Balancing Machine Learning Training Data. ACM SIGKDD, vol. 6 (1), pp. 20–29, 2004

    CrossRef  Google Scholar 

  9. Ahmad, M.W., Mourshed, M., Rezgui, Y.: Trees vs Neurons: Comparison between Random Forest and ANN for High-Resolution Prediction of Building Energy Consumption. Energy and Buildings, Elsevier, vol. 147, pp. 77–89, 2017

    Google Scholar 

  10. Zhang, Y., Guo, W., Ray, S.: On the Consistency of Feature Selection With Lasso for Non-linear Targets. Proc. of the 33rd Int. Conference on Machine Learning, vol. 48, pp. 183–191, 2016

    Google Scholar 

  11. Eitrich, T., Kless, A., Druska, C., Meyer, W., Grotendorst, J.: Classification of Highly Unbalanced CYP450 Data of Drugs Using Cost Sensitive Machine Learning Techniques. Journal of Chemical Information and Modeling, vol. 47 (1), pp. 92–103, 2007

    Google Scholar 

  12. Wang, S., Yao, X.: Multiclass Imbalance Problems: Analysis and Potential Solutions. Systems Man Cybernetics Part B - Journal IEEE Transactions on Cybernetics, vol. 42, pp. 1119–1130, 2012

    Google Scholar 

  13. Kubat, M., Matwin, S.: Addressing the Course of Imbalanced Training Sets: One- Sided Selection. Proc. of the 14th Int. Conference on Machine Learning, pp. 217–225, 1997

    Google Scholar 

  14. Wyner, A.J., Olson, M., Bleich, J., Mease, D.: Explaining the Success of AdaBoost and Random Forests as Interpolating Classifiers. Journal of Machine Learning Research, vol. 18, pp. 48:1–48:33, 2017

    Google Scholar 

  15. Friedman, J.: Greedy Function Approximation: A Gradient Boosting Machine. Annals of Statistics, pp. 1189–1232, 2001

    Google Scholar 

  16. Liaw, A., Wiener, M.: Classification and Regression by Randomforest. R news, vol. 2 (3), pp. 18–22, 2002

    Google Scholar 

  17. Wainberg, M., Alipanahi, B., Frey, B.,J.: Are Random Forests Truly the Best Classifiers?. Journal of Machine Learning Research, vol. 17, pp. 110:1–110:5, 2016

    Google Scholar 

  18. Fernández-Delgado, M., Cernadas, E., Barro, S., Amorim, D: Do we Need Hundreds of Classifiers to Solve RealWorld Classification Problems? Journal of Machine Learning Research, vol. 15, pp. 3133–3181, 2014

    Google Scholar 

  19. Ahmad, W. M., Mourshed, M., Rezgui, Y.: Trees vs Neurons: Comparison between random forest and ANN for high-resolution prediction of building energy consumption Journal on Energy and Buildings, vol. 147, pp. 77–89, 2017

    CrossRef  Google Scholar 

Download references

Author information

Authors and Affiliations

  1. Carl Zeiss Vision International GmbH, Aalen, Germany

    Gerd Gröner

Authors
  1. Gerd Gröner
    View author publications

    You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Gerd Gröner .

Editor information

Editors and Affiliations

  1. Institut für Optronik, Systemtechnik und Bildauswertung, Fraunhofer, Karlsruhe, Germany

    Prof. Dr. Jürgen Beyerer

  2. MRD, Fraunhofer Institute for Optronics, System Technologies and Image Exploitation IOSB, Karlsruhe, Germany

    Dr. Christian Kühnert

  3. inIT - Institut für industrielle Informationstechnik, Hochschule Ostwestfalen-Lippe, Lemgo, Germany

    Prof. Dr. Oliver Niggemann

Rights and permissions

Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence and indicate if changes were made.

The images or other third party material in this chapter are included in the chapter's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.

Reprints and Permissions

Copyright information

© 2019 The Author(s)

About this paper

Verify currency and authenticity via CrossMark

Cite this paper

Gröner, G. (2019). A Random Forest Based Classifier for Error Prediction of Highly Individualized Products. In: Beyerer, J., Kühnert, C., Niggemann, O. (eds) Machine Learning for Cyber Physical Systems. Technologien für die intelligente Automation, vol 9. Springer Vieweg, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-58485-9_4

Download citation

  • .RIS
  • .ENW
  • .BIB
  • DOI: https://doi.org/10.1007/978-3-662-58485-9_4

  • Published: 18 December 2018

  • Publisher Name: Springer Vieweg, Berlin, Heidelberg

  • Print ISBN: 978-3-662-58484-2

  • Online ISBN: 978-3-662-58485-9

  • eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Share this paper

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Over 10 million scientific documents at your fingertips

Switch Edition
  • Academic Edition
  • Corporate Edition
  • Home
  • Impressum
  • Legal information
  • Privacy statement
  • California Privacy Statement
  • How we use cookies
  • Manage cookies/Do not sell my data
  • Accessibility
  • FAQ
  • Contact us
  • Affiliate program

Not affiliated

Springer Nature

© 2023 Springer Nature Switzerland AG. Part of Springer Nature.