Enhancement of multilayer perceptron model training accuracy through the optimization of hyperparameters: a case study of the quality prediction of injection-molded parts

Ke, Kun-Cheng; Huang, Ming-Shyan

doi:10.1007/s00170-021-08109-9

ORIGINAL ARTICLE
Published: 26 September 2021

Volume 118, pages 2247–2263, (2022)
The International Journal of Advanced Manufacturing Technology

Abstract

Injection molding is widely used in the mass production of plastic parts and must meet requirements for efficiency and quality consistency. Machine learning can effectively predict the quality of injection-molded parts. However, the performance of machine learning models depends largely on the accuracy of the training. Hyperparameters such as the activation function, momentum, and learning rate are crucial to the accuracy and efficiency of model training. This research aims to analyze the influence of hyperparameters on testing accuracy, explore the corresponding optimal learning rate, and provide the optimal training model for predicting the quality of injection-molded parts. In this study, stochastic gradient descent (SGD) and stochastic gradient descent with momentum (SGDM) are used to optimize the artificial neural network model. Through optimization of these training hyperparameters, the testing accuracy for the width of the injection-molded product is improved. The experimental results indicate that, in the absence of momentum, all five activation functions can achieve a training accuracy of more than 90% with a learning rate of 0.1. Moreover, with SGD and a learning rate of 0.1, the Sigmoid activation function reaches a testing accuracy of 95.8%. Although momentum has the least influence on accuracy, it speeds up convergence with the Sigmoid function, reducing the number of required learning iterations by 82.4%. In summary, optimizing hyperparameter settings can improve the accuracy of model testing and markedly reduce training time.
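The difference between the SGD and SGDM update rules named in the abstract can be illustrated with a minimal sketch. It is not the authors' network or data: the quadratic loss, the `CURVATURES` values, the momentum value of 0.9, and all function names are assumptions chosen for illustration, and the exact gradient is used in place of a noisy mini-batch gradient so the run is reproducible. The reduction rate is computed here as the relative drop in iterations, which is an assumed reading of how such a figure is usually defined.

```python
"""Toy comparison of the SGD and SGDM update rules (illustrative only)."""

# Hypothetical ill-conditioned quadratic loss: f(w) = 0.5 * sum(c_i * w_i**2).
CURVATURES = [1.0, 0.01]


def loss(w):
    return 0.5 * sum(c * wi * wi for c, wi in zip(CURVATURES, w))


def gradient(w):
    # Exact gradient; a real SGD run would use a noisy mini-batch estimate.
    return [c * wi for c, wi in zip(CURVATURES, w)]


def sgd_step(w, grad, lr):
    # Plain SGD: move against the gradient, scaled by the learning rate.
    return [wi - lr * gi for wi, gi in zip(w, grad)]


def sgdm_step(w, velocity, grad, lr, momentum):
    # SGDM: keep a decaying sum of past steps and add it to the weights.
    new_velocity = [momentum * vi - lr * gi for vi, gi in zip(velocity, grad)]
    new_w = [wi + vi for wi, vi in zip(w, new_velocity)]
    return new_w, new_velocity


def iterations_to_converge(lr, momentum=0.0, tol=1e-6, max_iter=100_000):
    # Returns the first iteration at which the loss drops below tol.
    # With momentum=0.0 the SGDM step reduces to the plain SGD step.
    w = [1.0, 1.0]
    velocity = [0.0, 0.0]
    for step in range(1, max_iter + 1):
        w, velocity = sgdm_step(w, velocity, gradient(w), lr, momentum)
        if loss(w) < tol:
            return step
    return max_iter


def reduction_rate(n_baseline, n_improved):
    # Relative reduction in iterations, as a percentage.
    return 100.0 * (n_baseline - n_improved) / n_baseline


if __name__ == "__main__":
    n_sgd = iterations_to_converge(lr=0.1, momentum=0.0)
    n_sgdm = iterations_to_converge(lr=0.1, momentum=0.9)
    print(f"SGD iterations:  {n_sgd}")
    print(f"SGDM iterations: {n_sgdm}")
    print(f"Reduction rate:  {reduction_rate(n_sgd, n_sgdm):.1f}%")
```

On this toy problem the momentum term mainly helps along the low-curvature direction, where plain SGD at the same learning rate makes only small steps. The numbers it prints say nothing about the paper's reported 82.4% figure, which comes from the authors' own model and data.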

Data availability

The authors confirm that the data supporting the findings of this study are available within the article.

Acknowledgements

This research was supported in part by the Frontier Mould and Die Research and Development Center from The Featured Areas Research Center Program within the framework of the Higher Education Sprout Project by the Ministry of Education (MOE), Taiwan.

Author information

Authors and Affiliations

Department of Mechanical Engineering, YuanZe University, 135 Yuandong Road, Zhongli District, Taoyuan City, 320, Taiwan
Kun-Cheng Ke
Department of Mechatronics Engineering, National Kaohsiung University of Science and Technology, 1 University Road, Yanchao Dist, Kaohsiung City, 824, Taiwan
Ming-Shyan Huang

Contributions

K.-C. Ke and M.-S. Huang were responsible for deriving formulas. K.-C. Ke was responsible for simulation. K.-C. Ke and M.-S. Huang were involved in the discussion and significantly contributed to making the final draft of the article. All the authors read and approved the final manuscript.

Corresponding author

Correspondence to Ming-Shyan Huang.

Ethics declarations

Consent to participate

Not applicable.

Consent to publish

Not applicable.

Competing interests

The authors declare no competing interests.

Ethical approval

Not applicable.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article

Cite this article

Ke, KC., Huang, MS. Enhancement of multilayer perceptron model training accuracy through the optimization of hyperparameters: a case study of the quality prediction of injection-molded parts. Int J Adv Manuf Technol 118, 2247–2263 (2022). https://doi.org/10.1007/s00170-021-08109-9

Received: 06 July 2021
Accepted: 20 September 2021
Published: 26 September 2021
Issue Date: February 2022
DOI: https://doi.org/10.1007/s00170-021-08109-9
