Skip to main content

Performance Improvement in Hot Rolling Process with Novel Neural Architectural Search

  • Chapter
  • First Online:
Machine Learning in Industry

Abstract

State-of-the-art infrastructure, excellent computational facilities and ubiquitous connectivity across the industries have led to the generation of large amounts of heterogeneous process data. At the same time, the applicability of machine learning and artificial intelligence is witnessing a significant rise in academics and engineering, leading to the development of a large number of resources and tools. However, the number of research works and applications aimed at implementing data sciences to problems in process industries is far less. The proposed work aims to fill the niche by proposing Artificial Neural Network (ANN)-based surrogate construction using extremely nonlinear, static, high dimensional (32 features) noisy data sampled irregularly from inlet and outlet streams of hot rolling process in iron and steel making industry. Though ANNs are used extensively for modelling nonlinear data, literature survey has shown that their modelling is governed by heuristics thus making them inefficient for use in process industries. This aspect is of high relevance in contemporary times as hyper-parameter optimization, automated machine learning and neural architecture search (NAS) constitute a major share of current research in data sciences. We propose a novel multi-objective evolutionary NAS algorithm to optimally design multi-layered feed-forward ANNs by balancing the aspects of parsimony and accuracy. The integer nonlinear programming problem of ANN design is solved using binary coded Non-Dominated Sorting Genetic Algorithm (NSGA-II). ANNs designed for the hot rolling process are found to demonstrate an accuracy of 0.98 (averaged on three outputs) measured in terms of correlation coefficient R2 on the test set. The successful construction of accurate and optimal ANNs provides a first-of-its-kind model for the hot rolling process in the iron and steel making industry. The proposed method can minimize the chances of over-fitting in ANNs and provides a generic method applicable to any kind of data/model from process industries.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 149.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 199.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 199.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Mittal, P., Mohanty, I., Malik, A., & Mitra, K. (2020). Many-objective optimization of hot-rolling process of steel: A hybrid approach. Materials and Manufacturing Processes, 35(6), 668–676.

    Article  Google Scholar 

  2. Miriyala, S. S., Mittal, P., Majumdar, S., & Mitra, K. (2016). Comparative study of surrogate approaches while optimizing computationally expensive reaction networks. Chemical Engineering Science, 140, 44–61.

    Article  Google Scholar 

  3. Van Der Aalst, W. (2016). Data science in action. In Process mining (pp. 3–23). Springer, Berlin, Heidelberg.

    Google Scholar 

  4. Venkatasubramanian, V. (2019). The promise of artificial intelligence in chemical engineering: Is it here, finally. AIChE Journal, 65(2), 466–478.

    Article  Google Scholar 

  5. Nowakowski, P., Szwarc, K., & Boryczka, U. (2018). Vehicle route planning in e-waste mobile collection on demand supported by artificial intelligence algorithms. Transportation Research Part D: Transport and Environment, 63, 1–22.

    Article  Google Scholar 

  6. Ahmad, F., Abbasi, A., Li, J., Dobolyi, D. G., Netemeyer, R. G., Clifford, G. D., et al. (2020). A Deep learning architecture for psychometric natural language processing. ACM Transactions on Information Systems (TOIS), 38(1), 1–29.

    Article  Google Scholar 

  7. Moen, E., Bannon, D., Kudo, T., Graf, W., Covert, M., & Van Valen, D. (2019). Deep learning for cellular image analysis. Nature methods, 1–14.

    Google Scholar 

  8. Ardabili, S., Mosavi, A., Dehghani, M., & Várkonyi-Kóczy, A. R. (2019, September). Deep learning and machine learning in hydrological processes climate change and earth systems a systematic review. In International Conference on Global Research and Education (pp. 52–62). Springer, Cham.

    Google Scholar 

  9. Bauer, A., Bostrom, A. G., Ball, J., Applegate, C., Cheng, T., Laycock, S., et al. (2019). Combining computer vision and deep learning to enable ultra-scale aerial phenotyping and precision agriculture: A case study of lettuce production. Horticulture research, 6(1), 1–12.

    Article  Google Scholar 

  10. Sajeev, S., Maeder, A., Champion, S., Beleigoli, A., Ton, C., Kong, X., & Shu, M. (2019). Deep Learning to improve heart disease risk prediction. In Machine Learning and Medical Engineering for Cardiovascular Health and Intravascular Imaging and Computer Assisted Stenting (pp. 96–103). Springer, Cham.

    Google Scholar 

  11. Schneider, P., Walters, W. P., Plowright, A. T., Sieroka, N., Listgarten, J., Goodnow, R. A., Fisher, J., Jansen, J.M., Duca, J.S., Rush, T.S. & Zentgraf, M. (2019). Rethinking drug design in the artificial intelligence era. Nature Reviews Drug Discovery, 1–12.

    Google Scholar 

  12. Justesen, N., Bontrager, P., Togelius, J., & Risi, S. (2019). Deep learning for video game playing. IEEE Transactions on Games.

    Google Scholar 

  13. Naik, N., & Mohan, B. R. (2019, May). Stock Price Movements Classification Using Machine and Deep Learning Techniques-The Case Study of Indian Stock Market. In International Conference on Engineering Applications of Neural Networks (pp. 445–452). Springer, Cham.

    Google Scholar 

  14. Lu, C. Y., Suhartanto, D., Gunawan, A. I., & Chen, B. T. (2020). Customer satisfaction toward online purchasing services: Evidence from small & medium restaurants. International Journal of Applied Business Research, 2(01), 1–14.

    Article  Google Scholar 

  15. Piccione, P. M. (2019). Realistic interplays between data science and chemical engineering in the first quarter of the 21st century: Facts and a vision. Chemical Engineering Research and Design, 147, 668–675.

    Article  Google Scholar 

  16. Beck, D. A., Carothers, J. M., Subramanian, V. R., & Pfaendtner, J. (2016). Data science: Accelerating innovation and discovery in chemical engineering. AIChE Journal, 62(5), 1402–1416.

    Article  Google Scholar 

  17. Qi, C., Fourie, A., Chen, Q., Tang, X., Zhang, Q., & Gao, R. (2018). Data-driven modelling of the flocculation process on mineral processing tailings treatment. Journal of Cleaner Production, 196, 505–516.

    Article  Google Scholar 

  18. Han, H., Zhu, S., Qiao, J., & Guo, M. (2018). Data-driven intelligent monitoring system for key variables in wastewater treatment process. Chinese Journal of Chemical Engineering, 26(10), 2093–2101.

    Article  Google Scholar 

  19. Almeshaiei, E., Al-Habaibeh, A., & Shakmak, B. (2020). Rapid evaluation of micro-scale photovoltaic solar energy systems using empirical methods combined with deep learning neural networks to support systems’ manufacturers. Journal of Cleaner Production, 244,.

    Article  Google Scholar 

  20. Wu, H., & Zhao, J. (2018). Deep convolutional neural network model based chemical process fault diagnosis. Computers & Chemical Engineering, 115, 185–197.

    Article  Google Scholar 

  21. Kitchin, J. R. (2018). Machine learning in catalysis. Nature Catalysis, 1(4), 230–232.

    Article  Google Scholar 

  22. Spellings, M., & Glotzer, S. C. (2018). Machine learning for crystal identification and discovery. AIChE Journal, 64(6), 2198–2206.

    Article  Google Scholar 

  23. del Rio-Chanona, E. A., Wagner, J. L., Ali, H., Fiorelli, F., Zhang, D., & Hellgardt, K. (2019). Deep learning-based surrogate modeling and optimization for microalgal biofuel production and photobioreactor design. AIChE Journal, 65(3), 915–923.

    Article  Google Scholar 

  24. Pantula, P. D., & Mitra, K. (2020). Towards efficient robust optimization using data based optimal segmentation of uncertain space. Reliability Engineering & System Safety, 197, 106821.

    Article  Google Scholar 

  25. Haghighatlari, M., & Hachmann, J. (2019). Advances of machine learning in molecular modeling and simulation. Current Opinion in Chemical Engineering, 23, 51–57.

    Article  Google Scholar 

  26. Alizadeh, R., Allen, J. K., & Mistree, F. (2020). Managing computational complexity using surrogate models: A critical review. Research in Engineering Design, 31(3), 275–298.

    Article  Google Scholar 

  27. Miriyala, S. S., Subramanian, V. R., & Mitra, K. (2018). TRANSFORM-ANN for online optimization of complex industrial processes: Casting process as case study. European Journal of Operational Research, 264(1), 294–309.

    Article  MathSciNet  Google Scholar 

  28. Dua, V. (2010). A mixed-integer programming approach for optimal configuration of artificial neural networks. Chemical Engineering Research and Design, 88, 55–60.

    Article  Google Scholar 

  29. Carvalho, A. R., Ramos, F. M., & Chaves, A. A. (2011). Metaheuristics for the feedforward artificial neural network (ANN) architecture optimization problem. Neural Computing and Applications, 20(8), 1273–1284.

    Article  Google Scholar 

  30. Boithias, F., Mankibi, M., & Michel, P. (2012). Genetic algorithms based optimization of artificial neural network architecture for buildings’ indoor discomfort and energy consumption prediction. Building Simulation, 5(2), 95–106.

    Article  Google Scholar 

  31. Eason, J., & Cremaschi, S. (2014). Adaptive sequential sampling for surrogate model generation with artificial neural networks. Computers & Chemical Engineering, 68, 220–232.

    Article  Google Scholar 

  32. Jones, D. R. (2001). A taxonomy of global optimization methods based on response surfaces. Journal of Global Optimization, 21, 345–383.

    Article  MathSciNet  Google Scholar 

  33. Crombecq, K. (2011.) Surrogate modeling of computer experiments with sequential experimental design.

    Google Scholar 

  34. Davis, E., & Ierapetritou, M. (2010). A centroid-based sampling strategy for kriging global modeling and optimization. AIChE, 56, 220–240.

    Google Scholar 

  35. Gorissen, D., Couckuyt, I., Demeester, P., Dhaene, T., & Crombecq, T. (2010). A surrogate modeling and adaptive sampling toolbox for computer based design. The Journal of Machine Learning Research, 11, 2055–8722.

    Google Scholar 

  36. Müller, J., & Shoemaker, C. A. (2014). Influence of ensemble surrogate models and sampling strategy on the solution quality of algorithms for computationally expensive black-box global optimization problems. The Journal of Global, 60(2), 123–144.

    MathSciNet  MATH  Google Scholar 

  37. Chugh, T., Sindhya, K., Hakanen, J., & Miettinen, K. (2019). A survey on handling computationally expensive multiobjective optimization problems with evolutionary algorithms. Soft Computing, 23(9), 3137–3166.

    Article  Google Scholar 

  38. Miriyala, S. S., & Mitra, K. (2020). Multi-objective optimization of iron ore induration process using optimal neural networks. Materials and Manufacturing Processes, 35(5), 537–544.

    Article  Google Scholar 

  39. Zoph, B., & Le, Q. V. (2016). Neural architecture search with reinforcement learning. arXiv preprint arXiv:1611.01578.

  40. Elsken, T., Metzen, J. H., & Hutter, F. (2018). Neural architecture search: A survey. arXiv preprint arXiv:1808.05377.

  41. Hagan Martin, T., Demuth Howard, B., & Beale Mark, H. (2002). Neural network design. University of Colorado at Boulder.

    Google Scholar 

  42. Graves, A. (2012). Supervised sequence labelling. In Supervised sequence labelling with recurrent neural networks (pp. 5–13). Springer, Berlin, Heidelberg.

    Google Scholar 

  43. Bengio, Y., Goodfellow, I., & Courville, A. (2017). Deep learning (Vol. 1). Massachusetts, USA: MIT press.

    MATH  Google Scholar 

  44. Deb, K. (2001). Multi-objective Optimization using Evolutionary Algorithms. Chichester, UK: Wiley.

    MATH  Google Scholar 

  45. Arlot, S., & Celisse, A. (2010). A survey of cross-validation procedures for model selection. Statistics Surveys, 4, 40–79.

    Article  MathSciNet  Google Scholar 

  46. Akaike H, Information theory and an extension of the maximum likelihood principle. In: B. N. Petrov, F. Csáki (Eds.), Proceedings 2nd International Symposium on Inf. Theory, Tsahkadsor, Armenia, USSR, September 2:8 (1971) 267–281..

    Google Scholar 

  47. FB, P. (1978). Physical metallurgy and the design of steels. London: Applied Science Publishers Ltd.

    Google Scholar 

  48. Yada, H., Ruddle, G. E., & Crawley, A. F. (1987). Proc. Int. Symp. On Accelerated Cooling of Rolled Steel.

    Google Scholar 

  49. Mohanty, I., Chintha, A. R., & Kundu, S. (2018). Design optimization of microalloyed steels using thermodynamics principles and neural-network-based modeling. Metallurgical and Materials Transactions A, 49(6), 2405–2418.

    Article  Google Scholar 

  50. Mohanty, I., Sarkar, S., Jha, B., Das, S., & Kumar, R. (2014). Online mechanical property prediction system for hot rolled IF steel. Ironmaking and Steelmaking, 41(8), 618–627.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Kishalay Mitra .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Miriyala, S.S., Mohanty, I., Mitra, K. (2022). Performance Improvement in Hot Rolling Process with Novel Neural Architectural Search. In: Datta, S., Davim, J.P. (eds) Machine Learning in Industry. Management and Industrial Engineering. Springer, Cham. https://doi.org/10.1007/978-3-030-75847-9_9

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-75847-9_9

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-75846-2

  • Online ISBN: 978-3-030-75847-9

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics