Skip to main content
Log in

Anomaly detection in three-axis CNC machines using LSTM networks and transfer learning

  • ORIGINAL ARTICLE
  • Published:
The International Journal of Advanced Manufacturing Technology Aims and scope Submit manuscript

Abstract

There is a growing interest in developing automated manufacturing technologies to achieve a fully autonomous factory. An integral part of these smart machines is a mechanism to automatically detect operational and process anomalies before they cause serious damage. The long-short-term memory (LSTM) network has shown considerable promise in the literature, with applications in the detection of tool wear and tool breakage to name a few. However, these methods require a significant amount of machine-specific training data to be successful, which makes these networks custom to a machine, requiring new networks and new data for each machine. Transfer learning is an approach where we use a network developed with a rich data set on one machine and re-train it with a smaller data set on a target machine. We have implemented this approach for chatter detection with a LSTM network, using sensor data and a rich data set from one machine, and then use a transfer learning methodology, similar sensors, and a smaller data set for the chatter detection algorithm on another machine. This allows for the transfer of knowledge from one machine to be applied to a similar machine, with some local optimization from transfer learning.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13

Similar content being viewed by others

References

  1. Altintas Y (2012) Manufacturing automation. Cambridge University Press. https://doi.org/10.1017/CBO9780511843723

    Article  Google Scholar 

  2. Ay M, Schwenzer M, Abell D, Bergs T (2021) Recurrent online and transfer learning of a CNC-machining center with support vector machines. In: IEEE International Symposium on Industrial Electronics, Institute of Electrical and Electronics Engineers Inc., vol 2021-June. https://doi.org/10.1109/ISIE45552.2021.9576328

  3. Bozinovski S (2020) Reminder of the first paper on transfer learning in neural networks, 1976. Informatica 44:291–302. https://doi.org/10.31449/inf.v44i3.2828

  4. Burkov A (2019) The hundred-page machine learning book. Andriy Burkov

  5. Deebak BD, Al-Turjman F (2021) Digital-twin assisted: fault diagnosis using deep transfer learning for machining tool condition. Int J Intell Syst. https://doi.org/10.1002/int.22493

    Article  Google Scholar 

  6. Erdoğan G (2019) Land selection criteria for lights out factory districts during the industry 4.0 process. J Urban Manag 8(3):377–385. https://doi.org/10.1016/J.JUM.2019.01.001

  7. Gers FA, Schmidhuber J, Cummins F (2000) Learning to forget: continual prediction with LSTM. Neural Comput 12(10):2451–2471. https://doi.org/10.1162/089976600300015015, https://direct.mit.edu/neco/article/12/10/2451-2471/6415

  8. Gretton A, Borgwardt KM, Rasch M, Schölkopf B, Smola AJ (2008) A Kernel method for the two-sample-problem. Neural Inf Process Syst. www.kyb.mpg.de/bs/people/arthur/mmd.htm. Accessed 8 May 2023

  9. Hao G, Kunpeng Z (2020) Pyramid LSTM auto-encoder for tool wear monitoring. In: 2020 IEEE 16th international conference on automation science and engineering (CASE). IEEE, Online Zoom Meeting

  10. Hochreiter S, Urgen Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780. http://direct.mit.edu/neco/article-pdf/9/8/1735/813796/neco.1997.9.8.1735.pdf. Accessed 30 Sept 2022

  11. IBM (2020) What are recurrent neural networks? https://www.ibm.com/cloud/learn/recurrent-neural-networks. Accessed 30 Sept 2022

  12. Inventables (2022) X-Carve Pro. https://www.inventables.com/presales/tech-specs. Accessed 20 Oct 2022

  13. Khirirat S, Feyzmahdavian HR, Johansson M (2017) Mini-batch gradient descent faster convergence under data sparsity. In: 2017 IEEE 56th Annual Conference on Decision and Control (CDC), IEEE, Melbourne, Australia. https://ieeexplore.ieee.org/document/8264077. Accessed 20 Oct 2022

  14. Kim YM, Shin SJ, Cho HW (2022) Predictive modeling for machining power based on multi-source transfer learning in metal cutting. Int J Precis Eng Manuf - Green Technol 9(1):107–125. https://doi.org/10.1007/s40684-021-00327-6

    Article  Google Scholar 

  15. Kounta CAKA, Arnaud L, Kamsu-Foguem B, Tangara F (2022). Review of AI-based methods for chatter detection in machining based on bibliometric analysis. https://doi.org/10.1007/s00170-022-10059-9

    Article  Google Scholar 

  16. Kuljanic E, Sortino M, Totis G (2008) Multisensor approaches for chatter detection in milling. J Sound Vib 312(4–5):672–693. https://doi.org/10.1016/j.jsv.2007.11.006

    Article  Google Scholar 

  17. Kuljanic E, Totis G, Sortino M (2009) Development of an intelligent multisensor chatter detection system in milling. Mech Syst Signal Process 23(5):1704–1718. https://doi.org/10.1016/j.ymssp.2009.01.003

    Article  Google Scholar 

  18. Kuo WF, Huang BM, Lee CH (2020) Development of virtual milling system using data fusion and transfer learning. In: Proceedings - 2020 International Conference on Pervasive Artificial Intelligence, ICPAI 2020, Institute of Electrical and Electronics Engineers Inc., pp 253–257. https://doi.org/10.1109/ICPAI51961.2020.00054

  19. Kvinevskiy I, Bedi S, Mann S (2020) Detecting machine chatter using audio data and machine learning. Int J Adv Manuf Technol 108(11–12):3707–3716. https://doi.org/10.1007/s00170-020-05571-9

    Article  Google Scholar 

  20. Li WD, Liang YC (2020) Deep transfer learning based diagnosis for machining process lifecycle. Procedia CIRP 90:642–647. https://doi.org/10.1016/J.PROCIR.2020.02.048

    Article  Google Scholar 

  21. Li J, Lu J, Chen C, Ma J, Liao X (2021) Tool wear state prediction based on feature-based transfer learning. Int J Adv Manuf Technol. https://doi.org/10.1007/s00170-021-06780-6/Published, https://doi.org/10.1007/s00170-021-06780-6

  22. Lindemann B, Maschler B, Sahlab N, Weyrich M (2021). A survey on anomaly detection for technical systems using LSTM networks. https://doi.org/10.1016/j.compind.2021.103498

    Article  Google Scholar 

  23. Malhotra P, Ramakrishnan A, Anand G, Vig L, Agarwal P, Shroff G (2016a) LSTM-based encoder-decoder for multi-sensor anomaly detection. 2016 Anomaly Detection Workshop. http://arxiv.org/abs/1607.00148

  24. Malhotra P, Ramakrishnan A, Anand G, Vig L, Agarwal P, Shroff G (2016b) LSTM-based encoder-decoder for multi-sensor anomaly detection. In: 2016 Anomaly Detection Workshop. https://doi.org/10.48550/arxiv.1607.00148, http://arxiv.org/abs/1607.00148v2

  25. Park D, Hoshi Y, Kemp CC (2018) A multimodal anomaly detector for robot-assisted feeding using an LSTM-based variational autoencoder. IEEE Robot Autom Lett 3(3):1544–1551. https://doi.org/10.1109/LRA.2018.2801475

  26. Postel M, Bugdayci B, Wegener K (2020) Ensemble transfer learning for refining stability predictions in milling using experimental stability states. Int J Adv Manuf Technol 107(9–10):4123–4139. https://doi.org/10.1007/s00170-020-05322-w

    Article  Google Scholar 

  27. Rahimi MH, Huynh HN, Altintas Y (2021) On-line chatter detection in milling with hybrid machine learning and physics-based model. CIRP J Manuf Sci Technol 35:25–40. https://doi.org/10.1016/j.cirpj.2021.05.006

    Article  Google Scholar 

  28. Rashid KM, Louis J (2019) Times-series data augmentation and deep learning for construction equipment activity recognition. Adv Eng Inform 42. https://doi.org/10.1016/J.AEI.2019.100944

  29. Serin G, Sener B, Ozbayoglu AM, Unver HO (2020) Review of tool condition monitoring in machining and opportunities for deep learning. Int J Adv Manuf Technol. https://doi.org/10.1007/s00170-020-05449-w/Published, https://doi.org/10.1007/s00170-020-05449-w

  30. Shi B, Attia H (2010). Current status and future direction in the numerical modeling and simulation of machining processes: a critical literature review. https://doi.org/10.1080/10910344.2010.503455

    Article  Google Scholar 

  31. Siegel B (2020) Industrial anomaly detection: a comparison of unsupervised neural network architectures. IEEE Sensors Lett 4(8). https://doi.org/10.1109/LSENS.2020.3007880

  32. Smagulova K, James AP (2019) A survey on LSTM memristive neural network architectures and applications. Eur Phys J Special Topics 228:2313–2324. https://doi.org/10.1140/epjst/e2019-900046-x

    Article  Google Scholar 

  33. Sun C, Ma M, Zhao Z, Tian S, Yan R, Chen X (2019) Deep transfer learning based on sparse autoencoder for remaining useful life prediction of tool in manufacturing. IEEE Trans Ind Inform 15(4):2416–2425. https://doi.org/10.1109/TII.2018.2881543

    Article  Google Scholar 

  34. Tan C, Sun F, Kong T, Zhang W, Yang C, Liu C (2018) A survey on deep transfer learning. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 11141 LNCS:270–279. https://doi.org/10.1007/978-3-030-01424-7_27/FIGURES/5, https://link.springer.com/chapter/10.1007/978-3-030-01424-7_27. Accessed 15 Oct 2022

  35. Unver HO, Sener B (2022) Exploring the potential of transfer learning for chatter detection. In: Procedia Computer Science, Elsevier B.V., vol 200, pp 151–159. https://doi.org/10.1016/j.procs.2022.01.214

  36. Wang J, Zou B, Liu M, Li Y, Ding H, Xue K (2021) Milling force prediction model based on transfer learning and neural network. J Intell Manuf 32(4):947–956. https://doi.org/10.1007/S10845-020-01595-W/FIGURES/6, https://link.springer.com/article/10.1007/s10845-020-01595-w

  37. Yang HC (2020) Roughness of milling process. https://doi.org/10.21227/rx49-xs81, https://ieee-dataport.org/open-access/roughness-milling-process. Accessed 15 Aug 2022

Download references

Acknowledgements

We would like to thank the co-op students, Eric Jessee, Yang Li, and Sarah Ahmed, who helped with the fabrication and data collection for this project. Your assistance was greatly appreciated.

Author information

Authors and Affiliations

Authors

Contributions

All authors contributed to the study conception and design. Material preparation and data collection were performed by Eric Jessee, Sarah Ahmed, Yang Li, and Eugene Li. The data analysis and the first draft of the manuscript were written by Eugene Li, and all authors commented on previous versions of the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Eugene Li.

Ethics declarations

Conflict of interest

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Sanjeev Bedi and William Melek contributed equally to this work.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, E., Bedi, S. & Melek, W. Anomaly detection in three-axis CNC machines using LSTM networks and transfer learning. Int J Adv Manuf Technol 127, 5185–5198 (2023). https://doi.org/10.1007/s00170-023-11617-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00170-023-11617-5

Keywords

Navigation