Abstract
The bearing is one of the key components in modern industrial equipment. In the past few years, many studies have been carried out on bearing diagnosis through data-driven methods. However, there are two practical problems. First, under actual working conditions, the lack of fault samples is a major factor that hinders the application of these methods in industrial environments. Second, there is a lack of full utilization of a priori knowledge in the current stage of methods using relational networks for fault diagnosis. It is manifested by the incompleteness of the relational network structure. To address these problems, we present a new diagnosis method based on few-shot learning, which is suitable for the environment where the data is scarce. In this method, we train the model with the data generated by the artificial damaged bearings instead of the data from the real bearing. We experimentally validate the performance improvement of the complete relational network structure. It is able to perform the few-shot learning task better. In addition, we also reduce the global feature discrepancy by introducing an attention mechanism to improve the performance of the model. And the impact of the number of layers of the attention mechanism on the model is also discussed in detail. In this paper, our model performs better under the same experimental conditions compared with other transfer learning models.
Similar content being viewed by others
References
A. H. Aljemely, J. Xuan, F. K. J. Jawad, O. AlAzzawi and A. S. Alhumaima, A novel unsupervised learning method for intelligent fault diagnosis of rolling element bearing based on deep functional auto-encoder, Journal of Mechanical Science and Technology, 34 (11) (2020) 4367–4381.
C. Lessmeier, J. K. Kimotho, D. Zimmer and W. Sextro, Condition monitoring of bearing damage in electromechanical drive systems by using motor current signals of electric motors: a benchmark data set for data-driven classifification, PHM Society European Conference, 3 (1) (2016) 5–8.
S. Riaz, H. Elahi, K. Javaid and T. Shahzad, Vibration feature extraction and analysis for fault diagnosis of rotating machinery-a literature survey, Asia Pacific Journal of Multidisciplinary Research, 5 (1) (2017) 103–110.
L. Liu, S. Wang, D. Liu and Y. Peng, Quantitative selection of sensor data based on improved permutation entropy for system remaining useful life prediction, Microelectronics Reliability, 75 (2017) 264–270.
H. Zhang, Q. Miao, X. Zhang and Z. Liu, An improved unscented particle filter approach for lithium-ion battery remaining useful life prediction, Microelectronics Reliability, 81 (2018) 288–298.
D.-S. Huang, Systematic Theory of Neural Networks For Pattern Recognitionm, Publishing House of Electronic Industry of China, Beijing, 201 (1996).
F.-F. Diego, M.-R. David, F.-R. Oscar and A.-B. Amparo, Automatic bearing fault diagnosis based on one-class ν-svm, Computers and Industrial Engineering, 64 (1) (2013) 357–365.
A. Khatir and M. A. Wahab, Fast simulations for solving fracture mechanics inverse problems using pod-rbf xiga andjaya algorithm, Engineering Fracture Mechanics, 205 (2019) 285–300.
S. Tiachacht, S. Khatir, C. L. Thanh, R. V. Rao, S. Mirjalili and M. A. Wahab, Inverse problem for dynamic structural health-monitoring based on slime mould algorithm, Engineering with Computers (2021) 1–24.
S. Khatir, D. Boutchicha, C. Le Thanh, H. Tran-Ngoc, T. N. Nguyen and M. Abdel-Wahab, Improved ann technique combined with jaya algorithm for crack identification in plates usingxiga and experimental analysis, Theoretical and Applied Fracture Mechanics, 107 (2020) 102554.
H. Tran-Ngoc, S. Khatir, H. Ho-Khac, G. De Roeck, T. Bui-Tien and M. Abdel Wahab, Efficient artificial neural networks basedon a hybrid metaheuristic optimization algorithm for damage detection in laminated composite structures, Composite Structures, 262 (2021) 113339.
Z. Zhao, T. Li, J. Wu, C. Sun, S. Wang, R. Yan and X. Chen, Deep learning algorithms for rotating machinery intelligent diagnosis: an open source benchmark study, ISA Transactions, 107 (2020) 224–255.
A. Khatir, M. A. Wahab, D. Boutchicha and T. Khatir, Structural health monitoring using modal strain energydamage indicator coupled with teaching-learning-based optimization algorithm and isogoemetric analysis, Journal of Sound and Vibration, 448 (2019) 230–246.
X. Zhang, G. Chen, T. Hao and Z. He, Rolling bearing fault convolutional neural network diagnosis method based on casing signal, Journal of Mechanical Science and Technology, 34 (2020) 2307–2316.
Y. Chen, G. Peng, C. Xie, W. Zhang, C. Li and S. Liu, Acdin: bridging the gap between artificial and real bearing damages for bearing fault diag nosis, Neurocomputing, 294 (2018) 61–71.
Z. Wu, H. Jiang, K. Zhao and X. Li, An adaptive deep transfer learning method for bearing fault diagnosis, Measurement, 151 (2020) 107227.
M. J. Hasan, M. M. Islam and J.-M. Kim, Acoustic spectral imaging and transfer learning for reliable bearing fault diagnosis under variable speed conditions, Measurement, 138 (2019) 620–631.
J. Wu, Z. Zhao, C. Sun, R. Yan and X. Chen, Few-shot transfer learning for intelligent fault diagnosis of machine, Measurement, 166 (2020) 108202.
M. Han, Y. Wu, Y. Wang and W. Liu, Roller bearing fault diagnosis based on lmd and multi-scale symbolic dynamic information entropy, Journal of Mechanical Science and Technology, 35 (5) (2021) 1993–2005.
N. Bendre, H. T. Marín and P. Najafirad, Learning from few samples: a survey, arXiv.2007.15484 (2020).
H. Lee, S. J. Hwang and J. Shin, Self-supervised label augmentation via input transformations, arXiv:1910.05872 (2019).
A. Alfassy, L. Karlinsky, A. Aides, J. Shtok, S. Harary, R. Feris, R. Giryes and A. M. Bronstein, Laso: label-set operations networks for multi-label few-shot learning, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (2019) 6548–6557.
F. Sung, Y. Yang, L. Zhang, T. Xiang, P. H. S. Torr and T. M. Hospedales, Learning to compare: relation network for few-shot learning, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2018) 1199–1208.
S. Hochreiter and J. Schmidhuber, Long short-term memory, Neural Computation, 9 (8) (1997) 1735–1780.
A. Graves, G. Wayne and I. Danihelka, Neural turing machines, arXiv:1410.5401 (2014).
Z. Zhang and V. Saligrama, Zero-shot learning via semantic similarity embedding, Proceedings of the IEEE International Conference on Computer Vision (2015) 4166–4174.
A. Frome, G. Corrado, J. Shlens, S. Bengio, J. Dean, M. Ranzato and T. Mikolov, Devise: a adeep visual-semantic embedding model, Advances in Neural Information Processing Systems (2013) 26.
S. Ravi and H. Larochelle, Optimization as a model for few-shot learning, International Conference on Learning Representations (ICLR) (2017).
C. Finn, P. Abbeel and S. Levine, Model-agnostic metalearning for fast adaptation of deep networks, International Conference on Machine Learning (2017) 1126–1135.
J. Yosinski, J. Clune, Y. Bengio and H. Lipson, How transferable are features in deep neural networks?, arXiv:1411.1792 (2014).
A. Li, T. Luo, Z. Lu, T. Xiang and L. Wang, Large-scale few-shot learning: knowledge transfer with class hierarchy, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (2019) 7212–7220.
D. Bahdanau, K. Cho and Y. Bengio, Neural machine translation by jointly learning to align and translate, arXiv:1409.0473 (2014).
D. Hu, An introductory survey on attention mechanisms in nlp problems, Proceedings of SAI Intelligent Systems Conference (2019) 432–448.
J. B. Lee, R. A. Rossi, S. Kim, N. K. Ahmed and E. Koh, Attention models in graphs: a survey, ACM Transactions on Knowledge Discovery from Data (TKDD), 13 (6) (2019) 1–25.
S. Woo, J. Park, J.-Y. Lee and I. S. Kweon, CBAM: convolutional block attention module, Proceedings of the European Conference on Computer Vision (ECCV) (2018) 3–19.
J. Hu, L. Shen and G. Sun, Squeeze-and-excitation networks, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2018) 7132–7141.
L. Chen, H. Zhang, J. Xiao, L. Nie, J. Shao, W. Liu and T. Chua, SCA-CNN: spatial and channel-wise attention in convolutional networks for image captioning, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2017) 5659–5667.
A. Santoro, D. Raposo, D. G. T. Barrett, M. Malinowski, R. Pascanu, P. Battaglia and T. Lillicrap, A simple neural network module for relational reasoning, arXiv:1706.01427 (2017).
Acknowledgments
This work was supported by the National Key R&D Program of China (Grant No. 2020YFB1712901), the Major Special Program of Chongqing Science and Technology Commission (No. CSTC 2019jscx-zdztzxX0031), Graduate Scientific Research and Innovation Foundation of Chongqing (No. CYB 19072).
Author information
Authors and Affiliations
Corresponding author
Additional information
Yao Hu received the B.Sc. degree from the School of Big Data & Software Engineering, Chongqing University, Chongqing, China, in 2019. He is currently pursuing the M.Sc. degree in Big Data & Software Engineering at Chongqing University, Chongqing, China. His current research interests include deep learning and intelligent system.
Qingyu Xiong received the Ph.D. degree in Electrical and Electronic Systems Engineering from Kyushu University, Fukuoka, Japan, in 2002, and the M.Sc. degree from Chongqing University, Chongqing, China, in 1991. He is currently a Professor with the School of Big Data & Software Engineering, Chongqing University. His current research interests include intelligent system and intelligent computation, and pervasive computation and embedded system. Dr. Xiong is a member of the China Computer Federation.
Rights and permissions
About this article
Cite this article
Hu, Y., Xiong, Q., Zhu, Q. et al. Few-shot transfer learning with attention for intelligent fault diagnosis of bearing. J Mech Sci Technol 36, 6181–6192 (2022). https://doi.org/10.1007/s12206-022-1132-4
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12206-022-1132-4