Skip to main content
Log in

Health stages diagnostics of underwater thruster using sound features with imbalanced dataset

  • Original Article
  • Published:
Neural Computing and Applications Aims and scope Submit manuscript

Abstract

The underwater thruster is considered one of the most critical components located on an unmanned underwater vehicle to maneuver in the water. However, it is recognized as a common source of the fault. This phenomenon is made worse when collected data for equipment health diagnostics are highly imbalanced. A new sampling method to tackle the problem of imbalanced data based on cosine similarity is proposed to improve the classification accuracy for thruster health diagnostics. The results show that it outperforms SMOTE (Synthetic Minority Oversampling Technique) and ADASYN (Adaptive Synthetic Sampling Approach for Imbalanced Learning). The proposed method was further validated using different imbalanced datasets with different imbalance ratio from KEEL and UCI machine learning repository (such as Pima Indians Diabetes, Ionosphere, Fertility Diagnostics, Mammographic Masses, Blood Transfusion Service Centre). The majority of the results from the datasets show that the proposed method produces the higher classification accuracy as well as g-means that suggests the potential approach for classification problem that has a highly imbalanced dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

References

  1. Sun Y, Ran X, Li Y, Zhang G, Zhang Y (2016) Thruster fault diagnosis method based on Gaussian particle filter for autonomous underwater vehicles. Int J Nav Archit Ocean Eng 8:243–251. https://doi.org/10.1016/j.ijnaoe.2016.03.003

    Article  Google Scholar 

  2. Dos Santos CHF, Cardozo DIK, Reginatto R, De Pieri ER (2016) Bank of controllers and virtual thrusters for fault-tolerant control of autonomous underwater vehicles. Ocean Eng 121:210–223. https://doi.org/10.1016/j.oceaneng.2016.05.029

    Article  Google Scholar 

  3. Omerdic E, Roberts G (2004) Thruster fault diagnosis and accommodation for open-frame underwater vehicles. Control Eng Pract 12:1575–1598. https://doi.org/10.1016/j.conengprac.2003.12.014

    Article  Google Scholar 

  4. Shin J-H, Jun H-B (2015) On condition based maintenance policy. J Comput Des Eng 2:119–127. https://doi.org/10.1016/j.jcde.2014.12.006

    Article  Google Scholar 

  5. Susto GA, Schirru A, Pampuri S, McLoone S, Beghi A (2015) Machine learning for predictive maintenance: a multiple classifier approach. IEEE Trans Ind Inform 11:812–820. https://doi.org/10.1109/TII.2014.2349359

    Article  Google Scholar 

  6. Lee J, Wu F, Zhao W, Ghaffari M, Liao L, Siegel D (2014) Prognostics and health management design for rotary machinery systems—reviews, methodology and applications. Mech Syst Signal Process 42:314–334. https://doi.org/10.1016/j.ymssp.2013.06.004

    Article  Google Scholar 

  7. Yu J (2015) Machine health prognostics using the Bayesian-inference-based probabilistic indication and high-order particle filtering framework. J Sound Vib 358:97–110. https://doi.org/10.1016/j.jsv.2015.08.013

    Article  Google Scholar 

  8. Nayal A, Jomaa H, Awad M (2017) KerMinSVM for imbalanced datasets with a case study on arabic comics classification. Eng Appl Artif Intell 59:159–169. https://doi.org/10.1016/j.engappai.2017.01.001

    Article  Google Scholar 

  9. Fan Q, Wang Z, Gao D (2016) One-sided dynamic undersampling No-propagation neural networks for imbalance problem. Eng Appl Artif Intell 53:62–73. https://doi.org/10.1016/j.engappai.2016.02.011

    Article  Google Scholar 

  10. Sundarkumar GG, Ravi V (2015) A novel hybrid undersampling method for mining unbalanced datasets in banking and insurance. Eng Appl Artif Intell 37:368–377. https://doi.org/10.1016/j.engappai.2014.09.019

    Article  Google Scholar 

  11. He H, Garcia EA (2009) Learning from imbalanced data. IEEE Trans Knowl Data Eng 21:1263–1284. https://doi.org/10.1109/TKDE.2008.239

    Article  Google Scholar 

  12. Douzas G, Bacao F (2017) Self-organizing map oversampling (SOMO) for imbalanced data set learning. Expert Syst Appl 82:40–52. https://doi.org/10.1016/j.eswa.2017.03.073

    Article  Google Scholar 

  13. Haixiang G, Yijing L, Yanan L, Xiao L, Jinling L (2015) BPSO-Adaboost-KNN ensemble learning algorithm for multi-class imbalanced data classification. Eng Appl Artif Intell 49:176–193. https://doi.org/10.1016/j.engappai.2015.09.011

    Article  Google Scholar 

  14. Barua S, Islam MM, Yao X, Murase K (2014) MWMOTE—majority weighted minority oversampling technique for imbalanced data set learning. IEEE Trans Knowl Data Eng 26:405–425. https://doi.org/10.1109/TKDE.2012.232

    Article  Google Scholar 

  15. Ng WWY, Hu J, Yeung DS, Yin S, Roli F (2015) Diversified sensitivity-based undersampling for imbalance classification problems. IEEE Trans Cybern 45:2402–2412. https://doi.org/10.1109/TCYB.2014.2372060

    Article  Google Scholar 

  16. Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP (2002) SMOTE: synthetic minority over-sampling technique. J Artif Intell Res 16:321–357. https://doi.org/10.1613/jair.953

    Article  MATH  Google Scholar 

  17. Gao M, Hong X, Chen S, Harris CJ (2011) On combination of SMOTE and particle swarm optimization based radial basis function classifier for imbalanced problems. In: International joint conference neural networks, pp 1146–1153. https://doi.org/10.1109/ijcnn.2011.6033353

  18. Nekooeimehr I, Lai-Yuen SK (2016) Cluster-based weighted oversampling for ordinal regression (CWOS-Ord). Neurocomputing 218:51–60. https://doi.org/10.1016/j.neucom.2016.08.071

    Article  Google Scholar 

  19. Chawla NV, Lazarevic A, Hall L, Bowyer K (2003) SMOTEBoost: improving prediction of the minority class in boosting. Knowledge Discovery in Databases: PKDD pp 107–119

  20. Cui Y, Ma H, Saha T (2014) Improvement of power transformer insulation diagnosis using oil characteristics data preprocessed by SMOTEBoost technique. IEEE Trans Dielectr Electr Insul 21:2363–2373. https://doi.org/10.1109/TDEI.2014.004547

    Article  Google Scholar 

  21. Seiffert C, Khoshgoftaar T, Van Hulse J, Napolitano A (2010) RUSBoost: a hybrid approach to alleviating class imbalance. IEEE Trans Syst Man Cybern Part A Syst Hum 40:185–197. https://doi.org/10.1109/TSMCA.2009.2029559

    Article  Google Scholar 

  22. He H, Bai Y, Garcia EA, Li S (2008) ADASYN: adaptive synthetic sampling approach for imbalanced learning. In: Proceedings of the international joint conference on neural networks. pp 1322–1328. https://doi.org/10.1109/ijcnn.2008.4633969

  23. Guo H, Viktor HL (2004) Learning from imbalanced data sets with boosting and data generation: the databoost-IM approach. ACM SIGKD Explor Newslett 6:30–39. https://doi.org/10.1145/1007730.1007736

    Article  Google Scholar 

  24. Galar M, Fernandez A, Barrenechea E, Bustince H, Herrera F (2012) A review on ensembles for the class imbalance problem: bagging-, boosting-, and hybrid-based approaches. IEEE Trans Syst Man Cybern Part C Appl Rev. https://doi.org/10.1109/TSMCC.2011.2161285

    Article  Google Scholar 

  25. Nekooeimehr I, Lai-Yuen SK (2016) Adaptive semi-unsupervised weighted oversampling (A-SUWO) for imbalanced datasets. Expert Syst Appl 46:405–416. https://doi.org/10.1016/j.eswa.2015.10.031

    Article  Google Scholar 

  26. Baccour L, John RI (2015) Experimental analysis of crisp similarity and distance measures. In: 6th international conference on soft computing and pattern recognition, SoCPaR 2014. pp 96–100. https://doi.org/10.1109/socpar.2014.7007988

  27. Xia P, Zhang L, Li F (2015) Learning similarity with cosine similarity ensemble. Inf Sci 307:39–52. https://doi.org/10.1016/j.ins.2015.02.024

    Article  MathSciNet  MATH  Google Scholar 

  28. Huang G-B, Zhou H, Ding X, Zhang R (2012) Extreme learning machine for regression and multiclass classification. IEEE Trans Syst Man Cybern Part B Cybern 42:513–529. https://doi.org/10.1109/TSMCB.2011.2168604

    Article  Google Scholar 

  29. Liang N-Y, Huang G-B, Saratchandran P, Sundararajan N (2006) A fast and accurate online sequential learning algorithm for feedforward networks. IEEE Trans Neural Netw 17:1411–1423. https://doi.org/10.1109/TNN.2006.880583

    Article  Google Scholar 

  30. Samat A, Du PJ, Liu SC, Li J, Cheng L (2014) (ELMs)-L-2: ensemble extreme learning machines for hyperspectral image classification. IEEE J Sel Top Appl Earth Obs Remote Sens 7:1060–1069. https://doi.org/10.1109/jstars.2014.2301775

    Article  Google Scholar 

  31. Javed K, Gouriveau R, Zerhouni N, Zemouri R, Li X (2012) Robust, reliable and applicable tool wear monitoring and prognostic: approach based on an improved-extreme learning machine. In: IEEE conference on prognostics and health management

  32. Kowalski J, Krawczyk B, Woźniak M (2017) Fault diagnosis of marine 4-stroke diesel engines using a one-vs-one extreme learning ensemble. Eng Appl Artif Intell 57:134–141. https://doi.org/10.1016/j.engappai.2016.10.015

    Article  Google Scholar 

  33. Wang N, Er MJ, Han M (2015) Generalized single-hidden layer feedforward networks for regression problems. IEEE Trans Neural Netw Learn Syst 26:1161–1176. https://doi.org/10.1109/TNNLS.2014.2334366

    Article  MathSciNet  Google Scholar 

  34. Alexandre E, Cuadra L, Salcedo-Sanz S, Pastor-Sánchez A, Casanova-Mateo C (2015) Hybridizing extreme learning machines and genetic algorithms to select acoustic features in vehicle classification applications. Neurocomputing 152:58–68. https://doi.org/10.1016/j.neucom.2014.11.019

    Article  Google Scholar 

  35. Lei B, Rahman SA, Song I (2014) Content-based classification of breath sound with enhanced features. Neurocomputing 141:139–147. https://doi.org/10.1016/j.neucom.2014.04.002

    Article  Google Scholar 

  36. de Cheveigné A, Kawahara H (2002) YIN, a fundamental frequency estimator for speech and music. J Acoust Soc Am 111:1917–1930. https://doi.org/10.1121/1.1458024

    Article  Google Scholar 

  37. Salamon J, Gomez E (2012) Melody extraction from polyphonic music signals using pitch contour characteristics. IEEE Trans Audio Speech Lang Process 20:1759–1770. https://doi.org/10.1109/TASL.2012.2188515

    Article  Google Scholar 

  38. Huang Guang-bin, Qin-yu Zhu CS (2006) Extreme learning machine: a new learning scheme of feedforward neural networks. Neurocomputing 70:489–501. https://doi.org/10.1109/IJCNN.2004.1380068

    Article  Google Scholar 

  39. Chan TK, Chin CS (2017) Proposed framework for multi-level extreme machine learning for underwater thruster’s fault classification using YIN fundamental frequency estimator and pitch sound. In: IEEE international conference on advanced robotics and mechatronics

  40. KEEL Dataset Repository, http://sci2s.ugr.es/keel/datasets.php, 2017 Accessed 27 June 2017

  41. UCI Machine Learning Repository, https://archive.ics.uci.edu/ml/index.php, 2017. Accessed 27 June 2017

Download references

Acknowledgements

This work is fully supported by the Newcastle University in the UK and Singapore. The authors would like to thank all the staff involved in this project.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Cheng Siong Chin.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Chan, T.K., Chin, C.S. Health stages diagnostics of underwater thruster using sound features with imbalanced dataset. Neural Comput & Applic 31, 5767–5782 (2019). https://doi.org/10.1007/s00521-018-3407-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00521-018-3407-3

Keywords

Navigation