Abstract
The underwater thruster is considered one of the most critical components located on an unmanned underwater vehicle to maneuver in the water. However, it is recognized as a common source of the fault. This phenomenon is made worse when collected data for equipment health diagnostics are highly imbalanced. A new sampling method to tackle the problem of imbalanced data based on cosine similarity is proposed to improve the classification accuracy for thruster health diagnostics. The results show that it outperforms SMOTE (Synthetic Minority Oversampling Technique) and ADASYN (Adaptive Synthetic Sampling Approach for Imbalanced Learning). The proposed method was further validated using different imbalanced datasets with different imbalance ratio from KEEL and UCI machine learning repository (such as Pima Indians Diabetes, Ionosphere, Fertility Diagnostics, Mammographic Masses, Blood Transfusion Service Centre). The majority of the results from the datasets show that the proposed method produces the higher classification accuracy as well as g-means that suggests the potential approach for classification problem that has a highly imbalanced dataset.
Similar content being viewed by others
References
Sun Y, Ran X, Li Y, Zhang G, Zhang Y (2016) Thruster fault diagnosis method based on Gaussian particle filter for autonomous underwater vehicles. Int J Nav Archit Ocean Eng 8:243–251. https://doi.org/10.1016/j.ijnaoe.2016.03.003
Dos Santos CHF, Cardozo DIK, Reginatto R, De Pieri ER (2016) Bank of controllers and virtual thrusters for fault-tolerant control of autonomous underwater vehicles. Ocean Eng 121:210–223. https://doi.org/10.1016/j.oceaneng.2016.05.029
Omerdic E, Roberts G (2004) Thruster fault diagnosis and accommodation for open-frame underwater vehicles. Control Eng Pract 12:1575–1598. https://doi.org/10.1016/j.conengprac.2003.12.014
Shin J-H, Jun H-B (2015) On condition based maintenance policy. J Comput Des Eng 2:119–127. https://doi.org/10.1016/j.jcde.2014.12.006
Susto GA, Schirru A, Pampuri S, McLoone S, Beghi A (2015) Machine learning for predictive maintenance: a multiple classifier approach. IEEE Trans Ind Inform 11:812–820. https://doi.org/10.1109/TII.2014.2349359
Lee J, Wu F, Zhao W, Ghaffari M, Liao L, Siegel D (2014) Prognostics and health management design for rotary machinery systems—reviews, methodology and applications. Mech Syst Signal Process 42:314–334. https://doi.org/10.1016/j.ymssp.2013.06.004
Yu J (2015) Machine health prognostics using the Bayesian-inference-based probabilistic indication and high-order particle filtering framework. J Sound Vib 358:97–110. https://doi.org/10.1016/j.jsv.2015.08.013
Nayal A, Jomaa H, Awad M (2017) KerMinSVM for imbalanced datasets with a case study on arabic comics classification. Eng Appl Artif Intell 59:159–169. https://doi.org/10.1016/j.engappai.2017.01.001
Fan Q, Wang Z, Gao D (2016) One-sided dynamic undersampling No-propagation neural networks for imbalance problem. Eng Appl Artif Intell 53:62–73. https://doi.org/10.1016/j.engappai.2016.02.011
Sundarkumar GG, Ravi V (2015) A novel hybrid undersampling method for mining unbalanced datasets in banking and insurance. Eng Appl Artif Intell 37:368–377. https://doi.org/10.1016/j.engappai.2014.09.019
He H, Garcia EA (2009) Learning from imbalanced data. IEEE Trans Knowl Data Eng 21:1263–1284. https://doi.org/10.1109/TKDE.2008.239
Douzas G, Bacao F (2017) Self-organizing map oversampling (SOMO) for imbalanced data set learning. Expert Syst Appl 82:40–52. https://doi.org/10.1016/j.eswa.2017.03.073
Haixiang G, Yijing L, Yanan L, Xiao L, Jinling L (2015) BPSO-Adaboost-KNN ensemble learning algorithm for multi-class imbalanced data classification. Eng Appl Artif Intell 49:176–193. https://doi.org/10.1016/j.engappai.2015.09.011
Barua S, Islam MM, Yao X, Murase K (2014) MWMOTE—majority weighted minority oversampling technique for imbalanced data set learning. IEEE Trans Knowl Data Eng 26:405–425. https://doi.org/10.1109/TKDE.2012.232
Ng WWY, Hu J, Yeung DS, Yin S, Roli F (2015) Diversified sensitivity-based undersampling for imbalance classification problems. IEEE Trans Cybern 45:2402–2412. https://doi.org/10.1109/TCYB.2014.2372060
Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP (2002) SMOTE: synthetic minority over-sampling technique. J Artif Intell Res 16:321–357. https://doi.org/10.1613/jair.953
Gao M, Hong X, Chen S, Harris CJ (2011) On combination of SMOTE and particle swarm optimization based radial basis function classifier for imbalanced problems. In: International joint conference neural networks, pp 1146–1153. https://doi.org/10.1109/ijcnn.2011.6033353
Nekooeimehr I, Lai-Yuen SK (2016) Cluster-based weighted oversampling for ordinal regression (CWOS-Ord). Neurocomputing 218:51–60. https://doi.org/10.1016/j.neucom.2016.08.071
Chawla NV, Lazarevic A, Hall L, Bowyer K (2003) SMOTEBoost: improving prediction of the minority class in boosting. Knowledge Discovery in Databases: PKDD pp 107–119
Cui Y, Ma H, Saha T (2014) Improvement of power transformer insulation diagnosis using oil characteristics data preprocessed by SMOTEBoost technique. IEEE Trans Dielectr Electr Insul 21:2363–2373. https://doi.org/10.1109/TDEI.2014.004547
Seiffert C, Khoshgoftaar T, Van Hulse J, Napolitano A (2010) RUSBoost: a hybrid approach to alleviating class imbalance. IEEE Trans Syst Man Cybern Part A Syst Hum 40:185–197. https://doi.org/10.1109/TSMCA.2009.2029559
He H, Bai Y, Garcia EA, Li S (2008) ADASYN: adaptive synthetic sampling approach for imbalanced learning. In: Proceedings of the international joint conference on neural networks. pp 1322–1328. https://doi.org/10.1109/ijcnn.2008.4633969
Guo H, Viktor HL (2004) Learning from imbalanced data sets with boosting and data generation: the databoost-IM approach. ACM SIGKD Explor Newslett 6:30–39. https://doi.org/10.1145/1007730.1007736
Galar M, Fernandez A, Barrenechea E, Bustince H, Herrera F (2012) A review on ensembles for the class imbalance problem: bagging-, boosting-, and hybrid-based approaches. IEEE Trans Syst Man Cybern Part C Appl Rev. https://doi.org/10.1109/TSMCC.2011.2161285
Nekooeimehr I, Lai-Yuen SK (2016) Adaptive semi-unsupervised weighted oversampling (A-SUWO) for imbalanced datasets. Expert Syst Appl 46:405–416. https://doi.org/10.1016/j.eswa.2015.10.031
Baccour L, John RI (2015) Experimental analysis of crisp similarity and distance measures. In: 6th international conference on soft computing and pattern recognition, SoCPaR 2014. pp 96–100. https://doi.org/10.1109/socpar.2014.7007988
Xia P, Zhang L, Li F (2015) Learning similarity with cosine similarity ensemble. Inf Sci 307:39–52. https://doi.org/10.1016/j.ins.2015.02.024
Huang G-B, Zhou H, Ding X, Zhang R (2012) Extreme learning machine for regression and multiclass classification. IEEE Trans Syst Man Cybern Part B Cybern 42:513–529. https://doi.org/10.1109/TSMCB.2011.2168604
Liang N-Y, Huang G-B, Saratchandran P, Sundararajan N (2006) A fast and accurate online sequential learning algorithm for feedforward networks. IEEE Trans Neural Netw 17:1411–1423. https://doi.org/10.1109/TNN.2006.880583
Samat A, Du PJ, Liu SC, Li J, Cheng L (2014) (ELMs)-L-2: ensemble extreme learning machines for hyperspectral image classification. IEEE J Sel Top Appl Earth Obs Remote Sens 7:1060–1069. https://doi.org/10.1109/jstars.2014.2301775
Javed K, Gouriveau R, Zerhouni N, Zemouri R, Li X (2012) Robust, reliable and applicable tool wear monitoring and prognostic: approach based on an improved-extreme learning machine. In: IEEE conference on prognostics and health management
Kowalski J, Krawczyk B, Woźniak M (2017) Fault diagnosis of marine 4-stroke diesel engines using a one-vs-one extreme learning ensemble. Eng Appl Artif Intell 57:134–141. https://doi.org/10.1016/j.engappai.2016.10.015
Wang N, Er MJ, Han M (2015) Generalized single-hidden layer feedforward networks for regression problems. IEEE Trans Neural Netw Learn Syst 26:1161–1176. https://doi.org/10.1109/TNNLS.2014.2334366
Alexandre E, Cuadra L, Salcedo-Sanz S, Pastor-Sánchez A, Casanova-Mateo C (2015) Hybridizing extreme learning machines and genetic algorithms to select acoustic features in vehicle classification applications. Neurocomputing 152:58–68. https://doi.org/10.1016/j.neucom.2014.11.019
Lei B, Rahman SA, Song I (2014) Content-based classification of breath sound with enhanced features. Neurocomputing 141:139–147. https://doi.org/10.1016/j.neucom.2014.04.002
de Cheveigné A, Kawahara H (2002) YIN, a fundamental frequency estimator for speech and music. J Acoust Soc Am 111:1917–1930. https://doi.org/10.1121/1.1458024
Salamon J, Gomez E (2012) Melody extraction from polyphonic music signals using pitch contour characteristics. IEEE Trans Audio Speech Lang Process 20:1759–1770. https://doi.org/10.1109/TASL.2012.2188515
Huang Guang-bin, Qin-yu Zhu CS (2006) Extreme learning machine: a new learning scheme of feedforward neural networks. Neurocomputing 70:489–501. https://doi.org/10.1109/IJCNN.2004.1380068
Chan TK, Chin CS (2017) Proposed framework for multi-level extreme machine learning for underwater thruster’s fault classification using YIN fundamental frequency estimator and pitch sound. In: IEEE international conference on advanced robotics and mechatronics
KEEL Dataset Repository, http://sci2s.ugr.es/keel/datasets.php, 2017 Accessed 27 June 2017
UCI Machine Learning Repository, https://archive.ics.uci.edu/ml/index.php, 2017. Accessed 27 June 2017
Acknowledgements
This work is fully supported by the Newcastle University in the UK and Singapore. The authors would like to thank all the staff involved in this project.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Chan, T.K., Chin, C.S. Health stages diagnostics of underwater thruster using sound features with imbalanced dataset. Neural Comput & Applic 31, 5767–5782 (2019). https://doi.org/10.1007/s00521-018-3407-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-018-3407-3