Skip to main content
Log in

Signal discrimination using category-preserving bag-of-words model for condition monitoring

  • Original Article
  • Published:
Neural Computing and Applications Aims and scope Submit manuscript

Abstract

Signal discrimination contributes to the development of machine–machine and human–machine interactive intelligent systems. In this study, a novel framework for signal discrimination was proposed. The proposed framework comprised three phases. In Phase I, a waveform shape-based feature extraction method was used for parameterizing signals. In Phase II, a novel category-preserving bag-of-words (CPBoW) model was proposed. In Phase III, signals were discriminated using a vector space model with term frequency–inverse document frequency. The bag-of-words model generally demonstrated promising performance for signal discrimination. However, the inherent connections among signals of homogeneous categories were considerably lost during signal framing and codebook generation processes. This was because the codebook was simply generated by clustering signal frame samples in the Euclidean space. In the proposed CPBoW model, Taguchi’s quality engineering method was used to develop a category-preserving distance metric for executing a clustering process to generate category-preserving codewords. This preserved category information in the codebook and consequently increased the effectiveness of the discrimination process. The proposed framework was verified through three condition monitoring applications that involved a musical instrument recognition problem, motor bearing fault recognition problem, and heart disease recognition problem. The results indicated the superior performance and effectiveness of the proposed framework.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

References

  1. Mukhopadhyay S, Biswas S, Roy AB, Dey N (2012) Wavelet based QRS complex detection of ECG signal. Int J Eng Res Appl 2(3):2361–2365

    Google Scholar 

  2. Bhalke DG, Rao CR, Bormane DS (2014) Hybridization of fractional fourier transform and acoustic features for musical instrument recognition. Int J Signal Process Image Process Pattern Recognit 7(1):275–282

    Google Scholar 

  3. Ebrahimzadeh A, Shakiba B, Khazaee A (2014) Detection of electrocardiogram signals using an efficient method. Appl Soft Comput 22:108–117

    Google Scholar 

  4. Yin J, Wang W, Man Z, Khoo S (2014) Statistical modeling of gear vibration signals and its application to detecting and diagnosing gear faults. Inf Sci 259:295–303

    MathSciNet  MATH  Google Scholar 

  5. Teti R (2015) Advanced IT methods of signal processing and decision making for zero defect manufacturing in machining. Procedia CIRP 28:3–15

    Google Scholar 

  6. Bhalke DG, Rao CR, Bormane DS (2016) Automatic musical instrument classification using fractional fourier transform based-MFCC features and counter propagation neural network. J Intell Inf Syst 46(3):425–446

    Google Scholar 

  7. Wang D (2016) K-nearest neighbors based methods for identification of different gear crack levels under different motor speeds and loads: revisited. Mech Syst Signal Process 70:201–208

    Google Scholar 

  8. Pławiak P (2018) Novel methodology of cardiac health recognition based on ECG signals and evolutionary-neural system. Expert Syst Appl 92:334–349

    Google Scholar 

  9. Chatterjee S, Dey N, Shi F, Ashour AS, Fong SJ, Sen S (2018) Clinical application of modified bag-of-features coupled with hybrid neural-based classifier in dengue fever classification using gene expression data. Med Biol Eng Compu 56:709–720

    Google Scholar 

  10. Hsiao YH, Su CT (2009) Multi-class MTS for saxophone timbre quality inspection using waveform shape-based features. IEEE Trans Syst Man Cybern B Cybern 39(3):690–704

    Google Scholar 

  11. Pons-Llinares J, Antonino-Daviu J, Riera-Guasp M, Lee S, Kang TJ, Yang C (2015) Advanced induction motor rotor fault diagnosis via continuous and discrete time–frequency tools. IEEE Trans Industr Electron 62(3):1791–1802

    Google Scholar 

  12. Goyal D, Pabla BS (2016) The vibration monitoring methods and signal processing techniques for structural health monitoring: a review. Arch Comput Methods Eng 23(4):585–594

    MathSciNet  MATH  Google Scholar 

  13. Chauhan S, Wang P, Sing Lim C, Anantharaman V (2008) A computer-aided MFCC-based HMM system for automatic auscultation. Comput Biol Med 38(2):221–233

    Google Scholar 

  14. Fu Z, Lu G, Ting KM, Zhang D (2011) Music classification via the bag-of-features approach. Pattern Recognit Lett 32(14):1768–1777

    Google Scholar 

  15. Zokaee S, Faez K (2012) Human identification based on ECG and palmprint. Int J Electr Comput Eng 2(2):261–266

    Google Scholar 

  16. Schmitt M, Ringeval F, Schuller BW (2016) At the border of acoustics and linguistics: bag-of-audio-words for the recognition of emotions in speech. In: Interspeech, pp 495–499

  17. Proakis JG, Manolakis DG (2006) Digital signal processing: principles, algorithms, and applications. Prentice Hall, Englewood Cliffs

    Google Scholar 

  18. Lee CH, Shih JL, Yu KM, Lin HS (2009) Automatic music genre classification based on modulation spectral analysis of spectral and cepstral features. IEEE Trans Multimed 11(4):670–682

    Google Scholar 

  19. Morvidone M, Sturm BL, Daudet L (2010) Incorporating scale information with cepstral features: experiments on musical instrument recognition. Pattern Recognit Lett 31(12):1489–1497

    Google Scholar 

  20. Peeters G (2011) Spectral and temporal periodicity representations of rhythm for the automatic classification of music audio signal. IEEE Trans Audio Speech Lang Process 19(5):1242–1252

    Google Scholar 

  21. Engin M (2004) ECG beat classification using neuro-fuzzy network. Pattern Recognit Lett 25(15):1715–1722

    Google Scholar 

  22. Zheng J, Pan H, Cheng J (2017) Rolling bearing fault detection and diagnosis based on composite multiscale fuzzy entropy and ensemble support vector machines. Mech Syst Signal Process 85:746–759

    Google Scholar 

  23. Wang C, Huang K (2015) How to use bag-of-words model better for image classification. Image Vis Comput 38:65–74

    Google Scholar 

  24. Qin J, Yung NHC (2010) Scene categorization via contextual visual words. Pattern Recognit 43(5):1874–1888

    MATH  Google Scholar 

  25. Li T, Mei T, Kweon I-S, Hua X-S (2011) Contextual bag-of-words for visual categorization. IEEE Trans Circuits Syst Video Technol 21(4):381–392

    Google Scholar 

  26. Passalis N, Tefas A (2016) Entropy optimized feature-based bag-of-words representation for information retrieval. IEEE Trans Knowl Data Eng 28(7):1664–1677

    Google Scholar 

  27. Dimitrovski I, Kocev D, Loskovska S, Džeroski S (2016) Improving bag-of-visual-words image retrieval with predictive clustering trees. Inf Sci 329:851–865

    Google Scholar 

  28. Beagum S, Ashour A, Dey N (2016) Bag-of features in microscopic images classification. In: Dey N, Ashour A (eds) Classification and clustering in biomedical signal processing. IGI Global, Hershey, pp 1–22

    Google Scholar 

  29. Passalis N, Tefas A (2017) Learning neural bag-of-features for large-scale image retrieval. IEEE Trans Syst Man Cybern Syst 47(10):2641–2652

    Google Scholar 

  30. Lin J, Li Y (2009) Finding structural similarity in time series data using bag-of-patterns representation. In: International conference on scientific and statistical database management, pp 461–477

    Google Scholar 

  31. González LC, Moreno R, Escalante HJ, Martínez F, Carlos MR (2017) Learning roadway surface disruption patterns using the bag of words representation. IEEE Trans Intell Transp Syst 18(11):2916–2928

    Google Scholar 

  32. Zhao J, Itti L (2016) Classifying time series using local descriptors with hybrid sampling. IEEE Trans Knowl Data Eng 28(3):623–637

    Google Scholar 

  33. Tsui KL (1992) An overview of Taguchi method and newly developed statistical methods for robust design. IIE Trans 24(5):44–57

    Google Scholar 

  34. Taguchi G, Chowdhury S, Wu Y (2005) Taguchi’s quality engineering handbook. Wiley, Hoboken

    MATH  Google Scholar 

  35. Singhal A (2001) Modern information retrieval: a brief overview. Bull IEEE Comput Soc Tech Comm Data Eng 24(4):35–43

    Google Scholar 

  36. Zhang H, Chow TW, Wu QJ (2016) Organizing books and authors by multilayer SOM. IEEE Trans Neural Netw Learn Syst 27(12):2537–2550

    Google Scholar 

  37. Burden RL, Faires JD (2000) Numerical analysis, 7th Bk&Cdr ed. Brooks/Cole, Boston

    Google Scholar 

  38. Pham DT, Dimov SS, Nguyen CD (2005) Selection of K in K-means clustering. J Mech Eng Sci 219(1):103–119

    Google Scholar 

  39. Huang A (2008) Similarity measures for text document clustering. In: Proceedings of the sixth New Zealand computer science research student conference, pp 49–56

  40. Seshadrinath J, Singh B, Panigrahi BK (2014) Vibration analysis based interturn fault diagnosis in induction machines. IEEE Trans Ind Inf 10(1):340–350

    Google Scholar 

  41. Rauber TW, de Assis Boldt F, Varejão FM (2015) Heterogeneous feature models and feature selection applied to bearing fault diagnosis. IEEE Trans Ind Electron 62(1):637–646

    Google Scholar 

  42. Sharma A, Amarnath M, Kankar PK (2016) Feature extraction and fault severity classification in ball bearings. J Vib Control 22(1):176–192

    Google Scholar 

  43. Goldberger AL, Amaral LAN, Glass L, Hausdorff JM, Ivanov PCh, Mark RG, Mietus JE, Moody GB, Peng CK, Stanley HE (2000) PhysioBank, PhysioToolkit, and PhysioNet: components of a new research resource for complex physiologic signals. Circulation 101(23):e215–e220

    Google Scholar 

  44. Slaney M (1998) Auditory toolbox. Software retrieved August 13, 2017, from http://cobweb.ecn.purdue.edu/~malcolm/interval/1998-010/

  45. Chang CC, Lin CJ (2001) LIBSVM: a library for support vector machines, 2001. Software Retrieved January 21, 2017, from http://www.csie.ntu.edu.tw/~cjlin/libsvm

  46. University of Iowa Musical Instrument Sample Database. http://theremin.music.uiowa.edu/index.html

  47. Loparo KA (2013) Bearing Data Center, Case Western Reserve University. http://csegroups.case.edu/bearingdatacenter/pages/welcome-case-western-reserve-university-bearing-data-center-website. Accessed 6 Mar 2016

  48. Zhang S, Li W (2014) Bearing condition recognition and degradation assessment under varying running conditions using NPE and SOM. Math Problems Eng Vol. 2014, Article ID 781583

  49. Moody GB, Mark RG (2001) The impact of the MIT-BIH arrhythmia database. IEEE Eng Med Biol Mag 20(3):45–50

    Google Scholar 

  50. Su CT, Hsiao YH (2007) An evaluation of the robustness of MTS for imbalanced data. IEEE Trans Knowl Data Eng 19(10):1321–1332

    Google Scholar 

Download references

Acknowledgements

This study was supported by the Ministry of Science and Technology of Taiwan (Grant No. NSC 102-2410-H-305-062).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yu-Hsiang Hsiao.

Ethics declarations

Conflict of interest

The author declares that he has no conflict of interest.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Hsiao, YH. Signal discrimination using category-preserving bag-of-words model for condition monitoring. Neural Comput & Applic 31, 8615–8630 (2019). https://doi.org/10.1007/s00521-018-3799-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00521-018-3799-0

Keywords

Navigation