Skip to main content
Log in

Development of Optimal Feature Selection and Deep Learning Toward Hungry Stomach Detection Using Audio Signals

  • Published:
Journal of Control, Automation and Electrical Systems Aims and scope Submit manuscript

Abstract

The process of transmitting signals to the body regarding the hungry stomach is referred to as the migrating motor complex (MMC) process. The intestines and stomach are considered for sensing the unavailability of food in the body. Hence, the receptors present in the stomach wall generate the electrical activity waves and activate the hunger. In general, audio signal processing algorithms include signal analysis, property extraction, and behavior prediction, identifying the pattern available in the signal, and predicting how a specific signal is correlated to various identical signals. The major challenge here is to consider the audio signals that are produced from the stomach for identifying the growling sound that well describes the hungry. The main intent of this paper is to develop an intelligent model using the deep learning concept for recognizing the hungry stomach by using the synthetically collected audio signals through mobile phones. This makes society reaching the hungry stomach by way of intelligent technology. The proposed detection model covers different phases for automated hungry stomach detection. The data acquisition is performed by gathering information using mobile phones. Further, the pre-processing of the signals is done by the median filtering and the smoothening methods. In order to perform the proper classification, the spectral features like spectral centroid, spectral roll-off, spectral skewness, spectral kurtosis, spectral slope, spectral crest factor, and spectral flux, and cepstral domain features like mel-frequency cepstral coefficients (MFCCs), linear prediction cepstral coefficients (LPCCs), Perceptual linear prediction (PLP) cepstral coefficients, Greenwood function cepstral coefficients (GFCC), and gammatone cepstral coefficients (GTCCs) are extracted. Further, the optimal feature selection is done by the improved meta-heuristic algorithm called best and worst position updated deer hunting optimization algorithm (BWP-DHOA). An improved deep learning model called optimized recurrent neural network (RNN) is used for classifying the optimal features of the audio signal into growling sound and burp sound. Finally, the performance comparison over the existing models proves the efficiency and reliability of the proposed model.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Abbreviations

MMC:

Migrating motor complex

LPCC:

Linear prediction cepstral coefficient

ASC:

Acoustic scene classification

NN:

Nearest neighbor

MFCC:

Mel-frequency cepstral coefficient

CASA:

Computational auditory scene analysis

SSA:

Spectral similarity-based algorithm

PLP:

Perceptual linear prediction cepstral coefficient

RNN:

Recurrent neural network

MIB:

Monitoring of ingestive behavior

DNN:

Deep neural network

PPG:

Photoplethysmography

GMM:

Gaussian mixture model

DT:

Decision Tree

GFCC:

Greenwood Function Cepstral Coefficient

SVDD:

Support Vector Data Description

AED:

Audio Event Detection

PCG:

Phono-cardio grams

SVM:

Support vector machine

HS:

Heart Sounds

DCASE:

Detection and classification of acoustic scenes and events

GRU:

Gated recurrent unit

GTCC:

Gamma tone cepstral coefficient

ARMA:

Auto-regressive moving average

PSD:

Power spectral density

WSVM:

Weighted support vector machine

BWP-DHOA:

Best and worst position updated deer hunting optimization algorithm

LSTM:

Long short-term memory

MLP:

Multi-layer perceptron

ANN:

Artificial neural network

References

  • Abdulaziz, Y., Mumtazah, S., Ahmad, S. (2010). Infant cry recognition system: A comparison of system performance based on mel frequency and linear prediction cepstral coefficients. IEEE Access, 2010.

  • Amft, O., Stäger, M., Lukowicz, P., & Tröster, G. (2005). Analysis of chewing sounds for dietary monitoring. UbiComp 2005: Ubiquitous Computing. Springer, pp 56–72.

  • Amft, O., & Tröster, G. (2009). On-body sensing solutions for automatic dietary monitoring. IEEE Pervasive Computing, 8, 62–70.

    Article  Google Scholar 

  • Bai, M. R., & Chen, M.-C. (2007). Intelligent preprocessing and classification of audio signals. The Journal of the Audio Engineering Society, 55, 372–384.

    Google Scholar 

  • Beck, G., Duong, T., Lebbah, M., Azzag, H., & Cerin, C. (2019) A distributed approximate nearest neighbors algorithm for efficient large scale mean shift clustering. Journal of Parallel and Distributed Computing, 134, 128–139.

  • Borkar, G. M., Patil, L. H., Dalgade, D., & Hutke, A. (2019). A novel clustering approach and adaptive SVM classifier for intrusion detection in WSN: A data mining concept. Sustainable Computing: Informatics and Systems, 23, 120–135.

  • Brammya, G., Ramya, S. Praveena, N.S. Preetha, N., Ramya, R., Rajakumar, B.R., & Binu, D. (2019). Deer Hunting Optimization Algorithm: A New Nature-Inspired Meta-heuristic Paradigm. Computer Science Theory, Methods and Tools the Computer Journal.

  • Burred, J. J., & Lerch, A. (2004). Hierarchical automatic audio signal classification. The Journal of the Audio Engineering Society, 52, 724–739.

    Google Scholar 

  • Cadavid, S., Abdel-Mottaleb, M., & Helal, A. (2012). Exploiting visual quasi-periodicity for real-time chewing event detection using active appearance models and support vector machines. Personal and Ubiquitous Computing, 16(6), 729–739.

    Article  Google Scholar 

  • Chung, Y., Lee, J., Oh, S., Park, D., Chang, H. H., & Kim, S. (2013). Automatic detection of Cow’s Oestrus in Audio Surveillance System. Asian Australasian Journal of Animal Sciences, 26(7), 1030–1037.

    Article  Google Scholar 

  • Deng, L., & Xiao, L. (2013). Machine learning paradigms for speech recognition: An overview. IEEE Transactions on Audio, Speech, and Language Processing, 21(5), 1060–1089.

    Article  Google Scholar 

  • Dong, Y., Hoover, A., Scisco, J., & Muth, E. (2012). A new method for measuring meal intake in humans via automated wrist motion tracking. Applied psychophysiology and biofeedback, 37, 205–215.

    Article  Google Scholar 

  • Abid Ishaq, Saima Sadiq, Muhammad Umer, Saleem Ullah, Seyedali Mirjalili, Vaibhav Rupapara, & Michele Nappi (2021). Improving the Prediction of Heart Failure Patients’ Survival Using SMOTE and Effective Data Mining Techniques. IEEE Access, 2021.

  • Fitzgerald, D. (2010). Harmonic/Percussive Separation using Median Filtering", Proc. of the 13th Int. Conference on Digital Audio Effects (DAFx-10), Graz, Austria , September 6–10, 2010.

  • Fontana, J. M., Farooq, M., & Sazonov, E. (2014). Automatic ingestion monitor: a novel wearable device for monitoring of ingestive behavior. Biomedical Engineering, IEEE Transactions on, 61(6), 1772–1779.

    Article  Google Scholar 

  • Guido, R. C.(2016). A tutorial on signal energy and its applications. Neurocomputing, 179, 264–282.

    Article  Google Scholar 

  • Hadjileontiadis, L. J., & Rekanos, I. T. (2003). Detection of explosive lung and bowel sounds by means of fractal dimension. IEEE Signal Processing Letters, 10(10), 311–314.

    Article  Google Scholar 

  • Mingming Huang, Runsheng Lin, Shuai Huang, & Tengfei Xing (2017). A novel approach for precipitation forecast via improved K-nearest neighbor algorithm. Advanced Engineering Informatics, 33, 89–95.

  • Kalantarian, H., Sarrafzadeh, M.(2015). Audio-based detection and evaluation of eating behavior using the smartwatch platform. Computers in Biology and Medicine, 65, 1–91.

    Article  Google Scholar 

  • Leal, A., Nunes, D., Couceiro, R., Henriques, J., Carvalho, P., Quintal, I., & Teixeira, C. (2018). Noise detection in phonocardiograms by exploring similarities in spectral features. Biomedical Signal Processing and Control, 44, 154–167.

    Article  Google Scholar 

  • Lopez-Meyer, P., Makeyev, O., Schuckers, S., Melanson, E., Neuman, M., & Sazonov, E. (2010). Detection of food intake from swallowing sequences by supervised and unsupervised methods. Annals of Biomedical Engineering, 38, 2766–2774.

    Article  Google Scholar 

  • Lopez-Meyer, P., Schuckers, S., Makeyev, O., Fontana, J. M., & Sazonov, E. (2012). Automatic identification of the number of food items in a meal using clustering techniques based on the monitoring of swallowing and chewing. Biomedical signal processing and control, 7(5), 474–480.

    Article  Google Scholar 

  • Makeyeva, O., Lopez-Meyer, P., Schuckers, S., & Besio, W., Sazonov, E. (2012). Automatic food intake detection based on swallowing sounds. Biomedical Signal Processing and Control, 7(6), 649–656.

    Article  Google Scholar 

  • Marsaline Beno, M., Valarmathi I. R, Swamy, S. M., & Rajakumar, B. R. (2014). Threshold prediction for segmenting tumour from brain MRI scans. International Journal of Imaging Systems and Technology, 24,(2), 129–137.

  • Mohammad, R., Masadeh, T., Mahafzah, B. A., & Sharieh, A. A.-A. (2019). Sea Lion Optimization Algorithm. International Journal of Advanced Computer Science and Applications, 10(5), 388–395.

  • Mirjalili, S., Mohammad Mirjalili, S., Lewis, A. (2014) Grey Wolf Optimizer. Advances in Engineering Software, 69, 46–61.

    Article  Google Scholar 

  • Ntalampiras, S. (2015) Audio pattern recognition of baby crying sound events. Journal of the Audio Engineering Society, 63,(5).

  • Patrick J. Clemins, Marek B. Trawicki, Kuntoro Adi, Jidong Tao, & Michael T. Johnson (2006) Generalised perceptual features for vocalization analysis across multiple species. IEEE Access, 2006.

  • Panda, A. (2018). Denoising Algorithms using Stacked RNN models for In-Car Speech Recognition System. In 2018 4th International Conference for Convergence in Technology (I2CT) SDMIT Ujire, Mangalore, India, October 27–28, 2018.

  • Papapanagiotou, V., Diou, C., Zhou, L., van den Boer, J., Mars, M., & Delopoulos, A. (2017). A novel chewing detection system based on PPG, Audio, and Accelerometry. IEEE Journal of Biomedical and Health Informatics, 21(3), 607–618.

    Article  Google Scholar 

  • Peyron, M.-A., Blanc, O., Lund, J. P., & Woda, A. (2004). Influence of age on adaptability of human mastication. Journal of Neurophysiology, 92(2), 773–779.

    Article  Google Scholar 

  • Rodrigo Capobianco Guido. (2018). A tutorial review on entropy-based handcrafted feature extraction for information fusion. Information Fusion, 41, 161–175.

    Article  Google Scholar 

  • Sazonov, E. S., & Fontana, J. M. (2012). A sensor system for automatic detection of food intake through non-invasive monitoring of chewing. IEEE Sensors Journal, 12(5), 1340–1348.

    Article  Google Scholar 

  • Sazonov, E., Schuckers, S., Lopez-Meyer, P., Makeyev, O., Sazonova, N., Melanson, E., & Neuman, M. (2008). Non-invasive monitoring of chewing and swallowing for objective quantification of ingestive behavior. Physiological Measurement, 29, 525–541.

    Article  Google Scholar 

  • Sazonov, E., Makeyev, O., Schuckers, S., Lopez-Meyer, P., Melanson, E., & Neuman, M. (2010). Automatic detection of swallowing events by acoustical means for applications of monitoring of ingestive behavior. IEEE Transactions on Biomedical Engineering, 57, 626–633.

    Article  Google Scholar 

  • Sumit, H. (2021). Sailee Bhambere, Abhishek B, “Rapid Digitization of Healthcare - A Review of COVID-19 Impact on our Health systems.” International Journal of All Research Education and Scientific Methods, 9(2), 1457–1459.

    Google Scholar 

  • S. M. Swamy, B. R. Rajakumar and I. R. Valarmathi, “Design of Hybrid Wind and Photovoltaic Power System using Opposition-based Genetic Algorithm with Cauchy Mutation”, IET Chennai Fourth International Conference on Sustainable Energy and Intelligent Systems (SEISCON 2013), Chennai, India, Dec. 2013.

  • T.Aditya Sai Srinivas, S.S.Manivannan, "Prevention of Hello Flood Attack in IoT using combination of Deep Learning with Improved Rider Optimization Algorithm", Computer Communications, pp. 162–175, Volume 163, 2020.

  • Smith Tsang, Ben Kao, Kevin Y. Yip, Wai-Shing Ho, & Sau Dan Lee (2011). Decision Trees for Uncertain Data. IEEE Transactions on Knowledge and Data Engineering, 23,(1).

  • Xavier Valero & Francesc Alías (2012). Gammatone Cepstral Coefficients: Biologically Inspired Features for Non-Speech Audio Classification. IEEE Transactions on Multimedia, 14(6).

  • Rivarol Vergin, & Douglas O’Shaughnessy (1999). Generalized Mel Frequency Cepstral Coefficients for Large-Vocabulary Speaker-Independent Continuous-Speech Recognition. IEEE Transactions on Speech and Audio Processing, 7(5).

  • Vesperini, F., Gabrielli, L., Principi, E., & Squartini, S. (2019). Polyphonic Sound Event Detection by Using Capsule Neural Networks. IEEE Journal of Selected Topics in Signal Processing, 13(2), 310–322.

    Article  Google Scholar 

  • Waldekar, S. (2018). Goutam Saha “Classification of audio scenes with novel features in a fused system framework,.” Digital Signal Processing, 75, 71–82.

    Article  MathSciNet  Google Scholar 

  • Dongshu Wang, Dapei Tan, & Lei Liu (2018) Particle swarm optimization algorithm: an overview. Soft Computing, 22, 387–408.

  • Wang, S., Zhou, G., Ma, Y., Lisha, Hu., Chen, Z., Chen, Y., & Zhao, H., Jung, W. (2018). Eating detection and chews counting through sensing mastication muscle contraction. Smart Health, 9–10, 179–191.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to A. Maria.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Maria, A., Jeyaseelan, A.S. Development of Optimal Feature Selection and Deep Learning Toward Hungry Stomach Detection Using Audio Signals. J Control Autom Electr Syst 32, 853–874 (2021). https://doi.org/10.1007/s40313-021-00727-8

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s40313-021-00727-8

Keywords

Navigation