Abstract
The process of transmitting signals to the body regarding the hungry stomach is referred to as the migrating motor complex (MMC) process. The intestines and stomach are considered for sensing the unavailability of food in the body. Hence, the receptors present in the stomach wall generate the electrical activity waves and activate the hunger. In general, audio signal processing algorithms include signal analysis, property extraction, and behavior prediction, identifying the pattern available in the signal, and predicting how a specific signal is correlated to various identical signals. The major challenge here is to consider the audio signals that are produced from the stomach for identifying the growling sound that well describes the hungry. The main intent of this paper is to develop an intelligent model using the deep learning concept for recognizing the hungry stomach by using the synthetically collected audio signals through mobile phones. This makes society reaching the hungry stomach by way of intelligent technology. The proposed detection model covers different phases for automated hungry stomach detection. The data acquisition is performed by gathering information using mobile phones. Further, the pre-processing of the signals is done by the median filtering and the smoothening methods. In order to perform the proper classification, the spectral features like spectral centroid, spectral roll-off, spectral skewness, spectral kurtosis, spectral slope, spectral crest factor, and spectral flux, and cepstral domain features like mel-frequency cepstral coefficients (MFCCs), linear prediction cepstral coefficients (LPCCs), Perceptual linear prediction (PLP) cepstral coefficients, Greenwood function cepstral coefficients (GFCC), and gammatone cepstral coefficients (GTCCs) are extracted. Further, the optimal feature selection is done by the improved meta-heuristic algorithm called best and worst position updated deer hunting optimization algorithm (BWP-DHOA). An improved deep learning model called optimized recurrent neural network (RNN) is used for classifying the optimal features of the audio signal into growling sound and burp sound. Finally, the performance comparison over the existing models proves the efficiency and reliability of the proposed model.
Similar content being viewed by others
Abbreviations
- MMC:
-
Migrating motor complex
- LPCC:
-
Linear prediction cepstral coefficient
- ASC:
-
Acoustic scene classification
- NN:
-
Nearest neighbor
- MFCC:
-
Mel-frequency cepstral coefficient
- CASA:
-
Computational auditory scene analysis
- SSA:
-
Spectral similarity-based algorithm
- PLP:
-
Perceptual linear prediction cepstral coefficient
- RNN:
-
Recurrent neural network
- MIB:
-
Monitoring of ingestive behavior
- DNN:
-
Deep neural network
- PPG:
-
Photoplethysmography
- GMM:
-
Gaussian mixture model
- DT:
-
Decision Tree
- GFCC:
-
Greenwood Function Cepstral Coefficient
- SVDD:
-
Support Vector Data Description
- AED:
-
Audio Event Detection
- PCG:
-
Phono-cardio grams
- SVM:
-
Support vector machine
- HS:
-
Heart Sounds
- DCASE:
-
Detection and classification of acoustic scenes and events
- GRU:
-
Gated recurrent unit
- GTCC:
-
Gamma tone cepstral coefficient
- ARMA:
-
Auto-regressive moving average
- PSD:
-
Power spectral density
- WSVM:
-
Weighted support vector machine
- BWP-DHOA:
-
Best and worst position updated deer hunting optimization algorithm
- LSTM:
-
Long short-term memory
- MLP:
-
Multi-layer perceptron
- ANN:
-
Artificial neural network
References
Abdulaziz, Y., Mumtazah, S., Ahmad, S. (2010). Infant cry recognition system: A comparison of system performance based on mel frequency and linear prediction cepstral coefficients. IEEE Access, 2010.
Amft, O., Stäger, M., Lukowicz, P., & Tröster, G. (2005). Analysis of chewing sounds for dietary monitoring. UbiComp 2005: Ubiquitous Computing. Springer, pp 56–72.
Amft, O., & Tröster, G. (2009). On-body sensing solutions for automatic dietary monitoring. IEEE Pervasive Computing, 8, 62–70.
Bai, M. R., & Chen, M.-C. (2007). Intelligent preprocessing and classification of audio signals. The Journal of the Audio Engineering Society, 55, 372–384.
Beck, G., Duong, T., Lebbah, M., Azzag, H., & Cerin, C. (2019) A distributed approximate nearest neighbors algorithm for efficient large scale mean shift clustering. Journal of Parallel and Distributed Computing, 134, 128–139.
Borkar, G. M., Patil, L. H., Dalgade, D., & Hutke, A. (2019). A novel clustering approach and adaptive SVM classifier for intrusion detection in WSN: A data mining concept. Sustainable Computing: Informatics and Systems, 23, 120–135.
Brammya, G., Ramya, S. Praveena, N.S. Preetha, N., Ramya, R., Rajakumar, B.R., & Binu, D. (2019). Deer Hunting Optimization Algorithm: A New Nature-Inspired Meta-heuristic Paradigm. Computer Science Theory, Methods and Tools the Computer Journal.
Burred, J. J., & Lerch, A. (2004). Hierarchical automatic audio signal classification. The Journal of the Audio Engineering Society, 52, 724–739.
Cadavid, S., Abdel-Mottaleb, M., & Helal, A. (2012). Exploiting visual quasi-periodicity for real-time chewing event detection using active appearance models and support vector machines. Personal and Ubiquitous Computing, 16(6), 729–739.
Chung, Y., Lee, J., Oh, S., Park, D., Chang, H. H., & Kim, S. (2013). Automatic detection of Cow’s Oestrus in Audio Surveillance System. Asian Australasian Journal of Animal Sciences, 26(7), 1030–1037.
Deng, L., & Xiao, L. (2013). Machine learning paradigms for speech recognition: An overview. IEEE Transactions on Audio, Speech, and Language Processing, 21(5), 1060–1089.
Dong, Y., Hoover, A., Scisco, J., & Muth, E. (2012). A new method for measuring meal intake in humans via automated wrist motion tracking. Applied psychophysiology and biofeedback, 37, 205–215.
Abid Ishaq, Saima Sadiq, Muhammad Umer, Saleem Ullah, Seyedali Mirjalili, Vaibhav Rupapara, & Michele Nappi (2021). Improving the Prediction of Heart Failure Patients’ Survival Using SMOTE and Effective Data Mining Techniques. IEEE Access, 2021.
Fitzgerald, D. (2010). Harmonic/Percussive Separation using Median Filtering", Proc. of the 13th Int. Conference on Digital Audio Effects (DAFx-10), Graz, Austria , September 6–10, 2010.
Fontana, J. M., Farooq, M., & Sazonov, E. (2014). Automatic ingestion monitor: a novel wearable device for monitoring of ingestive behavior. Biomedical Engineering, IEEE Transactions on, 61(6), 1772–1779.
Guido, R. C.(2016). A tutorial on signal energy and its applications. Neurocomputing, 179, 264–282.
Hadjileontiadis, L. J., & Rekanos, I. T. (2003). Detection of explosive lung and bowel sounds by means of fractal dimension. IEEE Signal Processing Letters, 10(10), 311–314.
Mingming Huang, Runsheng Lin, Shuai Huang, & Tengfei Xing (2017). A novel approach for precipitation forecast via improved K-nearest neighbor algorithm. Advanced Engineering Informatics, 33, 89–95.
Kalantarian, H., Sarrafzadeh, M.(2015). Audio-based detection and evaluation of eating behavior using the smartwatch platform. Computers in Biology and Medicine, 65, 1–91.
Leal, A., Nunes, D., Couceiro, R., Henriques, J., Carvalho, P., Quintal, I., & Teixeira, C. (2018). Noise detection in phonocardiograms by exploring similarities in spectral features. Biomedical Signal Processing and Control, 44, 154–167.
Lopez-Meyer, P., Makeyev, O., Schuckers, S., Melanson, E., Neuman, M., & Sazonov, E. (2010). Detection of food intake from swallowing sequences by supervised and unsupervised methods. Annals of Biomedical Engineering, 38, 2766–2774.
Lopez-Meyer, P., Schuckers, S., Makeyev, O., Fontana, J. M., & Sazonov, E. (2012). Automatic identification of the number of food items in a meal using clustering techniques based on the monitoring of swallowing and chewing. Biomedical signal processing and control, 7(5), 474–480.
Makeyeva, O., Lopez-Meyer, P., Schuckers, S., & Besio, W., Sazonov, E. (2012). Automatic food intake detection based on swallowing sounds. Biomedical Signal Processing and Control, 7(6), 649–656.
Marsaline Beno, M., Valarmathi I. R, Swamy, S. M., & Rajakumar, B. R. (2014). Threshold prediction for segmenting tumour from brain MRI scans. International Journal of Imaging Systems and Technology, 24,(2), 129–137.
Mohammad, R., Masadeh, T., Mahafzah, B. A., & Sharieh, A. A.-A. (2019). Sea Lion Optimization Algorithm. International Journal of Advanced Computer Science and Applications, 10(5), 388–395.
Mirjalili, S., Mohammad Mirjalili, S., Lewis, A. (2014) Grey Wolf Optimizer. Advances in Engineering Software, 69, 46–61.
Ntalampiras, S. (2015) Audio pattern recognition of baby crying sound events. Journal of the Audio Engineering Society, 63,(5).
Patrick J. Clemins, Marek B. Trawicki, Kuntoro Adi, Jidong Tao, & Michael T. Johnson (2006) Generalised perceptual features for vocalization analysis across multiple species. IEEE Access, 2006.
Panda, A. (2018). Denoising Algorithms using Stacked RNN models for In-Car Speech Recognition System. In 2018 4th International Conference for Convergence in Technology (I2CT) SDMIT Ujire, Mangalore, India, October 27–28, 2018.
Papapanagiotou, V., Diou, C., Zhou, L., van den Boer, J., Mars, M., & Delopoulos, A. (2017). A novel chewing detection system based on PPG, Audio, and Accelerometry. IEEE Journal of Biomedical and Health Informatics, 21(3), 607–618.
Peyron, M.-A., Blanc, O., Lund, J. P., & Woda, A. (2004). Influence of age on adaptability of human mastication. Journal of Neurophysiology, 92(2), 773–779.
Rodrigo Capobianco Guido. (2018). A tutorial review on entropy-based handcrafted feature extraction for information fusion. Information Fusion, 41, 161–175.
Sazonov, E. S., & Fontana, J. M. (2012). A sensor system for automatic detection of food intake through non-invasive monitoring of chewing. IEEE Sensors Journal, 12(5), 1340–1348.
Sazonov, E., Schuckers, S., Lopez-Meyer, P., Makeyev, O., Sazonova, N., Melanson, E., & Neuman, M. (2008). Non-invasive monitoring of chewing and swallowing for objective quantification of ingestive behavior. Physiological Measurement, 29, 525–541.
Sazonov, E., Makeyev, O., Schuckers, S., Lopez-Meyer, P., Melanson, E., & Neuman, M. (2010). Automatic detection of swallowing events by acoustical means for applications of monitoring of ingestive behavior. IEEE Transactions on Biomedical Engineering, 57, 626–633.
Sumit, H. (2021). Sailee Bhambere, Abhishek B, “Rapid Digitization of Healthcare - A Review of COVID-19 Impact on our Health systems.” International Journal of All Research Education and Scientific Methods, 9(2), 1457–1459.
S. M. Swamy, B. R. Rajakumar and I. R. Valarmathi, “Design of Hybrid Wind and Photovoltaic Power System using Opposition-based Genetic Algorithm with Cauchy Mutation”, IET Chennai Fourth International Conference on Sustainable Energy and Intelligent Systems (SEISCON 2013), Chennai, India, Dec. 2013.
T.Aditya Sai Srinivas, S.S.Manivannan, "Prevention of Hello Flood Attack in IoT using combination of Deep Learning with Improved Rider Optimization Algorithm", Computer Communications, pp. 162–175, Volume 163, 2020.
Smith Tsang, Ben Kao, Kevin Y. Yip, Wai-Shing Ho, & Sau Dan Lee (2011). Decision Trees for Uncertain Data. IEEE Transactions on Knowledge and Data Engineering, 23,(1).
Xavier Valero & Francesc Alías (2012). Gammatone Cepstral Coefficients: Biologically Inspired Features for Non-Speech Audio Classification. IEEE Transactions on Multimedia, 14(6).
Rivarol Vergin, & Douglas O’Shaughnessy (1999). Generalized Mel Frequency Cepstral Coefficients for Large-Vocabulary Speaker-Independent Continuous-Speech Recognition. IEEE Transactions on Speech and Audio Processing, 7(5).
Vesperini, F., Gabrielli, L., Principi, E., & Squartini, S. (2019). Polyphonic Sound Event Detection by Using Capsule Neural Networks. IEEE Journal of Selected Topics in Signal Processing, 13(2), 310–322.
Waldekar, S. (2018). Goutam Saha “Classification of audio scenes with novel features in a fused system framework,.” Digital Signal Processing, 75, 71–82.
Dongshu Wang, Dapei Tan, & Lei Liu (2018) Particle swarm optimization algorithm: an overview. Soft Computing, 22, 387–408.
Wang, S., Zhou, G., Ma, Y., Lisha, Hu., Chen, Z., Chen, Y., & Zhao, H., Jung, W. (2018). Eating detection and chews counting through sensing mastication muscle contraction. Smart Health, 9–10, 179–191.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Maria, A., Jeyaseelan, A.S. Development of Optimal Feature Selection and Deep Learning Toward Hungry Stomach Detection Using Audio Signals. J Control Autom Electr Syst 32, 853–874 (2021). https://doi.org/10.1007/s40313-021-00727-8
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s40313-021-00727-8