Abstract
Purpose
Vocalizations of livestock convey information about the health and behavior of the animals, and vocal analysis could be a useful method to monitor livestock. We propose a deep learning classification of vocal recordings of laying hens and cattle with the aim of automatically classifying laying hen and cattle sounds in South Korea using a deep learning model.
Methods
Audio and video recordings of laying hens and cattle were acquired. We classified laying hens’ sounds into eight classes and cattle sounds into nine classes. Classified audio files were used for the development of convolutional neural network (CNN) models. Two types of CNN structures, one based on 2D ConVnet and the other based on a 1D model with long short-term memory, were developed and tested for modeling to classify the vocalizations of laying hens and cattle.
Results
The classification model based on 2D ConVnet performed better with a satisfactory classification accuracy of 75.78% for laying hens and 91.02% for cattle.
Conclusion
Based on the results for the developed CNN models, it is expected that real-time voice monitoring could be applicable for providing animal physiological information to growers.
Similar content being viewed by others
References
Chen, H.-M., Huang, C.-J., Chen, Y.-J., Chen, C.-Y., & Chien, S.-Y. (2015). An intelligent nocturnal animal vocalization recognition system. International Journal of Computer and Communication Engineering, 4(1), 39–45.
Guo, M., & Kuenzle, B. (2019). Obtaining narrow transition region in STFT domain processing using subband filters. ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 970–974. IEEE
Hershey, S., Chaudhuri, S., Ellis, D. P. W., Gemmeke, J. F., Jansen, A., Moore, R. C., et al. (2017). CNN architectures for large-scale audio classification. In 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 131–135). https://doi.org/10.1109/ICASSP.2017.7952132.
Huang, J., Wang, W., & Zhang, T. (2019). Method for detecting avian influenza disease of chickens based on sound analysis. Biosystems Engineering, 180, 16–24.
Ikeda, Y., & Ishii, Y. (2008). Recognition of two psychological conditions of a single cow by her voice. Computers and Electronics in Agriculture, 62(1), 67–72. https://doi.org/10.1016/j.compag.2007.08.012.
Jung, D. H., Kim, N. Y., Moon, S. H., Jhin, C., Kim, H. J., Yang, J. S., ... & Park, S. H. (2021). Deep learning-based cattle vocal classification model and real-time livestock monitoring system with noise filtering. Animals, 11(2), 357. https://doi.org/10.3390/ani11020357
Meen, G. H., Schellekens, M. A., Slegers, M. H. M., Leenders, N. L. G., van Erp-van der Kooij, E., & Noldus, L. P. J. J. (2015). Sound analysis in dairy cattle vocalisation as a potential welfare monitor. Computers and Electronics in Agriculture, 118, 111–115. https://doi.org/10.1016/j.compag.2015.08.028.
Noda, K., Yamaguchi, Y., Nakadai, K., Okuno, H. G., & Ogata, T. (2015). Audio-visual speech recognition using deep learning. Applied Intelligence, 42(4), 722–737.
Riede, T., Tembrock, G., Herzel, H., & Brunnberg, L. (1997). Vocalization as an indicator for disorders in mammals. ASA.
Sadeghi, M., Banakar, A., Khazaee, M., & Soleimani, M. R. (2015). An intelligent procedure for the detection and classification of chickens infected by clostridium perfringens based on their vocalization. Brazilian Journal of Poultry Science, 17(4), 537–544.
Sahidullah, M., & Saha, G. (2012). Design, analysis and experimental evaluation of block based transformation in MFCC computation for speaker recognition. Speech Communication, 54(4), 543–565.
Şaşmaz, E., & Tek, F. B. (2018). Animal sound classification using a convolutional neural network. 2018 3rd International Conference on Computer Science and Engineering (UBMK), 625–629. IEEE
Sauvé, C. C., Beauplet, G., Hammill, M. O., & Charrier, I. (2015). Mother–pup vocal recognition in harbour seals: influence of maternal behaviour, pup voice and habitat sound properties. Animal Behaviour, 105, 109–120.
Tek, F. B., Cannavo, F., Nunnari, G., & Kale, İ. (2014). Robust localization and identification of African clawed frogs in digital images. Ecological Informatics, 23, 3–12.
Xu, M., Duan, L.-Y., Cai, J., Chia, L.-T., Xu, C., & Tian, Q. (2004). HMM-based audio keyword generation. Pacific-Rim Conference on Multimedia (pp. 566–574). Springer
Xuan, C., Ma, Y., Wu, P., Zhang, L., Hao, M., & Zhang, X. (2016). Behavior classification and recognition for facility breeding sheep based on acoustic signal weighted feature. Transactions of the Chinese Society of Agricultural Engineering, 32(19), 195–202.
Funding
This study was carried out with the support of the Research Program for Agricultural Science & Technology Development (Project No. PJ01389103), National Institute of Agricultural Sciences, Rural Development Administration, Republic of Korea.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no competing interests.
Rights and permissions
About this article
Cite this article
Jung, DH., Kim, N.Y., Moon, S.H. et al. Classification of Vocalization Recordings of Laying Hens and Cattle Using Convolutional Neural Network Models. J. Biosyst. Eng. 46, 217–224 (2021). https://doi.org/10.1007/s42853-021-00101-1
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s42853-021-00101-1