Abstract
Convolutional Neural Networks (CNNs) have been widely used in the field of audio recognition and classification, since they often provide positive results. Motivated by the success of this kind of approach and the lack of practical methodologies for the monitoring of construction sites by using audio data, we developed an application for the classification of different types and brands of construction vehicles and tools, which operates on the emitted audio through a stack of convolutional layers. The proposed architecture works on the mel-spectrogram representation of the input audio frames and it demonstrates its effectiveness in environmental sound classification (ESC) achieving a high accuracy. In summary, our contribution shows that techniques employed for general ESC can be also successfully adapted to a more specific environmental sound classification task, such as event recognition in construction sites.
Keywords
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Available at: https://librosa.github.io/.
References
Scardapane, S., Scarpiniti, M., Bucciarelli, M., Colone, F., Mansueto, M.V., Parisi, R.: Microphone array based classification for security monitoring in unstructured environments. AEÜ Int. J. Electron. Commun. 69(11), 1715–1723 (2015)
Weinstein, E., Steele, K., Agarwal, A., Glass, J.: LOUD: a 1020-node modular micro-phone array and beamformer for intelligent computing spaces. Technical Report MIT/LCS Technical Memo MIT-LCS-TM-642 (2004)
Kaushik, B., Nance, D., Ahuja, K.K.: A review of the role of acoustic sensors in the modern battlefield. In: Proceedings of the 11th AIAA/CEAS Aeroacoustics Conference, pp. 1–13 (2005)
Wang, D., Brown, G.J.: Computational Auditory Scene Analysis: Principles, Algorithms, and Applications. Wiley-IEEE Press (2006)
Fu, Z., Lu, G., Ting, K.M., Zhang, D.: A survey of audio-based music classification and annotation. IEEE Trans. Multimed. 13(2), 303–319 (2011)
Cheng, C.-F., Rashidi, A., Davenport, M.A., Anderson, D.V.: Activity analysis of construction equipment using audio signals and support vector machines. Autom. Constr. 81, 240–253 (2017)
Zhang, T., Lee, Y.-C., Scarpiniti, M., Uncini, A.: A supervised machine learning-based sound identification for construction activity monitoring and performance evaluation. In: Proceedings of 2018 Construction Research Congress (CRC 2018), New Orleans, Louisiana, USA, pp. 358–366, 2–4 April 2018
Lee, Y.-C., Scarpiniti, M., Uncini, A.: Advanced sound identification classifiers using a grid search algorithm for accurate audio-based construction progress monitoring. J. Comput. Civil Eng. (2020)
Sherafat, B., Rashidi, A., Lee, Y.-C., Ahn, C.R.: A hybrid kinematic-acoustic system for automated activity detection of construction equipment. Sensors 19(19), 4286 (2019)
Goodfellow, I., Bengio, Y., Courville, A.: Deep Learning. MIT Press (2016)
Piczak, K.J.: Environmental sound classification with convolutional neural networks. In: 2015 IEEE 25th International Workshop on Machine Learning for Signal Processing (MLSP), pp. 1–6, Sept. 2015
Tokozume, Y., Harada, T.: Learning environmental sounds with end-to-end convolutional neural network. In: 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 2721–2725, March 2017
Xie, Y., Lee, Y.-C., Scarpiniti, M.: Deep Learning-Based Highway Construction and Maintenance Activities Monitoring in Night Time. Construction Research Congress (CRC 2020), Tempe, AZ, USA, 8–10 March 2020
Li, S., Yao, Y., Hu, J., Liu, G., Yao, X., Hu, J.: An ensemble stacked convolutional neural network model for environmental event sound recognition. Appl. Sci. 8(7) (2018)
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. In: ICLR (2015)
Stevens, S.S., Volkmann, J., Newman, E.B.: A scale for the measurement of the psychological magnitude pitch (1937)
Alpaydin, E.: Introduction to Machine Learning, 3rd edn. MIT Press (2014)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this chapter
Cite this chapter
Maccagno, A., Mastropietro, A., Mazziotta, U., Scarpiniti, M., Lee, YC., Uncini, A. (2021). A CNN Approach for Audio Classification in Construction Sites. In: Esposito, A., Faundez-Zanuy, M., Morabito, F., Pasero, E. (eds) Progresses in Artificial Intelligence and Neural Systems. Smart Innovation, Systems and Technologies, vol 184. Springer, Singapore. https://doi.org/10.1007/978-981-15-5093-5_33
Download citation
DOI: https://doi.org/10.1007/978-981-15-5093-5_33
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-5092-8
Online ISBN: 978-981-15-5093-5
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)