Feature Extraction of Surround Sound Recordings for Acoustic Scene Classification
This paper extends the traditional methodology of acoustic scene classification based on machine listening towards a new class of multichannel audio signals. It identifies a set of new features of five-channel surround recordings for classification of the two basic spatial audio scenes. Moreover, it compares the three artificial intelligence-based classification approaches to audio scene classification. The results indicate that the method based on the early fusion of features is superior compared to those involving the late fusion of signal metrics.
KeywordsMachine listening Acoustic scene classification Feature extraction Ensemble-based classifiers
This work was supported by a grant S/WI/1/2013 from Bialystok University of Technology and funded from the resources for research by Ministry of Science and Higher Education.
- 3.Chu, S., Narayanan, S., Jay Kuo C.-C., Matarić, M.J.: Where am I? Scene recognition for mobile robots using audio features. In: Proceedings of IEEE International Conference on Multimedia and Expo, Toronto, Canada, pp. 885–888. IEEE (2006)Google Scholar
- 4.Petetin, Y., Laroche, C., Mayoue, A.: Deep neural networks for audio scene recognition. In: Proceedings of 23rd European Signal Processing Conference (EUSIPCO), Nice, France, pp. 125–129. IEEE (2015)Google Scholar
- 9.Geiger, J.T., Schuller, B., Rigoll, G.: Large-scale audio feature extraction and SVM for acoustic scene classification. In: Proceedings of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, NY. IEEE (2013)Google Scholar
- 14.ITU-R Rec. BS.775: Multichannel stereophonic sound system with and without accompanying picture. International Telecommunication Union, Geneva, Switzerland (2012)Google Scholar
- 16.Rumsey, F.: Spatial quality evaluation for reproduced sound: terminology, meaning, and a scene-based paradigm. J. Audio Eng. Soc. 50(9), 651–666 (2002)Google Scholar
- 17.Breiman, L., Cutler, A.: Random Forests for Classification and Regression. https://www.stat.berkeley.edu/~breiman/RandomForests. Accessed 18 Nov 2017
- 21.Ridgeway, G.: Generalized Boosted Regression Models. http://code.google.com/p/gradientboostedmodels. Accessed 18 Nov 2017
- 26.Gardner, B., Martin, K.: HRTF Measurements of a KEMAR Dummy-Head Microphone. http://sound.media.mit.edu/resources/KEMAR.html. Accessed 16 Nov 2017