Abstract
The classification of acoustic events is useful to describe the scene and can contribute to improve the robustness of different speech technologies. However, the events are usually overlapped with speech or other sounds. This work proposes an approach based on Factor Analysis to compensate the variability of the acoustic events due to overlap with speech. The system is evaluated in the CLEAR evaluation database composed of recordings in meeting rooms where the acoustic events have been spontaneously generated in five different locations. The experiments are divided in two sets. Firstly, isolated acoustic events are used as development to analyze and evaluate parameters of the Factor Analysis system. Secondly, the system is compared to a baseline based on Gaussians Mixture Models with Hidden Markov Models. The Factor Analysis approach improves the total error rate due to the variability compensation of overlapped segments.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Atrey, P.K., Maddage, N.C., Kankanhalli, M.S.: Audio Based Event Detection for Multimedia Surveillance. In: 2006 IEEE International Conference on Acoustics Speed and Signal Processing Proceedings, pp. V–813–V–816 (2006)
Chen, J., Zhang, J., Kam, A.H., Shue, L.: An Automatic Acoustic Bathroom Monitoring System. In: 2005 IEEE International Symposium on Circuits and Systems, pp. 1750–1753 (2005)
van Hout, J., Akbacak, M., Castan, D., Yeh, E., Sanchez, M.: Extracting Spoken and Acoustic Concepts For Multimedia Event Detection. In: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 2–6 (2013)
Huang, Z., Cheng, Y.-C., Li, K., Hautamaki, V., Lee, C.-H.: A Blind Segmentation Approach to Acoustic Event Detection Based on I-Vector. In: Proc. Interspeech, pp. 2282–2286 (August 2013)
Castan, D., Akbacak, M.: Indexing Multimedia Documents with Acoustic Concept Recognition Lattices. In: Interspeech, pp. 3–7 (2013)
Temko, A., Malkin, R., Zieger, C., Macho, D.: Acoustic event detection and classification in smart-room environments: Evaluation of CHIL project systems. IV Jornadas en Tecnología del Habla, 1–6 (2006)
Zhou, X., Zhuang, X., Liu, M., Tang, H., Hasegawa-Johnson, M., Huang, T.: HMM-based acoustic event detection with AdaBoost feature selection. In: Stiefelhagen, R., Bowers, R., Fiscus, J. (eds.) RT 2007 and CLEAR 2007. LNCS, vol. 4625, pp. 345–353. Springer, Heidelberg (2008)
Butko, T., Camprubí, C.N.: Detection of overlapped acoustic events using fusion of audio and video modalities. In: Proc. FALA, pp. 165–168 (2010)
Chakraborty, R.: Acoustic Event Detection and Localization using Distributed Microphone Arrays. PhD thesis (2013)
Temko, A., Macho, D., Nadeu, C., Segura, C.: UPC-TALP Database of Isolated Acoustic Events. In: Internal UPC report (2005)
Kenny, P., Boulianne, G., Ouellet, P., Dumouchel, P.: Joint Factor Analysis Versus Eigenchannels in Speaker Recognition. IEEE Trans. Audio Speech Lang. 15(4), 1435–1447 (2007)
Castan, D., Ortega, A., Villalba, J., Miguel, A., Lleida, E.: Segmentation-By-Classification System Based on Factor Analysis. In: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2013)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Castán, D., Ortega, A., Miguel, A., Lleida, E. (2014). A Preliminary Study of Acoustic Events Classification with Factor Analysis in Meeting Rooms. In: Navarro Mesa, J.L., et al. Advances in Speech and Language Technologies for Iberian Languages. Lecture Notes in Computer Science(), vol 8854. Springer, Cham. https://doi.org/10.1007/978-3-319-13623-3_22
Download citation
DOI: https://doi.org/10.1007/978-3-319-13623-3_22
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-13622-6
Online ISBN: 978-3-319-13623-3
eBook Packages: Computer ScienceComputer Science (R0)