Skip to main content

NMF-Based Spectral Analysis for Acoustic Event Classification Tasks

  • Conference paper
Advances in Nonlinear Speech Processing (NOLISP 2013)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 7911))

Included in the following conference series:

Abstract

In this paper, we propose a new front-end for Acoustic Event Classification tasks (AEC). First, we study the spectral contents of different acoustic events by applying Non-Negative Matrix Factorization (NMF) on their spectral magnitude and compare them with the structure of speech spectra. Second, from the findings of this study, we propose a new parameterization for AEC, which is an extension of the conventional Mel Frequency Cepstrum Coefficients (MFCC) and is based on the high pass filtering of acoustic event spectra. Also, the influence of different frequency scales on the classification rate of the whole system is studied. The evaluation of the proposed features for AEC shows that relative error reductions about 12% at segment level and about 11% at target event level with respect to the conventional MFCC are achieved.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 54.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 72.00
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Temko, A., Nadeu, C.: Classification of acoustic events using SVM-based clustering schemes. Pattern Recognition 39, 684–694 (2006)

    Article  Google Scholar 

  2. Zieger, C.: An HMM based system for acoustic event detection. In: Stiefelhagen, R., Bowers, R., Fiscus, J.G. (eds.) RT 2007 and CLEAR 2007. LNCS, vol. 4625, pp. 338–344. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  3. Zhuang, X., Zhou, X., Hasegawa-Johnson, M.A., Huang, T.S.: Real-world acoustic event detection. Pattern Recognition Letters 31, 1543–1551 (2010)

    Article  Google Scholar 

  4. Kwangyoun, K., Hanseok, K.: Hierarchical approach for abnormal acoustic event classification in an elevator. In: IEEE Int. Conf. AVSS, pp. 89–94 (2011)

    Google Scholar 

  5. Portelo, J., Bugalho, M., Trancoso, I., Neto, J., Abad, A., Serralheiro, A.: Non speech audio event detection. In: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), pp. 1973–1976 (2009)

    Google Scholar 

  6. Meng, A., Ahrendt, P., Larsen, J.: Temporal feature integration for music genre classification. IEEE Trans. on Audio, Speech, and Language Processing 15, 1654–1664 (2007)

    Article  Google Scholar 

  7. Mejía-Navarrete, D., Gallardo-Antolín, A., Peláez, C., Valverde, F.: Feature extraction assesment for an acoustic-event classification task using the entropy triangle. In: Interspeech, pp. 309–312 (2011)

    Google Scholar 

  8. Lee, D., Seung, H.: Algorithms for non-negative matrix factorization. Nature 401, 788–791 (1999)

    Article  Google Scholar 

  9. Wilson, K., Raj, B., Smaragdis, P., Divakaran, A.: Speech denoising using nonnegative matrix factorization with priors. In: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), pp. 4029–4032 (2008)

    Google Scholar 

  10. Ludeña-Choez, J., Gallardo-Antolín, A.: Speech denoising using non-negative matrix factorization with kullback-leibler divergence and sparseness constraints. In: Torre Toledano, D., Ortega Giménez, A., Teixeira, A., González Rodríguez, J., Hernández Gómez, L., San Segundo Hernández, R., Ramos Castro, D. (eds.) IberSPEECH 2012. CCIS, vol. 328, pp. 207–216. Springer, Heidelberg (2012)

    Chapter  Google Scholar 

  11. Schuller, B., Weninger, F., Wollmer, M.: Non-negative matrix factorization as noise-robust feature extractor for speech recognition. In: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), pp. 4562–4565 (2010)

    Google Scholar 

  12. FBK-Irst database of isolated meeting-room acoustic events, ELRA Catalog no. S0296

    Google Scholar 

  13. UPC-TALP database of isolated meeting-room acoustic events, ELRA Catalog no. S0268

    Google Scholar 

  14. The ShATR multiple simultaneous speaker corpus, http://www.dcs.shef.ac.uk/spandh/projects/shatrweb/index.html

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ludeña-Choez, J., Gallardo-Antolín, A. (2013). NMF-Based Spectral Analysis for Acoustic Event Classification Tasks. In: Drugman, T., Dutoit, T. (eds) Advances in Nonlinear Speech Processing. NOLISP 2013. Lecture Notes in Computer Science(), vol 7911. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38847-7_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-38847-7_2

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-38846-0

  • Online ISBN: 978-3-642-38847-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics