Skip to main content

Automatic Sound Classification for Improving Speech Intelligibility in Hearing Aids Using a Layered Structure

  • Conference paper
Book cover Intelligent Data Engineering and Automated Learning – IDEAL 2006 (IDEAL 2006)

Abstract

This paper presents some of our first results in the development of an automatic sound classification algorithm for hearing aids. The goal is to classify the input audio signal into four different categories: speech in quiet, speech in noise, stationary noise and non-stationary noise. In order to make the system more robust, a divide and conquer strategy is proposed, resulting thus in a layered structure. The considered classification algorithms will be based on the Fisher linear discriminant and neural networks. Some results will be given demonstrating the good behavior of the system compared with a classical approach with a four-classes classifier based on neural networks.

This work has been partially financed by the Universidad de Alcalá (UAH PI2005/081) and Comunidad de Madrid/Universidad de Alcalá (CAM-UAH2005/036).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Keidser, G.: The relationships between listening conditions and alterative amplification schemes for multiple memory hearing aids. Ear Hear 16, 575–586 (1995)

    Article  Google Scholar 

  • Keidser, G.: Selecting different amplification for different listening conditions. J. of the American Academy of Audiology 7, 92–104 (1996)

    Google Scholar 

  • Büchler, M.: Algorithms for sound classification in hearing instruments. PhD thesis, Swiss Federal Institute of Technology, Zurich (2002)

    Google Scholar 

  • Nordqvist, P., Leijon, A.: An efficient robust sound classification algorithm for hearing aids. J. Acoustic Soc. Am. 115(6), 3033–3041 (2004)

    Article  Google Scholar 

  • Alexandre, E., Rosa, M., Cuadra, L., Gil-Pita, R.: Application of Fisher linear discriminant analysis to speech/music classification. In: AES 120th Convention (2006)

    Google Scholar 

  • Guaus, E., Batlle, E.: A non-linear rhythm-based style classification for broadcast speech-music discrimination. In: AES 116th Convention (2004)

    Google Scholar 

  • Lu, L., Zhang, H.J., Jiang, H.: Content analysis for audio classification and segmentation. IEEE Transactions on speech and audio processing 10(7), 504–516 (2002)

    Article  Google Scholar 

  • Scheirer, E., Slaney, M.: Construction and evaluation of a robust multifeature speech/music discriminator. In: ICASSP (1997)

    Google Scholar 

  • Davis, S., Mermelstein, P.: Experiments in syllable-based recognition of continuous speech. IEEE Transactions on Acoustics, Speech and Signal Processing 28, 357–366 (1980)

    Article  Google Scholar 

  • Logan, B.: Mel frequency cepstral coefficients for music modeling. In: Int. Symp. Music Information Retrieval, ISMIR (2000)

    Google Scholar 

  • Tzanetakis, G., Cook, P.: Musical genre classification of audio signals. IEEE Transactions on speech and audio processing 10(5), 293–302 (2002)

    Article  Google Scholar 

  • Saunders, J.: Real time discrimination of broadcast speech/music. In: ICASSSP, pp. 993–996 (1996)

    Google Scholar 

  • ITU-T: Objective measurement of active speech level. Recommendation P.56 (1993)

    Google Scholar 

  • Batlle, E., Neuschmied, H., Uray, P., Ackerman, G.: Recognition and analysis of audio for copyright protection: the RAA project. Journal of the American Society for Information Science and Technology 55(12), 1084–1091 (2004)

    Article  Google Scholar 

  • Fisher, R.: The use of multiple measurements in taxonomic problems. Annals of Eugenics 7, 179–188 (1936)

    Google Scholar 

  • Haykin, S.: Neural Networks: A comprehensive foundation. Prentice-Hall, Englewood Cliffs (1999)

    MATH  Google Scholar 

  • Jain, A.K., Duin, R.P., Mao, J.: Statistical pattern recognition: a review. IEEE Transactions on pattern analysis and machine intelligence 22(1), 4–37 (2000)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Alexandre, E., Cuadra, L., Álvarez, L., Rosa-Zurera, M., López-Ferreras, F. (2006). Automatic Sound Classification for Improving Speech Intelligibility in Hearing Aids Using a Layered Structure. In: Corchado, E., Yin, H., Botti, V., Fyfe, C. (eds) Intelligent Data Engineering and Automated Learning – IDEAL 2006. IDEAL 2006. Lecture Notes in Computer Science, vol 4224. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11875581_37

Download citation

  • DOI: https://doi.org/10.1007/11875581_37

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-45485-4

  • Online ISBN: 978-3-540-45487-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics