Automatic Sound Classification for Improving Speech Intelligibility in Hearing Aids Using a Layered Structure

Alexandre, Enrique; Cuadra, Lucas; Álvarez, Lorena; Rosa-Zurera, Manuel; López-Ferreras, Francisco

doi:10.1007/11875581_37

Enrique Alexandre²⁰,
Lucas Cuadra²⁰,
Lorena Álvarez²⁰,
Manuel Rosa-Zurera²⁰ &
…
Francisco López-Ferreras²⁰

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4224))

Included in the following conference series:

International Conference on Intelligent Data Engineering and Automated Learning

1356 Accesses
10 Citations

Abstract

This paper presents some of our first results in the development of an automatic sound classification algorithm for hearing aids. The goal is to classify the input audio signal into four different categories: speech in quiet, speech in noise, stationary noise and non-stationary noise. In order to make the system more robust, a divide and conquer strategy is proposed, resulting thus in a layered structure. The considered classification algorithms will be based on the Fisher linear discriminant and neural networks. Some results will be given demonstrating the good behavior of the system compared with a classical approach with a four-classes classifier based on neural networks.

This work has been partially financed by the Universidad de Alcalá (UAH PI2005/081) and Comunidad de Madrid/Universidad de Alcalá (CAM-UAH2005/036).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Keidser, G.: The relationships between listening conditions and alterative amplification schemes for multiple memory hearing aids. Ear Hear 16, 575–586 (1995)
Article Google Scholar
Keidser, G.: Selecting different amplification for different listening conditions. J. of the American Academy of Audiology 7, 92–104 (1996)
Google Scholar
Büchler, M.: Algorithms for sound classification in hearing instruments. PhD thesis, Swiss Federal Institute of Technology, Zurich (2002)
Google Scholar
Nordqvist, P., Leijon, A.: An efficient robust sound classification algorithm for hearing aids. J. Acoustic Soc. Am. 115(6), 3033–3041 (2004)
Article Google Scholar
Alexandre, E., Rosa, M., Cuadra, L., Gil-Pita, R.: Application of Fisher linear discriminant analysis to speech/music classification. In: AES 120th Convention (2006)
Google Scholar
Guaus, E., Batlle, E.: A non-linear rhythm-based style classification for broadcast speech-music discrimination. In: AES 116th Convention (2004)
Google Scholar
Lu, L., Zhang, H.J., Jiang, H.: Content analysis for audio classification and segmentation. IEEE Transactions on speech and audio processing 10(7), 504–516 (2002)
Article Google Scholar
Scheirer, E., Slaney, M.: Construction and evaluation of a robust multifeature speech/music discriminator. In: ICASSP (1997)
Google Scholar
Davis, S., Mermelstein, P.: Experiments in syllable-based recognition of continuous speech. IEEE Transactions on Acoustics, Speech and Signal Processing 28, 357–366 (1980)
Article Google Scholar
Logan, B.: Mel frequency cepstral coefficients for music modeling. In: Int. Symp. Music Information Retrieval, ISMIR (2000)
Google Scholar
Tzanetakis, G., Cook, P.: Musical genre classification of audio signals. IEEE Transactions on speech and audio processing 10(5), 293–302 (2002)
Article Google Scholar
Saunders, J.: Real time discrimination of broadcast speech/music. In: ICASSSP, pp. 993–996 (1996)
Google Scholar
ITU-T: Objective measurement of active speech level. Recommendation P.56 (1993)
Google Scholar
Batlle, E., Neuschmied, H., Uray, P., Ackerman, G.: Recognition and analysis of audio for copyright protection: the RAA project. Journal of the American Society for Information Science and Technology 55(12), 1084–1091 (2004)
Article Google Scholar
Fisher, R.: The use of multiple measurements in taxonomic problems. Annals of Eugenics 7, 179–188 (1936)
Google Scholar
Haykin, S.: Neural Networks: A comprehensive foundation. Prentice-Hall, Englewood Cliffs (1999)
MATH Google Scholar
Jain, A.K., Duin, R.P., Mao, J.: Statistical pattern recognition: a review. IEEE Transactions on pattern analysis and machine intelligence 22(1), 4–37 (2000)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Dept. de Teoría de la Señal y Comunicaciones. Escuela Politécnica Superior, Universidad de Alcalá, 28805, Alcalá de Henares, Spain
Enrique Alexandre, Lucas Cuadra, Lorena Álvarez, Manuel Rosa-Zurera & Francisco López-Ferreras

Authors

Enrique Alexandre
View author publications
You can also search for this author in PubMed Google Scholar
Lucas Cuadra
View author publications
You can also search for this author in PubMed Google Scholar
Lorena Álvarez
View author publications
You can also search for this author in PubMed Google Scholar
Manuel Rosa-Zurera
View author publications
You can also search for this author in PubMed Google Scholar
Francisco López-Ferreras
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Escuela Politécnica Superior, GICAP Research Group, Universidad de Burgo, Calle Francisco de Vitoria S/N, Edifico C, Campus Vena, 09006, Burgos, Spain
Emilio Corchado
School of Electrical and Electronic Engineering, University of Manchester, UK
Hujun Yin
Department of Information Systems and Computation, Technical University of Valencia, Camino de Vera, Valencia, Spain
Vicente Botti
University of West Scotland, PA1 2BE, Paisley, Scotland
Colin Fyfe

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Alexandre, E., Cuadra, L., Álvarez, L., Rosa-Zurera, M., López-Ferreras, F. (2006). Automatic Sound Classification for Improving Speech Intelligibility in Hearing Aids Using a Layered Structure. In: Corchado, E., Yin, H., Botti, V., Fyfe, C. (eds) Intelligent Data Engineering and Automated Learning – IDEAL 2006. IDEAL 2006. Lecture Notes in Computer Science, vol 4224. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11875581_37

Download citation

DOI: https://doi.org/10.1007/11875581_37
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-45485-4
Online ISBN: 978-3-540-45487-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics