A Bio-inspired Architecture for Cognitive Audio
A comprehensive view of speech and voice technologies is now demanding better and more complex tools amenable of extracting as much knowledge about sound and speech as possible. Many knowledge-extraction tasks from speech and voice share well-known procedures at the algorithmic level under the point of view of bio-inspiration. The same resources employed to decode speech phones may be used in the characterization of the speaker (gender, age, speaking group, etc.). Based on these facts the present paper examines a hierarchy of sound processing levels at the auditory and perceptual levels on the brain neural paths which can be translated into a bio-inspired audio-processing architecture. Through this paper its fundamental characteristics are analyzed in relation with current tendencies in cognitive audio processing. Examples extracted from speech processing applications in the domain of acoustic-phonetics are presented. These may find applicability in speaker’s characterization, forensics, and biometry, among others.
KeywordsSpeech Recognition Lateral Inhibition Speech Perception Auditory Cortex Inferior Colliculus
Unable to display preview. Download preview PDF.
- 2.Deller, J.R., Proakis, J.G., Hansen, J.H.: Discrete-Time Processing of Speech Signals. Macmillan, New York (1993)Google Scholar
- 3.Gómez, P., Godino, J.I., Alvarez, A., Martínez, R., Nieto, V., Rodellar, V.: Evidence of Glottal Source Spectral Features found in Vocal Fold Dynamics. In: Proc. of the ICASSP’05, pp. 441–444 (2005)Google Scholar
- 4.Hermansky, H.: Should Recognizers Have Ears? In: ESCA-NATO Tutorial and Research Workshop on Robust Speech Recognition for Unknown Communication Channels, Pont-à-Mousson, France, 17-18 April 1997, pp. 1–10 (1997)Google Scholar
- 5.Ferrández, J.M.: Study and Realization of a Bio-inspired Hierarchical Architecture for Speech Recognition. Ph.D. Thesis (in Spanish), Universidad Politécnica de Madrid (1998)Google Scholar
- 6.Gómez, P., Martínez, R., Rodellar, V., Ferrández, J.M.: Bio-inspired Systems in Speech Perception: An overview and a study case. In: IEEE/NML Life Sciences Systems and Applications Workshop (by invitation), National Institute of Health, Bethesda, Maryland, July 13-14 (2006)Google Scholar
- 9.Jahne, B.: Digital Image Processing. Springer, Berlin (2005)Google Scholar
- 12.Ojemann, G.A.: Organization of language cortex derived from investigation during neurosurgery. Sem. Neuros. 2, 297–305 (1990)Google Scholar
- 13.O’Shaughnessy, D.: Speech Communication. IEEE Press, Los Alamitos (2000)Google Scholar
- 20.Suga, N.: Basic Acoustic Patterns and Neural Mechanism Shared By Humans and Animals for Auditory Perception: A Neuroethologist’s view. In: Proceedings of Workshop on the Auditory bases of Speech Perception, ESCA, July 1996, pp. 31–38 (1996)Google Scholar
- 21.Waibel, A.: Neural Network Approaches for Speech Recognition. In: Furui, S., Sondhi, M.M. (eds.) Advances in Speech Signal Processing, pp. 555–597. Marcel Dekker, New York (1992)Google Scholar