Dynamic Neural Network Model of Speech Perception
Neurobiological research has shown that the spatial organization of the somatosensory cortex, with its linear or planar topology, appears to underlie the internal representation of the environment. This paper examines the feasibility of constructing self-organizing feature maps (SOFMs) suitable for modeling speech perception. The objective is to construct a class of dynamic SOFMs that can extract the time–amplitude and time–frequency features of the phonemes that make up words. Two approaches are presented. The first constructs time-based embedding maps. The second constructs a dynamic SOFM that uses the Gabor transform as its transfer function; this approach reveals the time–frequency features of the speech sounds. The results may be useful in speech recognition applications.
Keywords: Self-organizing maps, Speech processing, Dynamic neural networks, Semantic modeling, Time series modeling
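To make the first approach in the abstract more concrete, the following is a minimal sketch of one plausible reading of a time-based embedding map: a signal is turned into delay vectors (a time-delay embedding), and a one-dimensional SOFM with linear topology is trained on those vectors. It is not the authors' implementation. The function names, the embedding dimension and lag, the number of units, the learning-rate and neighbourhood schedules, and the synthetic two-tone signal are all assumptions chosen for illustration.

```python
import math
import random


def delay_embed(signal, dim, lag):
    """Return delay vectors [x(t), x(t+lag), ..., x(t+(dim-1)*lag)]."""
    if dim < 1 or lag < 1:
        raise ValueError("dim and lag must be positive")
    span = (dim - 1) * lag
    if len(signal) <= span:
        raise ValueError("signal too short for this dim and lag")
    return [[signal[t + k * lag] for k in range(dim)]
            for t in range(len(signal) - span)]


def best_matching_unit(weights, x):
    """Index of the unit whose weight vector is closest to x."""
    return min(range(len(weights)),
               key=lambda i: sum((w - v) ** 2 for w, v in zip(weights[i], x)))


def train_som(vectors, n_units=8, epochs=20, lr0=0.5, sigma0=2.0, seed=0):
    """Train a 1-D (linear topology) SOFM with a Gaussian neighbourhood."""
    rng = random.Random(seed)
    # Initialise each unit from a randomly chosen input vector.
    weights = [list(rng.choice(vectors)) for _ in range(n_units)]
    for epoch in range(epochs):
        frac = epoch / epochs
        lr = lr0 * (1.0 - frac)                  # linearly decaying learning rate
        sigma = max(sigma0 * (1.0 - frac), 0.5)  # shrinking neighbourhood width
        for x in rng.sample(vectors, len(vectors)):  # shuffled pass over the data
            bmu = best_matching_unit(weights, x)
            for i, w in enumerate(weights):
                # Units near the winner on the map are pulled more strongly.
                h = math.exp(-((i - bmu) ** 2) / (2.0 * sigma ** 2))
                for d in range(len(w)):
                    w[d] += lr * h * (x[d] - w[d])
    return weights


# Synthetic two-tone signal standing in for a speech segment (assumed, not real data).
signal = [math.sin(2 * math.pi * 0.05 * n) + 0.5 * math.sin(2 * math.pi * 0.13 * n)
          for n in range(400)]

vectors = delay_embed(signal, dim=3, lag=4)
weights = train_som(vectors)

# Sequence of winning units over time: a trajectory of the signal across the map.
trajectory = [best_matching_unit(weights, v) for v in vectors]
```

In this sketch the time dimension enters only through the delay vectors, so the map itself is an ordinary static SOFM; the `trajectory` list is what carries the temporal structure. With real speech, `signal` would be replaced by sampled audio, and `dim` and `lag` would need to be chosen for the data.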