Semi-supervised Classifying of Modelled Auditory Nerve Patterns for Vowel Stimuli with Additive Noise

  • Anton YakovenkoEmail author
  • Eugene Sidorenko
  • Galina Malykhina
Conference paper
Part of the Studies in Computational Intelligence book series (SCI, volume 799)


The paper proposes an approach to stationary patterns of auditory neural activity analysis from the point of semi-supervised learning in self-organizing maps (SOM). The suggested approach has allowed to classify and identify complex auditory stimuli, such as vowels, given limited prior information about the data. A computational model of the auditory periphery has been used to obtain auditory nerve fiber responses. Label propagation through Delaunay triangulation proximity graph, derived by SOM algorithm, is implemented to classify unlabeled units. In order to avoid the “dead” unit problem in Emergent SOM and to improve method effectiveness, an adaptive conscience mechanism has been realized. The study has considered the influence of AWGN on the robustness of auditory stimuli identification under various SNRs. The representation of acoustic signals in the form of neural activity in the auditory nerve fibers has proven more noise-robust compared to that in the form of the most common acoustic features, such as MFCC and PLP. The approach has produced high accuracy, both in case of similar sounds and with high SNR.


Auditory nerve data analysis Unsupervised learning Neurogram Machine hearing Label propagation Self-organizing maps 



The reported study was funded by the Russian Foundation for Basic Research according to the research project 18-31-00304.


  1. 1.
    Meyer, B., Wächter, M., Brand, T., Kollmeier, B.: Phoneme confusions in human and automatic speech recognition. In: Proceedings of Interspeech, pp. 1485–1488 (2007)Google Scholar
  2. 2.
    Yousafzai, J., Ager, M., Cvetkovic, Z., Sollich, P.: Discriminative and generative machine learning approaches towards robust phoneme classification. In: Proceedings of IEEE Workshop on Information Theory and Application, pp. 471–475 (2008)Google Scholar
  3. 3.
    Miller, G.A., Nicely, P.E.: An analysis of perceptual confusions among some English consonants. J. Acoust. Soc. Am. 27(2), 338–352 (1955)CrossRefGoogle Scholar
  4. 4.
    Chapelle, O., Schölkopf, B., Zien, A.: Semi-Supervised Learning. MIT Press, Cambridge (2006)CrossRefGoogle Scholar
  5. 5.
    Yakovenko, A., Malykhina, G.: Bio-inspired approach for automatic speaker clustering using auditory modeling and self-organizing maps. Procedia Comput. Sci. 123, 547–552 (2018)CrossRefGoogle Scholar
  6. 6.
    Huang, X., Acero, A., Hon, H.: Spoken Language Processing: A Guide to Theory, Algorithm, and System Development. Prentice Hall, Upper Saddle River (2001)Google Scholar
  7. 7.
    Hermansky, H.: Perceptual linear predictive (PLP) analysis of speech. J. Acoust. Soc. Am. 87(4), 1738–1752 (1990)CrossRefGoogle Scholar
  8. 8.
    Imai, T.: Positional information in neural map development: lessons from the olfactory system. Dev. Growth. Differ 54(3), 358–365 (2012)CrossRefGoogle Scholar
  9. 9.
    Kohonen, T.: Self-Organizing Maps, 3rd edn. Springer, Heidelberg (2001)CrossRefGoogle Scholar
  10. 10.
    Ultsch, A., Mörchen, F.: ESOM-maps: tools for clustering, visualization, and classification with Emergent SOM. Technical Report, Department of Mathematics and Computer Science, University of Marburg, Germany, p. 46 (2005)Google Scholar
  11. 11.
    Ultsch, A., Lötsch, J.: Machine-learned cluster identification in high-dimensional data. J. Biomed. Inform. 66, 95–104 (2017)CrossRefGoogle Scholar
  12. 12.
    DeSieno, D.: Adding a Conscience to Competitive Learning. In: Proceedings of the Second Annual IEEE International Conference on Neural Networks, pp. 117–124 (1988)Google Scholar
  13. 13.
    Zhu, X.: Semi-supervised learning with graphs. Doctoral dissertation, Carnegie Mellon University. CMU-LTI-05-192 (2005)Google Scholar
  14. 14.
    Herrmann, L., Ultsch, A.: Label propagation for semi-supervised learning in self-organizing maps. In: Proceedings of the 6th International Workshop on Self-Organizing Maps (WSOM). Bielefeld University, Germany (2007)Google Scholar
  15. 15.
    Hawkins, S., Midgley, J.: Formant frequencies of RP monophthongs in four age groups of speakers. J. Int. Phon. Assoc. 35(2), 183–199 (2005)CrossRefGoogle Scholar
  16. 16.
    Meddis, R., et al.: A computer model of the auditory periphery and its application to the study of hearing. In: Proceedings of the 16th International Symposium on Hearing, Cambridge, UK, pp. 23–27 (2012)Google Scholar

Copyright information

© Springer Nature Switzerland AG 2019

Authors and Affiliations

  • Anton Yakovenko
    • 1
    Email author
  • Eugene Sidorenko
    • 1
  • Galina Malykhina
    • 1
  1. 1.Peter the Great St.Petersburg Polytechnic UniversitySt.PetersburgRussia

Personalised recommendations