Semi-supervised Classifying of Modelled Auditory Nerve Patterns for Vowel Stimuli with Additive Noise

  • Conference paper

Part of the book series: Studies in Computational Intelligence (SCI, volume 799)

Abstract

The paper proposes an approach to analyzing stationary patterns of auditory neural activity from the standpoint of semi-supervised learning in self-organizing maps (SOM). The approach makes it possible to classify and identify complex auditory stimuli, such as vowels, given limited prior information about the data. A computational model of the auditory periphery is used to obtain auditory nerve fiber responses. Label propagation through a Delaunay triangulation proximity graph, derived by the SOM algorithm, is implemented to classify unlabeled units. To avoid the “dead” unit problem in Emergent SOM and to improve the effectiveness of the method, an adaptive conscience mechanism is implemented. The study considers the influence of additive white Gaussian noise (AWGN) on the robustness of auditory stimuli identification at various signal-to-noise ratios (SNRs). Representing acoustic signals as neural activity in the auditory nerve fibers proves more noise-robust than representing them with the most common acoustic features, such as mel-frequency cepstral coefficients (MFCC) and perceptual linear prediction (PLP). The approach achieves high accuracy, both for similar sounds and at high SNR.
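The noise condition mentioned in the abstract can be illustrated with a minimal sketch of mixing AWGN into a signal at a chosen SNR. This is not the authors' code: the function names `add_awgn` and `measured_snr_db`, the 16 kHz sampling rate, and the two-sinusoid stand-in for a vowel are all assumptions made for illustration. The only relation relied on is the standard definition SNR_dB = 10 log10(P_signal / P_noise).

```python
import math
import random


def add_awgn(signal, snr_db, rng=None):
    """Return a noisy copy of `signal` with white Gaussian noise at `snr_db` dB."""
    if rng is None:
        rng = random.Random()
    # Mean power of the clean signal.
    signal_power = sum(x * x for x in signal) / len(signal)
    # SNR_dB = 10 * log10(P_signal / P_noise)  =>  P_noise = P_signal / 10^(SNR_dB / 10)
    noise_power = signal_power / (10.0 ** (snr_db / 10.0))
    sigma = math.sqrt(noise_power)
    return [x + rng.gauss(0.0, sigma) for x in signal]


def measured_snr_db(clean, noisy):
    """Empirical SNR in dB between a clean signal and its noisy version."""
    signal_power = sum(x * x for x in clean) / len(clean)
    noise_power = sum((n - c) ** 2 for c, n in zip(clean, noisy)) / len(clean)
    return 10.0 * math.log10(signal_power / noise_power)


# Hypothetical stand-in for a vowel: a sum of two sinusoids (not the paper's stimuli).
fs = 16000  # assumed sampling rate, Hz
clean = [
    math.sin(2 * math.pi * 700 * t / fs) + 0.5 * math.sin(2 * math.pi * 1200 * t / fs)
    for t in range(fs)
]
noisy = add_awgn(clean, snr_db=10.0, rng=random.Random(0))

# The empirical SNR should land close to the 10 dB target.
print(round(measured_snr_db(clean, noisy), 1))
```

Scaling the noise from the measured signal power, rather than using a fixed variance, keeps the target SNR meaningful for stimuli of different loudness. Repeating the call over a range of `snr_db` values would give the kind of SNR sweep the abstract describes.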

Acknowledgments

The reported study was funded by the Russian Foundation for Basic Research under research project No. 18-31-00304.

Corresponding author

Correspondence to Anton Yakovenko.

Copyright information

© 2019 Springer Nature Switzerland AG

Cite this paper

Yakovenko, A., Sidorenko, E., Malykhina, G. (2019). Semi-supervised Classifying of Modelled Auditory Nerve Patterns for Vowel Stimuli with Additive Noise. In: Kryzhanovsky, B., Dunin-Barkowski, W., Redko, V., Tiumentsev, Y. (eds) Advances in Neural Computation, Machine Learning, and Cognitive Research II. NEUROINFORMATICS 2018. Studies in Computational Intelligence, vol 799. Springer, Cham. https://doi.org/10.1007/978-3-030-01328-8_28
