Biologically Inspired Methods for Automatic Speech Understanding

Salvi, Giampiero

doi:10.1007/978-3-642-34274-5_49

Biologically Inspired Methods for Automatic Speech Understanding

Giampiero Salvi⁵

Conference paper

1392 Accesses

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 196))

Abstract

Automatic Speech Recognition (ASR) and Understanding (ASU) systems heavily rely on machine learning techniques to solve the problem of mapping spoken utterances into words and meanings. The statistical methods employed, however, greatly deviate from the processes involved in human language acquisition in a number of key aspects. Although ASR and ASU have recently reached a level of accuracy that is sufficient for some practical applications, there are still severe limitations due, for example, to the amount of training data required and the lack of generalization of the resulting models. In our opinion, there is a need for a paradigm shift and speech technology should address some of the challenges that humans face when learning a first language and that are currently ignored by the ASR and ASU methods. In this paper, we point out some of the aspects that could lead to more robust and flexible models, and we describe some of the research we and other researchers have performed in the area.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Ananthakrishnan, G., Salvi, G.: Using imitation to learn infant-adult acoustic mappings. In: Proc. of Interspeech, Firenze, Italy (2011)
Google Scholar
Bailly, G.: Learning to speak. Sensori-motor control of speech movements* 1. Speech Communication 22(2-3), 251–267 (1997)
Article Google Scholar
Driesen, J., ten Bosch, L., van Hamme, H.: Adaptive non-negative matrix factorization in a computational model of language acquisition. In: Proc. Interspeech (2009)
Google Scholar
Guenther, F.H.: Speech sound acquisition, coarticulation, and rate effects in a neural network model of speech production. Psychological Review 102(3), 594–620 (1995)
Article Google Scholar
Guenther, F.H., Gjaja, M.N.: The perceptual magnet effect as an emergent property of neural map formation 100(2), 1111–1121 (1996)
Google Scholar
Markey, K.: The sensorimotor foundations of phonology: a computational model of early childhood articulatory and phonetic development. Ph.D. thesis, University of Colorado Doctoral Dissertation (1994)
Google Scholar
Salvi, G.: Ecological language acquisition via incremental model-based clustering. In: Proceedings of Eurospeech, Lisbon, Portugal, pp. 1181–1184 (2005)
Google Scholar
Salvi, G., Montesano, L., Bernardino, A., Santos-Victor, J.: Language bootstrapping: Learning word meanings from perception-action association. IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics 42(3), 660–671 (2012)
Article Google Scholar
Stouten, V., Demuynck, K., van Hamme, H.: Discovering phone patterns in spoken utterances by non-negative matrix factorization. IEEE Signal Processing Lett. 15, 131–134 (2008)
Article Google Scholar
Vanhainen, N., Salvi, G.: Word discovery with beta process factor analysis. In: Proc. of Interspeech, Portland, Oregon (2012)
Google Scholar
Westermann, G., Reck Miranda, E.: A new model of sensorimotor coupling in the development of speech. Brain and Language 89(2), 393–400 (2004)
Article Google Scholar

Download references

Author information

Authors and Affiliations

School of Computer Science and Communication, Dept. for Speech, Music and Hearing, KTH (Royal Institute of Technology), Stockholm, Sweden
Giampiero Salvi

Authors

Giampiero Salvi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Giampiero Salvi .

Editor information

Editors and Affiliations

Department of Chemical, Management,, Computer, Mechanical Engineering, Università di Palermo, Viale delle Scienze, Building 6 6, Palermo, 90128, Italy
Antonio Chella
Department of Chemical, Management,, Computer, Mechanical Engineering, Università di Palermo, Viale delle Scienze, Building 6 6, Palermo, 90128, Italy
Roberto Pirrone
Department of Chemical, Management,, Computer, Mechanical Engineering, Università di Palermo, Viale delle Scienze, Building 6 6, Palermo, 90128, Italy
Rosario Sorbello
, Department of Psychology, Reykjavik University, Menntavegur 1, Reykjavik, 101, Iceland
Kamilla Rún Jóhannsdóttir

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Salvi, G. (2013). Biologically Inspired Methods for Automatic Speech Understanding. In: Chella, A., Pirrone, R., Sorbello, R., Jóhannsdóttir, K. (eds) Biologically Inspired Cognitive Architectures 2012. Advances in Intelligent Systems and Computing, vol 196. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34274-5_49

Download citation

DOI: https://doi.org/10.1007/978-3-642-34274-5_49
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-34273-8
Online ISBN: 978-3-642-34274-5
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics