Skip to main content

Neural Networks for Continuous Speech Recognition

  • Conference paper
Speech Recognition and Understanding

Part of the book series: NATO ASI Series ((NATO ASI F,volume 75))

  • 279 Accesses

Abstract

The paper reviews a number of methods for continuous speech recognition, concentrating mostly on work at Cambridge University. The methods reviewed are a ‘sound’ and phoneme recogniser using duration sensitive nets; the modified Kanerva model for phoneme recognition; a recurrent net for phoneme recognition; a Classification and Regressive Tree (CART) for phoneme recognition; together with methods for lexical access including the NET-gram, the modified Kanerva model, and the ‘Compositional Representation’ approach

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Breiman, L., Friedman, J.H., Olshen, R.A. & Stone, C.J.: Classification and Regression Trees, Wadsworth & Brooks/Cole, 1984.

    MATH  Google Scholar 

  2. Bridle, J.: Alpha-Nets: A recurrent neural network architecture with HMM interpretation, Speech Communication, April, 1990.

    Google Scholar 

  3. Chan, L.W & Fallside, F..: An adaptive training algorithm for back propagation networks, Computer Speech & Language, 3, 3/4, Sept/Dec 1987, 205–218.

    Article  Google Scholar 

  4. Chong, M.W. & Fallside, F.: Classification and regression tree neural networks for automatic speech recognition, Proc. INNC90, Paris, 1, 187–190.

    Google Scholar 

  5. Chow, Y.L., Dunham, M.O., Kimball, O.A., Krasner, M.A., Kubela, G.F., Makhoul, J., Roucos, S., & Schwartz, R.M.: BYBLOS: The BBN continuous speech recognition system, Proc. ICASSP 87, Dallas 1987, 89–92.

    Google Scholar 

  6. Fallside, F., Lucke, H., Marsland, T.P., O’Shea, P.J., Owen, M. St.J., Prager, R.W., Robinson, A.J. & Russell, N.H.: Continuous speech recognition for the TIMIT database using neural networks, Proc. ICASSP 90, Albuquerque, 1990, pp.445–448.

    Google Scholar 

  7. Fisher, W.M., Zue, V., Bernstein, J., Pallett, D.: An acoustic-phonetic database, Proc. Acoustic Soc. Amer., May 1987.

    Google Scholar 

  8. Harrison, T.D. & Fallside, F.: A connectionist model for phoneme recognition in continuous speech, Proc. ICASSP 89, Glasgow, 1989, 417–420.

    Google Scholar 

  9. Kanerva, P.: Self-propagating search: a unified theory, Ph.D thesis, Stanford University, Centre for Study of Language and Information, 1984.

    Google Scholar 

  10. Lee, K-F.: Automatic Speech Recognition: The Development of the SPHINX System, Kluwer (Boston), 1989.

    Google Scholar 

  11. Lee, K-F & Hon, H-W.: Speaker independent phoneme recognition using HMMs, IEE Trans. ASSP, 37, (11), 1989, 1641–1648.

    Article  Google Scholar 

  12. Lucke, H. & Fallside.: Application of the Compositional Representation to lexical access using neural networks, Proc. Intl. Conf. on Spoken Language Processing, Kobe, November 1990, paper 31.13.

    Google Scholar 

  13. Nakamura, M. & Shikano, K.: A study of English word category prediction based on neural networks, Proc. ICASSP 89, Glasgow 1989, 731–734.

    Google Scholar 

  14. Prager, R.W. & Fallside, F.: The modified Kanerva model for automatic speech recognition, Computer Speech & Language, 3, 1, 1989, 61–81.

    Article  Google Scholar 

  15. Prager, R.W., Clarke, T.J.W. & Fallside, F.: The modified Kanerva model: results for real time word recognition, IEE Intl. Conf. on Artificial Neural Networks, London, 1989, 105–109.

    Google Scholar 

  16. Price, P.J., Fisher, W., Bernstein, J., Pallett, D.: The DARPA 1000-word resource management database for continuous speech recognition, Proc. ICASSP 88, New York, 1988, 651–654.

    Google Scholar 

  17. Robinson, A.J. & Fallside, F.: A dynamic connectionist model for phoneme recognition, Proc. n’Euro ‘88, ENST, Paris 1988.

    Google Scholar 

  18. Robinson, A.J. & Fallside, F.: Phoneme recognition for the TIMIT database using recurrent error propagation networks, Technical Report CUED/F/INFENG/TR42, March 1990, Cambridge University Engineering Department, (submitted for publication).

    Google Scholar 

  19. Robinson, A.J., Holdsworth, J., Patterson, R. & Fallside, F.: A comparison of preprocessors for the Cambridge recurrent error propagation network speech recognition system, Proc. Intl. Conf. on Spoken Language Processing, Kobe, November 1990, paper 23.17.

    Google Scholar 

  20. Robinson, A.J. & Fallside, F.: Further results from the DARPA resource management database with the Cambridge Recurrent Error Propagation Speech Recognition System, submitted to ICASSP 90.

    Google Scholar 

  21. Young, S.J.: Competitive training in Hidden Markov Models, Proc. ICASSP 90, Albuquerque, 1990, 681–684.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1992 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Fallside, F. (1992). Neural Networks for Continuous Speech Recognition. In: Laface, P., De Mori, R. (eds) Speech Recognition and Understanding. NATO ASI Series, vol 75. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-76626-8_27

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-76626-8_27

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-76628-2

  • Online ISBN: 978-3-642-76626-8

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics