Skip to main content

Dealing with Loss of Synchronism in Multi-Band Continuous Speech Recognition Systems

  • Chapter
Computational Models of Speech Pattern Processing

Part of the book series: NATO ASI Series ((NATO ASI F,volume 169))

Summary

In multi-band systems, the signal is decomposed into several frequency bands, which are processed separately. Then, the recombination part must compute a unique sentence from all these different solutions. The task is quite easy in isolated word recognition, each word ending at the same time, but it becomes more difficult in continuous speech recognition, where each band has a different segmentation. The problem here is to decide when the recombination should be done. Two major solutions have been tested: the first one introduces synchronism between the bands, and recombination is done when all the bands are synchronous. The second one leaves the sub-recognizers totally independent and tries to extract from their solutions a phonetic structure which will allow us to process the recombination part. We will briefly present an example of the first solution, then we will focus on the algorithm we have developed for the second one.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

eBook
USD 16.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. J. B. Allen. How do humans process and recognize speech ? IEEE Trans. on Speech and Audio Processing, 2(4), October 1994

    Google Scholar 

  2. H. Bourlard and S. Dupont. Subband-based speech recognition. In Proc. ICASSP’97, Munich, Germany, 1997

    Google Scholar 

  3. C. Cerisara, J.-P. Haton, J.-F. Mari, and D. Fohr. Multi-band continuous speech recognition. In proc. EUROSPEECH’97, Rhodes, Greece, 1997

    Google Scholar 

  4. J.-L. Gauvain, L.-F. Lamel, and M. Eskénazi. Bref, a large vocabulary spoken corpus for french. In Proc. EUROSPEECH’91, pages 505–508, Genova, Italy, 1991

    Google Scholar 

  5. J.-F. Mari, J.-P. Haton, and A. Kriouile. Automatic word recognition based on second-order hidden markov models. IEEE Trans. on Speech and Audio Processing, 5(1), January 1997

    Google Scholar 

  6. S. Tibrewala and H. Hermansky. Sub-band based recognition of noisy speech. In Proc. ICASSP ’97, pages 1255–1258, Munich, Germany, Apr. 1997

    Google Scholar 

  7. D. Xu, C. Fancourt, and C. Wang. Multi-channel HMM. In Proc. ICASSP ’96, pages 841–844, Atlanta, GA, May 1996

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1999 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Cerisara, C. (1999). Dealing with Loss of Synchronism in Multi-Band Continuous Speech Recognition Systems. In: Ponting, K. (eds) Computational Models of Speech Pattern Processing. NATO ASI Series, vol 169. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-60087-6_9

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-60087-6_9

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-64250-0

  • Online ISBN: 978-3-642-60087-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics