Dealing with Loss of Synchronism in Multi-Band Continuous Speech Recognition Systems

Cerisara, Christophe

doi:10.1007/978-3-642-60087-6_9

Christophe Cerisara²

Part of the book series: NATO ASI Series ((NATO ASI F,volume 169))

228 Accesses
2 Citations

Summary

In multi-band systems, the signal is decomposed into several frequency bands, which are processed separately. Then, the recombination part must compute a unique sentence from all these different solutions. The task is quite easy in isolated word recognition, each word ending at the same time, but it becomes more difficult in continuous speech recognition, where each band has a different segmentation. The problem here is to decide when the recombination should be done. Two major solutions have been tested: the first one introduces synchronism between the bands, and recombination is done when all the bands are synchronous. The second one leaves the sub-recognizers totally independent and tries to extract from their solutions a phonetic structure which will allow us to process the recombination part. We will briefly present an example of the first solution, then we will focus on the algorithm we have developed for the second one.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

eBook: USD 16.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

J. B. Allen. How do humans process and recognize speech ? IEEE Trans. on Speech and Audio Processing, 2(4), October 1994
Google Scholar
H. Bourlard and S. Dupont. Subband-based speech recognition. In Proc. ICASSP’97, Munich, Germany, 1997
Google Scholar
C. Cerisara, J.-P. Haton, J.-F. Mari, and D. Fohr. Multi-band continuous speech recognition. In proc. EUROSPEECH’97, Rhodes, Greece, 1997
Google Scholar
J.-L. Gauvain, L.-F. Lamel, and M. Eskénazi. Bref, a large vocabulary spoken corpus for french. In Proc. EUROSPEECH’91, pages 505–508, Genova, Italy, 1991
Google Scholar
J.-F. Mari, J.-P. Haton, and A. Kriouile. Automatic word recognition based on second-order hidden markov models. IEEE Trans. on Speech and Audio Processing, 5(1), January 1997
Google Scholar
S. Tibrewala and H. Hermansky. Sub-band based recognition of noisy speech. In Proc. ICASSP ’97, pages 1255–1258, Munich, Germany, Apr. 1997
Google Scholar
D. Xu, C. Fancourt, and C. Wang. Multi-channel HMM. In Proc. ICASSP ’96, pages 841–844, Atlanta, GA, May 1996
Google Scholar

Download references

Author information

Authors and Affiliations

CRIN-CNRS & INRIA Lorraine, BP 239, F-54506, Vandoeuvre-les-Nancy Cedex, France
Christophe Cerisara

Authors

Christophe Cerisara
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Speech Research Unit, DERA Malvern, St. Andrew’s Road, WR14 4DT, Great Malvern, Worcs, UK
Keith Ponting

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Cerisara, C. (1999). Dealing with Loss of Synchronism in Multi-Band Continuous Speech Recognition Systems. In: Ponting, K. (eds) Computational Models of Speech Pattern Processing. NATO ASI Series, vol 169. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-60087-6_9

Download citation

DOI: https://doi.org/10.1007/978-3-642-60087-6_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-64250-0
Online ISBN: 978-3-642-60087-6
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics