Summary
In multi-band systems, the signal is decomposed into several frequency bands, which are processed separately. Then, the recombination part must compute a unique sentence from all these different solutions. The task is quite easy in isolated word recognition, each word ending at the same time, but it becomes more difficult in continuous speech recognition, where each band has a different segmentation. The problem here is to decide when the recombination should be done. Two major solutions have been tested: the first one introduces synchronism between the bands, and recombination is done when all the bands are synchronous. The second one leaves the sub-recognizers totally independent and tries to extract from their solutions a phonetic structure which will allow us to process the recombination part. We will briefly present an example of the first solution, then we will focus on the algorithm we have developed for the second one.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
J. B. Allen. How do humans process and recognize speech ? IEEE Trans. on Speech and Audio Processing, 2(4), October 1994
H. Bourlard and S. Dupont. Subband-based speech recognition. In Proc. ICASSP’97, Munich, Germany, 1997
C. Cerisara, J.-P. Haton, J.-F. Mari, and D. Fohr. Multi-band continuous speech recognition. In proc. EUROSPEECH’97, Rhodes, Greece, 1997
J.-L. Gauvain, L.-F. Lamel, and M. Eskénazi. Bref, a large vocabulary spoken corpus for french. In Proc. EUROSPEECH’91, pages 505–508, Genova, Italy, 1991
J.-F. Mari, J.-P. Haton, and A. Kriouile. Automatic word recognition based on second-order hidden markov models. IEEE Trans. on Speech and Audio Processing, 5(1), January 1997
S. Tibrewala and H. Hermansky. Sub-band based recognition of noisy speech. In Proc. ICASSP ’97, pages 1255–1258, Munich, Germany, Apr. 1997
D. Xu, C. Fancourt, and C. Wang. Multi-channel HMM. In Proc. ICASSP ’96, pages 841–844, Atlanta, GA, May 1996
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1999 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Cerisara, C. (1999). Dealing with Loss of Synchronism in Multi-Band Continuous Speech Recognition Systems. In: Ponting, K. (eds) Computational Models of Speech Pattern Processing. NATO ASI Series, vol 169. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-60087-6_9
Download citation
DOI: https://doi.org/10.1007/978-3-642-60087-6_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-64250-0
Online ISBN: 978-3-642-60087-6
eBook Packages: Springer Book Archive