Mandarin tone recognition based on wavelet transform and hidden Markov modeling

Cheng, Jun; Yi, Kechu; Li, Bingbing

doi:10.1007/s11767-000-0015-y

Published: January 2000

Volume 17, pages 1–8 (2000)

Journal of Electronics (China)

Abstract

This paper presents a method of tone recognition for Mandarin speech that combines wavelet transform and hidden Markov modeling techniques. A pitch detector based on singularity detection and multi-resolution analysis with the wavelet transform is employed to estimate pitch periods, and hidden Markov modeling with a partition Gaussian mixture probability density function is used for the tone recognition. The algorithm achieves recognition accuracies of 97.22% and 94.47% for speaker-dependent and speaker-independent tone recognition, respectively.
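To make the pitch detection idea concrete, the following is a minimal sketch of an event-based pitch detector that marks singularities (sharp excitation onsets) as local maxima of a wavelet transform and keeps only the marks that are confirmed at a second, coarser scale. It is not the authors' implementation: the abstract does not specify the wavelet, the scales, or the thresholds, so the derivative-of-Gaussian wavelet, the scale pair (4, 8), the relative threshold, the matching tolerance, and all function names here are assumptions chosen for illustration. The test signal is synthetic (an impulse train at 100 Hz passed through a decaying exponential), and the hidden Markov modeling stage is not sketched.

```python
import numpy as np


def gaussian_derivative_wavelet(scale):
    """First derivative of a Gaussian, used here as an assumed stand-in
    for the smoothing-derivative wavelet of a singularity detector."""
    k = np.arange(-4 * scale, 4 * scale + 1, dtype=float)
    w = -k * np.exp(-k**2 / (2.0 * scale**2))
    return w / np.abs(w).sum()


def local_maxima(y, rel_thresh):
    """Indices of local maxima of y that exceed rel_thresh * max(y)."""
    thresh = rel_thresh * y.max()
    i = np.arange(1, len(y) - 1)
    keep = (y[i] > y[i - 1]) & (y[i] >= y[i + 1]) & (y[i] > thresh)
    return i[keep]


def pitch_marks(x, scales=(4, 8), rel_thresh=0.5, tol=4):
    """Keep fine-scale maxima that have a coarse-scale maximum within
    tol samples, i.e. singularities that persist across both scales."""
    fine, coarse = (
        local_maxima(
            np.convolve(x, gaussian_derivative_wavelet(s), mode="same"),
            rel_thresh,
        )
        for s in scales
    )
    return np.array([m for m in fine if np.any(np.abs(coarse - m) <= tol)])


def synthetic_voiced(fs=8000, f0=100.0, dur=0.1, tau=20.0):
    """Toy voiced signal: impulse train at f0 with exponential decay."""
    n = int(fs * dur)
    period = int(round(fs / f0))
    excitation = np.zeros(n)
    excitation[period // 2 :: period] = 1.0
    decay = np.exp(-np.arange(5 * int(tau)) / tau)
    return np.convolve(excitation, decay)[:n]


if __name__ == "__main__":
    fs = 8000
    marks = pitch_marks(synthetic_voiced(fs=fs))
    periods = np.diff(marks)  # pitch periods in samples
    print("pitch marks:", marks)
    print("estimated F0 (Hz):", fs / np.median(periods))
```

The sequence of pitch periods obtained this way gives a pitch contour over the syllable, which is the kind of feature a tone recognizer would then model. Requiring agreement between two scales is one simple way to reject maxima that appear at only a single resolution; real speech would need voicing detection and more careful threshold selection than this toy example uses.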

Author information

Authors and Affiliations

National Key Laboratory on ISN, Xidian University, 710071, Xi’an
Cheng Jun, Yi Kechu & Li Bingbing

Additional information

Supported by the National Natural Science Foundation of China

About this article

Cite this article

Cheng, J., Yi, K. & Li, B. Mandarin tone recognition based on wavelet transform and hidden Markov modeling. J. of Electron.(China) 17, 1–8 (2000). https://doi.org/10.1007/s11767-000-0015-y

Issue Date: January 2000
DOI: https://doi.org/10.1007/s11767-000-0015-y
