Abstract
Nowadays, VoIP has become the core of communication over internet in daily life. While, most objective speech measurement researches are based on studies of Western languages. In linguistic way, tonal language is the language that uses tone to distinguish the meaning of the words and widely use in East Asia and South East Asia like Thai, Vietnam and Chinese. In this paper, we investigate the effect of tonal languages on VoIP speech quality with Speex codec. We encoded the speech samples from three different languages English, Thai and Chinese Mandarin with Speex codec and send through IP network with RTP session. Objective Listening Mean Opinion Score (MOS-LQO) which, measure from PESQ is used as an index of speech quality. The PESQ measurement shows average MOS patterns of three languages is not significantly different but Chinese show more fluctuated in standard deviation of each quality mode more than Thai and English.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
ITU-T P.800: Methods for subjective determination of transmission quality (1996)
Donald, G., Jamieson, D.G., Parsa, V., Price, M.C.: Interaction of speech coders and atypical speech i: Effects on speech intelligibility. J. Speech, Language, Hearing 45, 482–493 (2002)
Sun, L., Ifeachor, E.C.: Perceived speech quality prediction for voice over IP-based networks. In: IEEE International Conference on Communications, ICC 2002, vol. 4, pp. 2573–2577 (2002)
Duanmu, S.: Tone and non-tone languages: An alternative to language typology and parameters. Language and Linguistics 5(4), 891–923 (2004)
Chang, Y.-C., Halle, P., Best Catherine T., Abramson, A.: Do non-native language listeners perceive Mandarin tone continua categorically? In: 8th Phonetic Conference of China and the International Symposium on Phonetic Frontiers, Beijing (2008)
Chong, F.L., Pawkikowski, K., McLoughlin, I.V.: Evaluation of ITU-T G.728 as a Voice over IP codec for Chinese Speech. In: Australian Telecommunication Networks and Applications Conference (2003)
Cai, Z., Kitawaki, N., Yamada, T., Makino, S.: Comparison of MOS evaluation characteristics for Chinese, Japanese and English in IP telephony. In: Proc. International Universal Communication Symposium, pp. 1–4 (2010)
Daengsi, T., Wutiwiwatchai, C., Preechayasomboon, A., Sukparungsee, S.: A study of VoIP quality evaluation: User perception of voice quality from G.729, G.711 and G.722. In: 2012 IEEE Consumer Communications and Networking Conference (CCNC), pp. 342–245 (2012)
Valin, J.M.: The Speex codec manual (version 1.2 Beta 3), (2007)
Speex: a free codec for free speech, http://www.speex.org/
ITU-T P.862: Perceptual evaluation of speech quality (PESQ), an ojective method for end-to-end speech quality assessment of narrowband telephone networks and speech codecs (2001)
Chong, F.L., Chloughlin, I.V., Pawlikowski, K.: A Methodology for Improving PESQ Accuracy for Chinese Speech. In: TENCON 2005 IEEE Region, vol. 10, pp. 1–6 (2005)
Delancey, S.: Sino-Tibetan Languages. In: International Encyclopedia of Linguistics, New York, vol. 4, pp. 445–449 (1992)
Diller, A.V.N., Edmondson, J.: The Tai-Kadai Languages. RoutledgeCurzon, London (2005)
Abramson, A.S.: The vowels and tones of standard Thai: Acoustical measurement and experiments (Publication No. 20), Bloomington: Indiana University Research Center in Anthropology, Folklore, and Linguistics. International Journal of American Linguistics 28 (1962)
Hall, T.A.: Objective Speech Quality Measures for Internet Telephony. In: Proc. of SPIE, vol. 4522, pp. 128–136 (2001)
Li, Z., Tan, E.C., Mcloughlin, I., And Teo, T.T.: Proposal of standard for intelligibility tests of Chinese speech. IEEE Procedding-Vision, Image Signal Processing 147(3), 254–260 (2000)
Open Speech Repository, http://www.voiptroubleshooter.com/open_speech/index.html
Patcharikra, C., Treepop, S., Sawit, K., Nattanun, T., Chai, W.: LOTUS: Large vOcabulary Thai continUous Speech Recognition Corpus. In: NSTDA Annual Conference S&T in Thailand: Towards the Molecular Economy (2005)
GL Communications Inc. RTP ToolBox product, http://www.gl.com/rtptoolbox.html
GL Communications Inc. IPNetSim product, http://www.gl.com/ipnetsim.html
GL Communications Inc. VQT product, http://www.gl.com/voicequalitytesting.html
Itani, M., Paulikas, S.: Influence of Language on CELP Codecs Performance. 124X Information Technology and Control 37 (2008)
ITU-T P.862.1: Mapping function for transforming P.862 raw result scores to MOS-LQO (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Triyason, T., Kanthamanon, P. (2012). Perceptual Evaluation of Speech Quality Measurement on Speex Codec VoIP with Tonal Language Thai. In: Papasratorn, B., Charoenkitkarn, N., Lavangnananda, K., Chutimaskul, W., Vanijja, V. (eds) Advances in Information Technology. IAIT 2012. Communications in Computer and Information Science, vol 344. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35076-4_17
Download citation
DOI: https://doi.org/10.1007/978-3-642-35076-4_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35075-7
Online ISBN: 978-3-642-35076-4
eBook Packages: Computer ScienceComputer Science (R0)