Abstract
While the experienced quality of a conversation transmitted over a telephone network is dependent on the parameters of the network and possible degradations, it has been shown that the sensitivity to these degradations is influenced by the type of the conversation and its contents. Especially for delayed speech transmission the conversational scenario and the adaption of turn-taking behavior of the interlocutors factor into the conversational quality rating of a specific conversation. Parametric Conversation Analysis has been proven to be a good method to extract parameters from a recorded conversation that are representable of the interactivity. While the narrowband version of a popular and standardized quality prediction model, the E-model, uses the interactivity to predict a conversational quality MOS for the given conditions, there are no attempts to predict the conversational quality of individual conversations so far. In this paper, we propose a model to predict the quality of a conversation under the influence of transmission delay based on the interactivity parameters extracted from that conversation. We evaluate which parameters are most suited for such a prediction, and compare our results to the parameter-based predictions of the E-model.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Egger, S., Schatz, R., Scherer, S.: It takes two to tango-assessing the impact of delay on conversational interactivity on perceived speech quality. In: Eleventh Annual Conference of the International Speech Communication Association, pp. 1321–1324. ISCA (2010)
Egger, S., Schatz, R., Schoenenberg, K., Raake, A., Kubin, G.: Same but different? - Using speech signal features for comparing conversational VoIP quality studies. In: 2012 IEEE International Conference on Communications (ICC), pp. 1320–1324. IEEE (2012)
Hammer, F.: Quality aspects of packet-based interactive speech communication. Forschungszentrum Telekommunikation Wien (2006)
ITU-T Recommandation G.107: The E-model: a computational model for use in transmission planning. International Telecommunication Union, Geneva (2011). http://handle.itu.int/11.1002/1000/12505
ITU-T Recommandation G.107.1: Wideband E-model. International Telecommunication Union, Geneva (2015)
ITU-T Recommandation G.107.2: Fullband E-model. International Telecommunication Union, Geneva (2019)
ITU-T Recommendation P.59: Artificial Conversational Speech. International Telecommunication Union (1993)
ITU-T Recommendation P.805: Subjective Evaluation of Conversational Quality. International Telecommunication Union, Geneva (2007)
Kitawaki, N., Itoh, K.: Pure delay effects on speech quality in telecommunications. IEEE J. Sel. Areas Commun. 9(4), 586–593 (1991)
Köster, F., Guse, D., Wältermann, M., Möller, S.: Comparison between the discrete ACR scale and an extended continuous scale for the quality assessment of transmitted speech. Fortschritte der Akustik-DAGA (2015)
Lee, H., Un, C.: A study of on-off characteristics of conversational speech. IEEE Trans. Commun. 34(6), 630–637 (1986)
Michael, T., Möller, S.: Effects of delay and packet-loss on the conversational quality. Fortschritte der Akustik-DAGA (2020)
Raake, A., Schoenenberg, K., Skowronek, J., Egger, S.: Predicting speech quality based on interactivity and delay. In: Proceedings of INTERSPEECH, pp. 1384–1388 (2013)
Reichl, P., Hammer, F.: Hot discussion or frosty dialogue? Towards a temperature metric for conversational interactivity. In: Eighth International Conference on Spoken Language Processing (2004)
Sacks, H., Schegloff, E., Jefferson, G.: A simplest systematics for the organization of turn-taking for conversation. Language 50(4), 696–735 (1974). https://doi.org/10.2307/412243. http://www.jstor.org/stable/412243?origin=crossref
Schoenenberg, K.: The Quality of Mediated-Conversations under Transmission Delay. Ph.D. thesis, TU Berlin (2015). http://dx.doi.org/10.14279/depositonce-4990
Uhrig, S., Michael, T., Möller, S., Keller, P.E., Voigt-Antons, J.N.: Effects of delay on perceived quality, behavior and oscillatory brain activity in dyadic telephone conversations. In: 2018 Tenth International Conference on Quality of Multimedia Experience (QoMEX), pp. 1–6. IEEE (2018)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Michael, T., Möller, S. (2020). Interactivity-Based Quality Prediction of Conversations with Transmission Delay. In: Karpov, A., Potapova, R. (eds) Speech and Computer. SPECOM 2020. Lecture Notes in Computer Science(), vol 12335. Springer, Cham. https://doi.org/10.1007/978-3-030-60276-5_33
Download citation
DOI: https://doi.org/10.1007/978-3-030-60276-5_33
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-60275-8
Online ISBN: 978-3-030-60276-5
eBook Packages: Computer ScienceComputer Science (R0)