Skip to main content

Interactivity-Based Quality Prediction of Conversations with Transmission Delay

  • Conference paper
  • First Online:
Speech and Computer (SPECOM 2020)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 12335))

Included in the following conference series:

Abstract

While the experienced quality of a conversation transmitted over a telephone network is dependent on the parameters of the network and possible degradations, it has been shown that the sensitivity to these degradations is influenced by the type of the conversation and its contents. Especially for delayed speech transmission the conversational scenario and the adaption of turn-taking behavior of the interlocutors factor into the conversational quality rating of a specific conversation. Parametric Conversation Analysis has been proven to be a good method to extract parameters from a recorded conversation that are representable of the interactivity. While the narrowband version of a popular and standardized quality prediction model, the E-model, uses the interactivity to predict a conversational quality MOS for the given conditions, there are no attempts to predict the conversational quality of individual conversations so far. In this paper, we propose a model to predict the quality of a conversation under the influence of transmission delay based on the interactivity parameters extracted from that conversation. We evaluate which parameters are most suited for such a prediction, and compare our results to the parameter-based predictions of the E-model.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Egger, S., Schatz, R., Scherer, S.: It takes two to tango-assessing the impact of delay on conversational interactivity on perceived speech quality. In: Eleventh Annual Conference of the International Speech Communication Association, pp. 1321–1324. ISCA (2010)

    Google Scholar 

  2. Egger, S., Schatz, R., Schoenenberg, K., Raake, A., Kubin, G.: Same but different? - Using speech signal features for comparing conversational VoIP quality studies. In: 2012 IEEE International Conference on Communications (ICC), pp. 1320–1324. IEEE (2012)

    Google Scholar 

  3. Hammer, F.: Quality aspects of packet-based interactive speech communication. Forschungszentrum Telekommunikation Wien (2006)

    Google Scholar 

  4. ITU-T Recommandation G.107: The E-model: a computational model for use in transmission planning. International Telecommunication Union, Geneva (2011). http://handle.itu.int/11.1002/1000/12505

  5. ITU-T Recommandation G.107.1: Wideband E-model. International Telecommunication Union, Geneva (2015)

    Google Scholar 

  6. ITU-T Recommandation G.107.2: Fullband E-model. International Telecommunication Union, Geneva (2019)

    Google Scholar 

  7. ITU-T Recommendation P.59: Artificial Conversational Speech. International Telecommunication Union (1993)

    Google Scholar 

  8. ITU-T Recommendation P.805: Subjective Evaluation of Conversational Quality. International Telecommunication Union, Geneva (2007)

    Google Scholar 

  9. Kitawaki, N., Itoh, K.: Pure delay effects on speech quality in telecommunications. IEEE J. Sel. Areas Commun. 9(4), 586–593 (1991)

    Article  Google Scholar 

  10. Köster, F., Guse, D., Wältermann, M., Möller, S.: Comparison between the discrete ACR scale and an extended continuous scale for the quality assessment of transmitted speech. Fortschritte der Akustik-DAGA (2015)

    Google Scholar 

  11. Lee, H., Un, C.: A study of on-off characteristics of conversational speech. IEEE Trans. Commun. 34(6), 630–637 (1986)

    Article  Google Scholar 

  12. Michael, T., Möller, S.: Effects of delay and packet-loss on the conversational quality. Fortschritte der Akustik-DAGA (2020)

    Google Scholar 

  13. Raake, A., Schoenenberg, K., Skowronek, J., Egger, S.: Predicting speech quality based on interactivity and delay. In: Proceedings of INTERSPEECH, pp. 1384–1388 (2013)

    Google Scholar 

  14. Reichl, P., Hammer, F.: Hot discussion or frosty dialogue? Towards a temperature metric for conversational interactivity. In: Eighth International Conference on Spoken Language Processing (2004)

    Google Scholar 

  15. Sacks, H., Schegloff, E., Jefferson, G.: A simplest systematics for the organization of turn-taking for conversation. Language 50(4), 696–735 (1974). https://doi.org/10.2307/412243. http://www.jstor.org/stable/412243?origin=crossref

  16. Schoenenberg, K.: The Quality of Mediated-Conversations under Transmission Delay. Ph.D. thesis, TU Berlin (2015). http://dx.doi.org/10.14279/depositonce-4990

  17. Uhrig, S., Michael, T., Möller, S., Keller, P.E., Voigt-Antons, J.N.: Effects of delay on perceived quality, behavior and oscillatory brain activity in dyadic telephone conversations. In: 2018 Tenth International Conference on Quality of Multimedia Experience (QoMEX), pp. 1–6. IEEE (2018)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Thilo Michael .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Michael, T., Möller, S. (2020). Interactivity-Based Quality Prediction of Conversations with Transmission Delay. In: Karpov, A., Potapova, R. (eds) Speech and Computer. SPECOM 2020. Lecture Notes in Computer Science(), vol 12335. Springer, Cham. https://doi.org/10.1007/978-3-030-60276-5_33

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-60276-5_33

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-60275-8

  • Online ISBN: 978-3-030-60276-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics