Exploring Verbal Uncanny Valley Effects with Vague Language in Computer Speech

Clark, Leigh; Ofemile, Abdulmalik; Cowan, Benjamin R.

doi:10.1007/978-981-15-6627-1_17

Leigh Clark^8,9,
Abdulmalik Ofemile¹⁰ &
Benjamin R. Cowan⁸

Part of the book series: Prosody, Phonology and Phonetics ((PRPHPH))

1023 Accesses
7 Citations
2 Altmetric

Abstract

Interactions with speech interfaces are growing, helped by the advent of intelligent personal assistants like Amazon Alexa and Google Assistant. This software is utilised in hardware such as smart home devices (e.g. Amazon Echo and Google Home), smartphones and vehicles. Given the unprecedented level of spoken interactions with machines, it is important we understand what is considered appropriate, desirable and attractive computer speech. Previous research has suggested that the overuse of humanlike voices in limited-communication devices can induce uncanny valley effects—a perceptual tension arising from mismatched stimuli causing incongruence between users’ expectations of a system and its actual capabilities. This chapter explores the possibility of verbal uncanny valley effects in computer speech by utilising the interpersonal linguistic strategies of politeness, relational work and vague language. This work highlights that using these strategies can create perceptual tension and negative experiences due to the conflicting stimuli of computer speech and ‘humanlike’ language. This tension can be somewhat moderated with more humanlike than robotic voices, though not alleviated completely. Considerations for the design of computer speech and subsequent future research directions are discussed.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 119.00; Price excludes VAT (USA)

Softcover Book: USD 159.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Effect of Speech Entrainment in Human-Computer Conversation: A Review

Is Spoken Language All-or-Nothing? Implications for Future Speech-Based Human-Machine Interaction

An Interaction Framework for Designing Systems for Virtual Home Assistants and People with Dysarthria

Article Open access 05 September 2023

Notes

1.
Discourse markers may also be referred to, amongst other terms, as discourse particles, pragmatic particles and pragmatic expressions. Their purposes can include switching topics, marking boundaries between segments of talk, helping to conduct linguistic repair and being used as hedging devices (Jucker & Ziv, 1998).
2.
These were adaptors, e.g. more or less, somewhat (reduce assertiveness, minimise imposition); discourse markers, e.g. so, now (structure talk, mitigate assertive impact of utterance); minimisers, e.g. just, basically (structure talk, reduce perceived difficulty, mitigate utterance impact) and vague nouns, e.g. thing, bit (improve language efficiency) (Clark et al., 2016).
3.
https://www.cepstral.com.
4.
https://www.cereproc.com.

References

Abercrombie, D. (1967). Elements of general phonetics (Vol. 203). Edinburgh: Edinburgh University Press.
Google Scholar
Aylett, M. P., Cowan, B. R., & Clark, L. (2019). Siri, echo and performance: You have to suffer darling. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems. ACM.
Google Scholar
Bickmore, T. W., Trinh, H., Olafsson, S., O’Leary, T. K., Asadi, R., Rickles, N. M., & Cruz, R. (2018). Patient and consumer safety risks when using conversational assistants for medical information: An observational study of Siri, Alexa, and Google Assistant. Journal of Medical Internet Research, 20(9). https://doi.org/10.2196/11510.
Brown, P., & Levinson, S. C. (1987). Politeness: Some universals in language usage. Cambridge University Press.
Google Scholar
Cameron, D. (2001). Working with spoken discourse. SAGE.
Google Scholar
Carr, E. W., Hofree, G., Sheldon, K., Saygin, A. P., & Winkielman, P. (2017). Is that a human? Categorization (dis)fluency drives evaluations of agents ambiguous on human-likeness. Journal of Experimental Psychology: Human Perception and Performance, 43(4), 651–666. https://doi.org/10.1037/xhp0000304.
Article Google Scholar
Channell, J. (1994). Vague language. Oxford University Press.
Google Scholar
Clark, L. (2018). Social boundaries of appropriate speech in HCI: A politeness perspective. In Proceedings of British HCI.
Google Scholar
Clark, L., Cabral, J. & Cowan, B. R. (2018). The CogSIS project: Examining the cognitive effects of speech interface synthesis. In Proceedings of British HCI.
Google Scholar
Clark, L., Doyle, P., Garaialde, D., Gilmartin, E., Schlögl, S., Edlund, J., ... & Cowan, B. R. (2019a). The state of speech in HCI: Trends, themes and challenges. Interacting with Computers, 31(4), 349–371. https://doi.org/10.1093/iwc/iwz016.
Clark, L., Pantidi, N., Cooney, O., Doyle, P., Garaialde, D., Edwards, J., ... & Cowan, B.R. (2019b, May). What makes a good conversation? challenges in designing truly conversational agents. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems (pp. 1–12). https://doi.org/10.1145/3290605.3300705.
Clark, L. M. H., Bachour, K., Ofemile, A., Adolphs, S., & Rodden, T. (2014). Potential of imprecision: Exploring vague language in agent instructors (pp. 339–344). ACM Press. https://doi.org/10.1145/2658861.2658895
Clark, L., Ofemile, A., Adolphs, S., & Rodden, T. (2016). A multimodal approach to assessing user experiences with agent helpers. ACM Transactions on Interactive Intelligent Systems, 6(4), 29:1–29:31. https://doi.org/10.1145/2983926.
Coulthard, M. (2013). Advances in spoken discourse analysis. Routledge.
Google Scholar
Cowan, B. R., Branigan, H. P., Obregón, M., Bugis, E., & Beale, R. (2015). Voice anthropomorphism, interlocutor modelling and alignment effects on syntactic choices in human − computer dialogue. International Journal of Human-Computer Studies, 83, 27–42. https://doi.org/10.1016/j.ijhcs.2015.05.008.
Article Google Scholar
Cowan, B. R., Pantidi, N., Coyle, D., Morrissey, K., Clarke, P., Al-Shehri, S., … Bandeira, N. (2017). ‘What can I help you with?’: Infrequent users’ experiences of intelligent personal assistants (pp. 1–12). ACM Press. https://doi.org/10.1145/3098279.3098539.
Gilmartin, E., Cowan, B. R., Vogel, C., & Campbell, N. (2017). Exploring multiparty casual talk for social human-machine dialogue. In International Conference on Speech and Computer (pp. 370–378). Springer.
Google Scholar
Goffman, E. (1955). On face-work. Psychiatry, 18(3), 213–231. https://doi.org/10.1080/00332747.1955.11023008.
Article Google Scholar
Goffman, E. (2005). Interaction ritual: Essays in face to face behavior. AldineTransaction.
Google Scholar
Grimshaw, M. (2009). The audio Uncanny Valley: Sound, fear and the horror game. Audio Mostly, 21–26.
Google Scholar
Hone, K. S., & Graham, R. (2000). Towards a tool for the subjective assessment of speech system interfaces (SASSI). Natural Language Engineering, 6(3–4), 287–303.
Article Google Scholar
Jucker, A. H., & Ziv, Y. (1998). Discourse markers: Descriptions and theory. John Benjamins Publishing.
Google Scholar
Kätsyri, J., Förger, K., Mäkäräinen, M., & Takala, T. (2015). A review of empirical evidence on different uncanny valley hypotheses: Support for perceptual mismatch as one road to the valley of eeriness. Frontiers in Psychology, 6. https://doi.org/10.3389/fpsyg.2015.00390.
Large, D. R., Clark, L., Quandt, A., Burnett, G., & Skrypchuk, L. (2017). Steering the conversation: A linguistic exploration of natural language interactions with a digital assistant during simulated driving. Applied Ergonomics, 63, 53–61. https://doi.org/10.1016/j.apergo.2017.04.003.
Article Google Scholar
Laver, J. (1980). The phonetic description of voice quality (Cambridge Studies in Linguistics). Cambridge: Cambridge University Press.
Google Scholar
Locher, M. A. (2004). Power and politeness in action: Disagreements in oral communication. Walter de Gruyter.
Google Scholar
Locher, M. A. (2006). Polite behavior within relational work: The discursive approach to politeness. Walter de Gruyter.
Google Scholar
Locher, M. A., & Watts, R. J. (2005). Politeness theory and relational work. Journal of Politeness Research. Language, Behaviour, Culture, 1(1). https://doi.org/10.1515/jplr.2005.1.1.9
Locher, M. A., & Watts, R. J. (2008). Relational work and impoliteness: Negotiating norms of linguistic behaviour. Mouton de Gruyter.
Google Scholar
Luger, E., & Sellen, A. (2016). ‘Like having a really bad PA’: The gulf between user expectation and experience of conversational agents. In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems (pp. 5286–5297). New York, NY, USA: ACM. https://doi.org/10.1145/2858036.2858288.
McCarthy, M., & Carter, R. (2006). This that and the other: Multi-word clusters in spoken English as visible patterns of interaction. Explorations in Corpus Linguistics, 7.
Google Scholar
Meah, L. F. S., & Moore, R. K. (2014). The uncanny valley: A focus on misaligned cues. In M. Beetz, B. Johnston, & M.-A. Williams (Eds.), Social robotics (pp. 256–265). Springer International Publishing.
Google Scholar
Mitchell, W. J., Szerszen, K. A., Lu, A. S., Schermerhorn, P. W., Scheutz, M., & MacDorman, K. F. (2011). A mismatch in the human realism of face and voice produces an Uncanny Valley. I-Perception, 2(1), 10–12. https://doi.org/10.1068/i0415.
Article Google Scholar
Moore, R. K. (2012). A Bayesian explanation of the ‘Uncanny Valley’ effect and related psychological phenomena. Scientific Reports, 2(1). https://doi.org/10.1038/srep00864.
Moore, R. K. (2015). From talking and listening robots to intelligent communicative machines. In Robots that talk and listen: de Gruyter.
Google Scholar
Moore, R. K. (2017a). Appropriate voices for artefacts: Some key insights. In 1st International Workshop on Vocal Interactivity in-and-between Humans, Animals and Robots.
Google Scholar
Moore, R. K. (2017b). Is spoken language all-or-nothing? Implications for future speech-based human-machine interaction. In Dialogues with Social Robots (pp. 281–291). Springer, Singapore. https://doi.org/10.1007/978-981-10-2585-3_22.
Mori, M. (1970). The uncanny valley. Energy, 7(4), 33–35.
Google Scholar
Mori, M., MacDorman, K. F., & Kageki, N. (2012). The uncanny valley [from the field]. IEEE Robotics and Automation Magazine, 19(2), 98–100.
Article Google Scholar
Porcheron, M., Fischer, J. E., Reeves, S., & Sharples, S. (2018). Voice interfaces in everyday life. In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems (p. 640). ACM.
Google Scholar
Porcheron, M., Fischer, J. E., & Sharples, S. (2017). ‘Do animals have accents?’: Talking with agents in multi-party conversation (pp. 207–219). ACM Press. https://doi.org/10.1145/2998181.2998298.
Strait, M., Canning, C., & Scheutz, M. (2014). Let me tell you! Investigating the effects of robot communication strategies in advice-giving situations based on robot appearance, interaction modality and distance (pp. 479–486). ACM Press. https://doi.org/10.1145/2559636.2559670.
Torrey, C., Fussell, S. R., & Kiesler, S. (2013). How a robot should give advice (pp. 275–282). IEEE. https://doi.org/10.1109/HRI.2013.6483599
Trappes-Lomax, H. (2007). Vague language as a means of self-protective avoidance: Tension management in conference talks. In Vague language explored (pp. 117–137). Springer.
Google Scholar
Wang, N., Johnson, W. L., Mayer, R. E., Rizzo, P., Shaw, E., & Collins, H. (2008). The politeness effect: Pedagogical agents and learning outcomes. International Journal of Human-Computer Studies, 66(2), 98–112. https://doi.org/10.1016/j.ijhcs.2007.09.003.
Article Google Scholar
Watts, R. J. (2003). Politeness. Cambridge University Press.
Google Scholar
Zuckerman, M., & Driver, R. E. (1988). What sounds beautiful is good: The vocal attractiveness stereotype. Journal of Nonverbal Behavior, 13(2), 67–82. https://doi.org/10.1007/BF00990791.
Article Google Scholar

Download references

Acknowledgments

This research was funded by a New Horizons grant from the Irish Research Council entitled “The COG-SIS Project: Cognitive effects of Speech Interface Synthesis” (Grant R17339).

Author information

Authors and Affiliations

School of Information, & Communication Studies, University College Dublin, Dublin, Ireland
Leigh Clark & Benjamin R. Cowan
Computational Foundry, Swansea University, Swansea, UK
Leigh Clark
English Department, FCT College of Education, Zuba, Abuja, Nigeria
Abdulmalik Ofemile

Authors

Leigh Clark
View author publications
You can also search for this author in PubMed Google Scholar
Abdulmalik Ofemile
View author publications
You can also search for this author in PubMed Google Scholar
Benjamin R. Cowan
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Leigh Clark .

Editor information

Editors and Affiliations

Technische Universität Berlin, Berlin, Germany
Benjamin Weiss
Saarland University, Saarbrücken, Germany
Jürgen Trouvain
ISEM, Montpellier, France
Melissa Barkat-Defradas
International Computer Science Institute, Berkeley, CA, USA
John J. Ohala

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Clark, L., Ofemile, A., Cowan, B.R. (2021). Exploring Verbal Uncanny Valley Effects with Vague Language in Computer Speech. In: Weiss, B., Trouvain, J., Barkat-Defradas, M., Ohala, J.J. (eds) Voice Attractiveness. Prosody, Phonology and Phonetics. Springer, Singapore. https://doi.org/10.1007/978-981-15-6627-1_17

Download citation

DOI: https://doi.org/10.1007/978-981-15-6627-1_17
Published: 11 October 2020
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-6626-4
Online ISBN: 978-981-15-6627-1
eBook Packages: Social SciencesSocial Sciences (R0)

Publish with us

Policies and ethics

Exploring Verbal Uncanny Valley Effects with Vague Language in Computer Speech

Abstract

Access this chapter

Similar content being viewed by others

Effect of Speech Entrainment in Human-Computer Conversation: A Review

Is Spoken Language All-or-Nothing? Implications for Future Speech-Based Human-Machine Interaction

An Interaction Framework for Designing Systems for Virtual Home Assistants and People with Dysarthria

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Publish with us

Navigation

Exploring Verbal Uncanny Valley Effects with Vague Language in Computer Speech

Abstract

Access this chapter

Similar content being viewed by others

Effect of Speech Entrainment in Human-Computer Conversation: A Review

Is Spoken Language All-or-Nothing? Implications for Future Speech-Based Human-Machine Interaction

An Interaction Framework for Designing Systems for Virtual Home Assistants and People with Dysarthria

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation