International Journal of Social Robotics

, Volume 3, Issue 3, pp 253–262 | Cite as

The Effects of Synthesized Voice Accents on User Perceptions of Robots

  • Rie Tamagawa
  • Catherine I. Watson
  • I. Han Kuo
  • Bruce A. MacDonald
  • Elizabeth Broadbent
Original Paper

Abstract

Human voice accents have been shown to affect people’s perceptions of the speaker, but little research has looked at how synthesized voice accents affect perceptions of robots. This research investigated people’s perceptions of three synthesized voice accents. Three male robot voices were generated: British (UK), American (US), and New Zealand (NZ). In study one, twenty adults listened through headphones to a recorded script repeated in the three different accents, rated the nationality, roboticness, and overall impression of each voice, and chose their preferred accent. Study two used these voices on a healthcare robot to investigate the influence of accent on user perceptions of the robot. Ninety-one individuals were randomized to one of three conditions. In each condition they interacted with a healthcare robot that assisted with blood pressure measurement but the conditions differed in the accent the robot spoke with. In study one, each accent was correctly identified. There was no difference in impression ratings of each voice, but the US accent was rated as more robotic than the NZ accent, and the UK accent was preferred to the US accent. Study two showed that people randomized to the NZ accent had more positive feelings towards the robot and rated the robot’s overall performance as higher compared to the robot with the US voice. These results suggest that the employment of a less robotic voice with a local accent may positively affect user perceptions of robots.

Keywords

Human-robot interaction Acceptability Robot Voice Perception 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Alamsaputra DM, Kohnert KJ, Munson B, Reichle J (2006) Synthesized speech intelligibility among native speakers and non-native speakers of English. Augment Altern Commun 22:258–268 CrossRefGoogle Scholar
  2. 2.
    Aronovitch CD (1976) The voice of personality: Stereotyped judgment and their relation to voice quality and sex of speaker. J Soc Psychol 99:207–220 CrossRefGoogle Scholar
  3. 3.
    Arras KO, Cerqui D (2005) Do we want to share our lives and bodies with robots? A 2000 people survey. Technical report, Autonomous Systems Lab, Swiss Federal Institute of Technology Lausanne Google Scholar
  4. 4.
    Atrash A, Kaplow R, Villemure J, West R, Yamani H, Pineau J (2009) Development and validation of a robust speech interface for improved human-robot interaction. Int J Soc Robot 1:345–356 CrossRefGoogle Scholar
  5. 5.
    Ball P (1983) Stereotypes of anglo-saxon and non anglo-saxon accents: some exploratory Australian studies with the matched guise technique. Lang Sci 5:163–183 CrossRefGoogle Scholar
  6. 6.
    Bayard D (1995) Kiwitalk: sociolinguistics and New Zealand society. Dunmore Press, Palmerston North Google Scholar
  7. 7.
    Bayard D (1999) The cultural cringe revisited: changes through time in KIWI Attitudes towards accents. In: Bell A, Kuiper K (eds) New Zealand English. Benjamins, Amsterdam, pp 297–324 Google Scholar
  8. 8.
    Bayard D, Weatherall A, Gallois C, Pittam J (2001) Pax Americana? Accent attitudinal evaluations in New Zealand, Australia, and America. J Socioling 5:22–49 CrossRefGoogle Scholar
  9. 9.
    Bennewitz M, Faber F, Joho D, Behnke S (2007) Fritz—a humanoid communication robot. In: Proc IEEE international workshop of robot and human interactive communication (ROMAN), Jeju Island, Korea, pp 1072–1077 Google Scholar
  10. 10.
    Berry DS, Hansen JS (1996) Positive affect, negative affect, and social interaction. J Pers Soc Psychol 71:796–809 CrossRefGoogle Scholar
  11. 11.
    Black AW, Lenzo KA (2007) Building synthetic voices. http://festvox.org/festvox/festvox_toc.html
  12. 12.
    Black AW, Taylor P, Caley R (1999) The festival speech synthesis system. http://www.cstr.ed.ac.uk/projects/festival/
  13. 13.
    Breazeal C (2001) Emotive qualities in robot speech. In: Proc the 2001 IEEE/RSJ international conference on intelligent robots and systems, Maui, pp 1388–1394 Google Scholar
  14. 14.
    Broadbent E, MacDonald BA, Jago L, Juergens M, Mazharullah O (2007) Human reactions to good and bad robots. In: Proc IEEE/RSJ international conference on intelligent robots and systems IROS, pp 3703–3708 Google Scholar
  15. 15.
    Cargile A, Giles H (1997) Understanding language attitudes: exploring listener affect and identity. Lang Commun 17:195–217 CrossRefGoogle Scholar
  16. 16.
    Cesta A, Cortellessa G, Giuliani MV, Pecora F, Scopelliti M, Tiberio L (2007) Psychological implications of domestic assistive technology for the elderly. Psychol J 5:229–252 Google Scholar
  17. 17.
    Fitt S (2000) Documentation and user guide to Unisyn Lexicon and Post-Lexical rules. Technical report, Centre for Speech Technology Research, University of Edinburgh Google Scholar
  18. 18.
    Giles H (1970) Evaluative reactions to accents. Educ Rev 22:211–227 CrossRefGoogle Scholar
  19. 19.
    Giles H, Williams A, Mackie DM, Rosselli F (1995) Reactions to Anglo- and Hispanic-American-accented speakers: Affect, identity, persuasion, and the English-only controversy. Lang Commun 15:107–120 CrossRefGoogle Scholar
  20. 20.
    Goetz J, Kiesler S, Powers A (2003) Matching robot appearance and behavior to tasks to improve human-robot cooperation. In: Proc the 12th IEEE international symposium on robot and human interactive communication, Millbrae, California, USA, pp 55–60 Google Scholar
  21. 21.
    Hall JA, Roter DL, Rand CS (1981) Communication of affect between patient and physician. J Health Soc Behav 22:18–30 CrossRefGoogle Scholar
  22. 22.
    Huygens I, Vaughan GM (1983) Language attitudes, ethnicity and social class in New Zealand. J Multiling Multicult Dev 4:207–223 CrossRefGoogle Scholar
  23. 23.
    Igic A, Watson CI, Teutenberg J, Tamagawa R, Macdonald BA, Broadbent E (2009) Towards a flexible platform for voice accent and expression selection on a Healthcare Robot. In: Proc the 2009 Australasian language technology workshop, Sydney, pp 109–113 Google Scholar
  24. 24.
    Kuo IH, Rabindran J, Broadbent E, Lee YI, Kerse N, Stafford R, MacDonald BA (2009) Age and gender factors in user acceptance of healthcare robots. In: Proc the 18th IEEE international symposium on robot and human interactive communication, Toyama, Japan, pp 214–219 CrossRefGoogle Scholar
  25. 25.
    LeBaron S, Reyher J, Stack JM (1985) Paternalistic vs egalitarian physician styles: The treatment of patients in crisis. J Fam Pract 21:56–62 Google Scholar
  26. 26.
    Li X, Watson CI, Igic A, Macdonald BA (2009) Expressive speech for a virtual talking head. In: Australasian conference on robotics and automation, Sydney Google Scholar
  27. 27.
    Luhman R (1990) Appalachian English stereotypes: Language attitudes in Kentucky. Lang Soc 19:331–348 CrossRefGoogle Scholar
  28. 28.
    Mayer RE, Sobko K, Mautone PD (2003) Social cues in multimedia learning: roles of speaker’s voice. J Educ Psychol 95:419–425 CrossRefGoogle Scholar
  29. 29.
    Mullennix JW, Johnson K, Topcu-Durgun M, Farnsworth LW (1995) The perceptual representation of voice gender. J Acoust Soc Am 98:3080–3095 CrossRefGoogle Scholar
  30. 30.
    Mullennix JW, Stern SE, Wilson SJ, Dyson C (2003) Social perception of male and female computer synthesized speech. Comput Hum Behav 19:407–424 CrossRefGoogle Scholar
  31. 31.
    Nass C, Brave S (2005) Wired for speech: how voice activates and advances the human-computer relationship. MIT Press, Cambridge Google Scholar
  32. 32.
    Nass C, Gong L (2000) Social aspects of speech interfaces from an evolutionary perspective: experimental research and design implications. Commun ACM 43(9):36–43 CrossRefGoogle Scholar
  33. 33.
    Niculescu AI, White GM, Lan SS, Waloejo RU, Kawaguchi Y (2008) Impact of English regional accents on user acceptance of voice user interfaces. In: Proc NordiCHI 2008, vol 358. ACM, New York, pp 523–526 CrossRefGoogle Scholar
  34. 34.
    Oestreicher L (2007) Cognitive, social, sociable or just socially acceptable robots. In: Proc the 16th IEEE international symposium on robot and human interactive communication (ROMAN), Jeju Island, Korea, pp 558–563 CrossRefGoogle Scholar
  35. 35.
    Pucher M, Schuchmann G, Fröhlich P (2008) Regionalized text-to-speech systems: Persona design and application scenarios. In: COST action 2102 school, Vietri sul Mare, Italy. Lecture notes in artificial intelligence (LNAI), vol 5398, pp 216–222 Google Scholar
  36. 36.
    Robins B, Dautenhahn K, te Boekhorst R, Billard A (2004) Robots as assistive technology—does appearance matter. In: Proc the 13th IEEE international workshop on robot and human interactive communication (ROMAN), Okayama, Japan, pp 277–282 Google Scholar
  37. 37.
    Roehling S, MacDonald BA, Watson C (2006) Towards expressive speech synthesis in English on a robot platform. In: Proc the 11th Australian international conference on speech science and technology, Auckland, New Zealand, pp 130–135 Google Scholar
  38. 38.
    Stern SE (2008) Computer-synthesized speech and perceptions of the social influence of disabled users. J Lang Soc Psychol 27:254–265 CrossRefGoogle Scholar
  39. 39.
    Tusing KJ, Dillard JP (2000) The sound of dominance: vocal precursors of perceived dominance during interpersonal influence. Hum Commun Res 26:148–171 Google Scholar
  40. 40.
    Walters ML, Syrdal DS, Koay KL, Dautenhahn K, te Boekhorst R (2008) Human approach distance to a mechanical-looking robot with different robot voice styles. In: Proc the 17th IEEE international symposium on robot and human interactive communication, Munich, Germany, pp 707–712 CrossRefGoogle Scholar
  41. 41.
    Watson D, Clark LA, Tellegen A (1988) Development and validation of a brief measure of positive and negative affect: the PANAS scales. J Pers Soc Psychol 54:1063–1070 CrossRefGoogle Scholar
  42. 42.
    Watson CI, Teutenberg J, Thompson L, Roehling S, Igic A (2009) How to build a New Zealand voice. In: NZ linguistic society conference, Palmerston North Google Scholar

Copyright information

© Springer Science & Business Media BV 2011

Authors and Affiliations

  • Rie Tamagawa
    • 1
  • Catherine I. Watson
    • 2
  • I. Han Kuo
    • 2
  • Bruce A. MacDonald
    • 2
  • Elizabeth Broadbent
    • 1
  1. 1.Department of Psychological MedicineThe University of AucklandAucklandNew Zealand
  2. 2.Department of Electronic and Computer EngineeringThe University of AucklandAucklandNew Zealand

Personalised recommendations