Skip to main content

Iterative Design of an Emotive Voice for the Tabletop Robot Haru

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 13086))

Abstract

Designing a voice for a social robot is particularly challenging because the voice needs to convincingly convey a target personality while maintaining rich, emotive capabilities in order to foster the development of bonds with humans. In this paper, we describe the ongoing design and implementation process of a voice for a social robot. To aid in our design and analysis, we identify three desirable characteristics for its voice: 1. convincingness, 2. emotiveness, and 3. consistency. In this paper, we present a preliminary study that investigates convincingness by comparing samples taken from human voice talents and eliciting human judgements about their appropriateness. This study compares human judgements, elicited through surveys, on a range of characteristics related to convincingness, emotions conveyed, and impressions of the overall consistency of the voice. Finally, we discuss the implications of the survey findings for designing a voice for a social robot.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   109.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   139.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    \(p<0.05\) as measured by a single-tail t-test.

References

  1. Alonso-Martín, F., Malfaz, M., Castro-González, Á., Castillo, J.C., Salichs, M.A., et al.: Online evaluation of text to speech systems for three social robots. In: Salichs, M.A. (ed.) ICSR 2019. LNCS (LNAI), vol. 11876, pp. 155–164. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-35888-4_15

    Chapter  Google Scholar 

  2. Barchard, K.A., Williams, J.: Practical advice for conducting ethical online experiments and questionnaires for united states psychologists. Behav. Res. Methods 40(4), 1111–1128 (2008)

    Article  Google Scholar 

  3. Barnes, J., Richie, E., Lin, Q., Jeon, M., Park, C.H.: Emotive voice acceptance in human-robot interaction. In: Proceedings of the 24th International Conference on Auditory Display (2018)

    Google Scholar 

  4. Behrend, T.S., Sharek, D.J., Meade, A.W., Wiebe, E.N.: The viability of crowdsourcing for survey research. Behav. Res. Methods 43(3), 800–813 (2011)

    Article  Google Scholar 

  5. Breazeal, C.: Emotive qualities in robot speech. In: Proceedings 2001 IEEE/RSJ International Conference on Intelligent Robots and Systems. Expanding the Societal Role of Robotics in the the Next Millennium (Cat. No.01CH37180), vol. 3, pp. 1388–1394, October 2001. https://doi.org/10.1109/IROS.2001.977175

  6. Dandurand, F., Shultz, T.R., Onishi, K.H.: Comparing online and lab methods in a problem-solving experiment. Behav. Res. Methods 40(2), 428–434 (2008)

    Article  Google Scholar 

  7. Dou, X., Wu, C.-F., Lin, K.-C., Tseng, T.-M.: The effects of robot voice and gesture types on the perceived robot personalities. In: Kurosu, M. (ed.) HCII 2019. LNCS, vol. 11566, pp. 299–309. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-22646-6_21

    Chapter  Google Scholar 

  8. Edwards, C., Edwards, A., Stoll, B., Lin, X., Massey, N.: Evaluations of an artificial intelligence instructor’s voice: social identity theory in human-robot interactions. Comput. Hum. Behav. 90, 357–362 (2019)

    Article  Google Scholar 

  9. Eyssel, F., De Ruiter, L., Kuchenbrandt, D., Bobinger, S., Hegel, F.: ‘If you sound like me, you must be more human’: on the interplay of robot and user features on human-robot acceptance and anthropomorphism. In: 2012 7th ACM/IEEE International Conference on Human-Robot Interaction (HRI), pp. 125–126 (2012)

    Google Scholar 

  10. Gomez, R., Nakamura, K., Szapiro, D., Merino, L.: A holistic approach in designing tabletop robot’s expressivity. In: Proceedings of the International Conference on Robotics and Automation (2020)

    Google Scholar 

  11. Gomez, R., Szapiro, D., Galindo, K., Nakamura, K.: Haru: hardware design of an experimental tabletop robot assistant. In: Proceedings of the 2018 ACM/IEEE International Conference on Human-Robot Interaction, February 2018

    Google Scholar 

  12. Heerink, M., Kröse, B., Evers, V., Wielinga, B.: Relating conversational expressiveness to social presence and acceptance of an assistive social robot. Virtual Reality 14(1), 77–84 (2010)

    Article  Google Scholar 

  13. Levy, D.B.: Animation Development: From Pitch to Prod. Simon & Schuster (2010)

    Google Scholar 

  14. Macdonald, I.W.: Tablets of stone or DNA? TV series bibles. J. Screenwriting 9(1), 3–23 (2018)

    Article  Google Scholar 

  15. McGinn, C., Torre, I.: Can you tell the robot by the voice? an exploratory study on the role of voice in the perception of robots. In: 2019 14th ACM/IEEE International Conference on Human-Robot Interaction (HRI), pp. 211–221. IEEE (2019)

    Google Scholar 

  16. Niculescu, A., van Dijk, B., Nijholt, A., Li, H., See, S.L.: Making social robots more attractive: the effects of voice pitch, humor and empathy. Int. J. Soc. Robot. 5(2), 171–191 (2013)

    Article  Google Scholar 

  17. Plutchik, R.: A psychoevolutionary theory of emotions. Soc. Sci. Inf. 21(4–5), 529–553 (1982). https://doi.org/10.1177/053901882021004003

  18. Roehling, S., MacDonald, B., Watson, C.: Towards expressive speech synthesis in English on a robotic platform. In: Proceedings of the Australasian International Conference on Speech Science and Technology, pp. 130–135. Citeseer (2006)

    Google Scholar 

  19. Tang, H., Fu, Y., Tu, J., Hasegawa-Johnson, M., Huang, T.S.: Humanoid audio-visual avatar with emotive text-to-speech synthesis. IEEE Trans. Multimedia 10(6), 969–981 (2008). https://doi.org/10.1109/TMM.2008.2001355

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Eric Nichols .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Nichols, E., Siskind, S.R., Kamino, W., Šabanović, S., Gomez, R. (2021). Iterative Design of an Emotive Voice for the Tabletop Robot Haru. In: Li, H., et al. Social Robotics. ICSR 2021. Lecture Notes in Computer Science(), vol 13086. Springer, Cham. https://doi.org/10.1007/978-3-030-90525-5_31

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-90525-5_31

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-90524-8

  • Online ISBN: 978-3-030-90525-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics