
Speaking with Harmony

Finding the Right Thing to Do or Say … While in Bed (or Anywhere Else)

Maschinenliebe

Abstract

Doing or saying the right thing in response to circumstances is a constant problem, especially for embodied personal companions like Realbotix’s Harmony. In this paper we describe the Harmony system, how it finds the right thing to say or do, and how recent advances in neural network-based natural language processing and generation will be integrated into next-generation systems. These advances will allow the transition from pattern-oriented responses to dynamic narrative-oriented response generation. Future systems will be able to adapt to their situation much more flexibly and will allow a wider range of role-playing and interaction.

You’re a robot? I should have known. No human is that humane.

(Ellen Louise Ripley in Alien: Resurrection)


Notes

  1. Part of Patent Pending work.



Corresponding author

Correspondence to Kino Coursey.


Copyright information

© 2020 The Editor(s) and the Author(s), exclusively licensed by Springer Fachmedien Wiesbaden GmbH, part of Springer Nature

About this chapter


Cite this chapter

Coursey, K. (2020). Speaking with Harmony. In: Bendel, O. (ed.) Maschinenliebe. Springer Gabler, Wiesbaden. https://doi.org/10.1007/978-3-658-29864-7_3
