Abstract
Doing or saying the right thing in response to circumstances is a constant problem, especially for embodied personal companions like Realbotix’s Harmony. In this paper we will describe the Harmony system, how it finds the right thing to say or do, and how recent advances in neural network-based natural language processing and generation will be integrated into next-generation systems. These advances will allow the transition from pattern-oriented responses to dynamic narrative-oriented response generation. Future systems will be able adapt to their situation much more flexibly, and allow a wider range of role-playing and interaction.
You’re a robot? I should have known. No human is that humane.
(Ellen Louise Ripley in Alien: Resurrection)
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Part of Patent Pending work.
References
Adiwardana D, Luong MT, So DR, Hall J, Fiedel N, Thoppilan R, Le QV (2020) Towards a human-like open-domain chatbot. arXiv preprint arXiv:2001.09977. Accessed: 15 May 2020
Amazon Alexa (2020) https://developer.amazon.com/en-US/alexa. Accessed: 15 May 2020
Becker-Asano C, Ishiguro H (2011) Evaluating facial displays of emotion for the android robot Geminoid F. In: 2011 IEEE workshop on affective computational intelligence (WACI), April 2011:1–8
Bendel O (2018) SSML for sex robots. In: Cheok AD, Levy D (eds) Love and sex with robots. Third International Conference, LSR 2017, London, UK, December 19–20, 2017, Revised Selected Paper. Springer International Publishing, Cham, pp 1–11
Bendel O, Studer D, Richards B (2019) The BESTBOT Project. In: Bendel O (ed) Handbuch Maschinenethik (Springer Reference Geisteswissenschaften). Springer VS, Wiesbaden, pp 335–353
BINA48 (2020) Wikipedia https://en.wikipedia.org/wiki/BINA48. Retrieved: 26 April 2020
Brown TB, Mann B, Ryder N, Subbiah M, Kaplan J, Dhariwal P, Agarwal S (2020) Language models are few-shot learners. arXiv preprint arXiv:2005.14165
Buss DM, Meston CM (2007) Why humans have sex. Arch Sex Behav 36(4):477–507
Clark P, Tafjord O, Richardson K (2020) Transformers as soft reasoners over language. arXiv preprint arXiv:2002.05867. Accessed: 15 May 2020
Cosmai A (2008) Keywords in the mist: Automated keyword extraction for very large documents and back of the book indexing. University of North Texas, Denton.
Cosmai A, Mihalcea R (2008) Linking documents to encyclopedic knowledge. IEEE Intelligent Systems 23(5):34–41
Coursey K (2009) The value of everything: ranking and association with encyclopedic knowledge. University of North Texas, Denton
Coursey K, Pirzchalski S, McMullen M, Lindroth G, Furuushi Y (2019) Living with harmony: a personal companion system by realbotix™. In: Zhou Y, Fischer MH (eds) AI love you. Springer, Cham, pp 77–95
Crawl BookCorpus (2020) Retrieved April 25, 2020, from github.com website https://github.com/soskek/bookcorpus. Accessed: 15 May 2020
Cummings W (2019) “Whitney Cummings: Can I touch it?” Netflix website. https://www.netflix.com/title/80213715. Accessed: 15 May 2020
Daz 3D (2020) Daz3d website. https://www.daz3d.com. Accessed: 15 May 2020
De Raad B (2000) The big five personality factors: the psycholexical approach to personality. Hogrefe & Huber, Bern
Devlin J, Chang MW, Lee K, Toutanova K (2018) Bert: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805. Accessed: 15 May 2020
Ellison M (Producer), Jonze S (Producer, Director), Landay V (Producer) (2013) Her [Motion Picture]. Warner Bros. Pictures, Los Angeles
Glas DF, Minato T, Ishi CT, Kawahara T, Ishiguro H (2016) ERICA: the erato intelligent conversational android. In: 2016 25th IEEE International Symposium on Robot and Human Interactive Communication (RO-MAN), August 2016, pp 22–29
Google Assistant (2020) Google website. https://assistant.google.com. Accessed: 15 May 2020
Greene SM (2016) Bina48: Gender, race, and queer artificial life. Ada: A Journal of Gender, New Media & Technology, 9. https://adanewmedia.org/2016/05/issue9-greene/. Accessed: 13 August 2020
Hanson D, Olney A, Prilliman S, Mathews E, Zielke M, Hammons D, Stephanou H (2005) Upending the uncanny valley. AAAI 5:1728–1729
Harmon A (2010) Making friends with a robot named bina48. The New York Times, 4 July 2010, p 4
Hauswald J, Laurenzano MA, Zhang Y, Li C, Rovinski A, Khurana A, Mars J (2015) Sirius: an open end-to-end voice and vision personal assistant and its implications for future warehouse scale computers. In: Proceedings of the Twentieth International Conference on Architectural Support for Programming Languages and Operating Systems, March 2015, pp 223–238
Hutter M (2012) One decade of universal artificial intelligence. Theoretical Foundations of Artificial General Intelligence 4(2012):67–88
Itti L, Dhavale N, Pighin F (2003) Realistic avatar eye and head animation using a neurobiological model of visual attention. Applications and Science of Neural Networks, Fuzzy Systems, and Evolutionary Computation VI 5200:64–78
Kawahara T (2019) Spoken dialogue system for a human-like conversational robot ERICA. In: 9th International Workshop on Spoken Dialogue System Technology. Springer, Singapore, pp 65–75
Lee SP, Badler JB, Badler NI (2002) Eyes alive. In: Proceedings of the 29th annual conference on Computer graphics and interactive techniques, July 2002, pp 637–644
Liu P, Glas DF, Kanda T, Ishiguro H, Hagita N (2014) How to train your robot-teaching service robots to reproduce human social behavior. In: The 23rd IEEE International Symposium on Robot and Human Interactive Communication, August 2014, pp 961–968
Loebner Prize (2020) Wikipedia. https://en.wikipedia.org/wiki/Loebner_Prize. Accessed: 15 May 2020
McMullen M (2014) Doll head having a magnetically adjustable facial contour and method of assembling same. U.S. Patent No. 8,888,553. Washington, DC: U.S. Patent and Trademark Office
Microsoft Cortana (2020) Microsoft website. https://blogs.windows.com/windowsexperience/2015/02/10/how-cortana-comes-to-life-in-windows-10/. Accessed: 15 May 2020
Mihalcea R, Tarau P (2004) Textrank: bringing order into text. In: Proceedings of the 2004 conference on empirical methods in natural language processing, July 2004, pp 404–411
Mori M, MacDorman KF, Kageki N (2012) The uncanny valley [from the field]. IEEE Robot Autom Mag 19(2):98–100
Nishio S, Ishiguro H, Hagita N (2007) Geminoid: teleoperated android of an existing person. Humanoid robots: New developments 14:343–352
Oh JH, Hanson D, Kim WS, Han Y, Kim JY, Park IW (2006) Design of android type humanoid robot Albert HUBO. In: 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems, October 2006, pp 1428–1433
OpenWebText Corpus (2020) github.io website https://skylion007.github.io/OpenWebTextCorpus/. Accessed: 15 May 2020
Radford A, Wu J, Child R, Luan D, Amodei D, Sutskever I (2019) Language models are unsupervised multitask learners. OpenAI Blog 1(8):9
Ramos J (2003) Using tf-idf to determine word relevance in document queries. In: Proceedings of the first instructional conference on machine learning (Vol. 242), December 2003, pp 133–142
Roller S, Dinan E, Goyal N, Ju D, Williamson M, Liu Y, Xu J, Ott M, Shuster K, Smith EM, Boureau YL (2020) Recipes for building an open-domain chatbot. arXiv preprint arXiv:2004.13637. Accessed: 15 May 2020
Samsung Bixby (2020) Samsung website. https://www.samsung.com/us/explore/bixby/. Accessed: 15 May 2020
Shoeybi M, Patwary M, Puri R, LeGresley P, Casper J, Catanzaro B (2019) Megatron-lm: training multi-billion parameter language models using gpu model parallelism. arXiv preprint arXiv:1909.08053. Accessed: 15 May 2020
Siri-Apple (2020) Apple website: https://www.apple.com/siri/. Accessed: 15 May 2020
Springfieldspringfield.co.uk (2020) “Springfield! Springfield!” Archive.org website https://web.archive.org/web/sitemap/ https://www.springfieldspringfield.co.uk/. Accessed: 15 May 2020
Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Polosukhin I (2017) Attention is all you need. In: Advances in neural information processing systems, pp 5998–6008
Turing AM (2009) Computing machinery and intelligence. In: Epstein R, Roberts G, Beber G (eds) Parsing the turing test. Springer, Dordrecht, pp 23–65
Unity (2020) Unity 3D website. https://www.unity.com/. Accessed: 15 May 2020
Wallace RS (2003) The elements of AIML style. Alice AI Foundation, 139
Wallace RS (2014) Method for personalizing chat bots. U.S. Patent No. 8,818,926. Washington, DC: U.S. Patent and Trademark Office
Wolf T, Debut L, Sanh V, Chaumond J, Delangue C, Moi A, Brew J (2019) Transformers: State-of-the-art Natural Language Processing. arXiv preprint arXiv:1910.03771. Accessed: 15 May 2020
Zhang Y, Sun S, Galley M, Chen YC, Brockett C, Gao X, Dolan B (2019) DialoGPT: large-scale generative pre-training for conversational response generation. arXiv preprint arXiv:1911.00536. Accessed: 15 May 2020
Zhou L, Gao J, Li D, Shum HY (2020) The design and implementation of Xiaoice, an empathetic social chatbot. Computational Linguistics 46(1):53–93
Zhu Y, Kiros R, Zemel RS, Salakhutdinov R, Urtasun R, Torralba A, Fidler S (2015) Aligning books and movies: towards story-like visual explanations by watching movies and reading books. 2015 IEEE International Conference on Computer Vision (ICCV), pp 19–27
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Der/die Herausgeber bzw. der/die Autor(en), exklusiv lizenziert durch Springer Fachmedien Wiesbaden GmbH, ein Teil von Springer Nature
About this chapter
Cite this chapter
Coursey, K. (2020). Speaking with Harmony. In: Bendel, O. (eds) Maschinenliebe. Springer Gabler, Wiesbaden. https://doi.org/10.1007/978-3-658-29864-7_3
Download citation
DOI: https://doi.org/10.1007/978-3-658-29864-7_3
Published:
Publisher Name: Springer Gabler, Wiesbaden
Print ISBN: 978-3-658-29863-0
Online ISBN: 978-3-658-29864-7
eBook Packages: Computer Science and Engineering (German Language)