Toward Conscious-Like Conversational Agents

  • Milan GnjatovićEmail author
  • Branislav Borovac
Part of the Intelligent Systems Reference Library book series (ISRL, volume 106)


Although considerable effort has been already devoted to studying various aspects of human-machine interaction, we are still a long way from developing socially believable conversational agents. This paper identifies some of the main causes of the current state in the field: (i) socially believable behaviour of a technical system is misinterpreted as a functional requirement, rather than a qualitative, (ii) the currently prevalent statistical approaches cannot address research problems of managing human-machine interaction that require some sort of contextual analysis, and (iii) the structure of human-machine interaction is unjustifiably reduced to a task structure. In addition, we propose a way to address these pitfalls. We consider the capability of a technical system to simulate fundamental features of human consciousness as one of the key desiderata to perform socially believable behaviour. In line with this, the paper discusses the possibilities for the computational realization of (iv) unified interpretation, (v) learning through interaction, and (vi) context-dependent perception in the context of human-machine interaction.


Consciousness Socially believable behaviour Conversational agents Human-machine interaction Focus tree 



The presented study was sponsored by the Ministry of Education, Science and Technological Development of the Republic of Serbia under the Research grants III44008 and TR32035. The responsibility for the content of this paper lies with the authors.


  1. 1.
    Alexandersson J, Reithinger N (1997) Learning dialogue structures from a corpus. In: Proceedings of EuroSpeech-97, Rhodes, pp 2231–2235Google Scholar
  2. 2.
    Bacchiani M, Rybach D (2014) Context dependent state tying for speech recognition using deep neural network acoustic models. In: IEEE international conference on acoustics, speech and signal processing (ICASSP 2014), Florence, pp 230–234Google Scholar
  3. 3.
    Bernsen NO, Dybkjær L (2010) Multimodal usability. Springer, LondonGoogle Scholar
  4. 4.
    Bilange E (2000) In: Néel F, Bouwhuis DG, Taylor MM (eds) The structure of multimodal dialoque II. An approach to oral dialogue modelling. John Benjamins Publishing Company, AmsterdamGoogle Scholar
  5. 5.
    Bledowski C, Kaiser J, Rahm B (2010) Basic operations in working memory: contributions from functional imaging studies. Behav Brain Res 214(2):172–179CrossRefGoogle Scholar
  6. 6.
    Boehm BW (1988) A spiral model of software development and enhancement. Computer 21(5):61–72CrossRefGoogle Scholar
  7. 7.
    Brouwer H, Fitz H, Hoeks J (2012) Getting real about semantic illusions: rethinking the functional role of the P600 in language comprehension. Brain Res 1446:127–143CrossRefGoogle Scholar
  8. 8.
    Casadio M, Giannoni P, Morasso PG, Sanguineti V (2009) A proof of concept study for the integration of robot therapy with physiotherapy in the treatment of stroke patients. Clin Rehabil 23(3):217–228CrossRefGoogle Scholar
  9. 9.
    Chomsky N (2000) New Horizons in the study of language and mind. Cambridge University Press, CambridgeGoogle Scholar
  10. 10.
    Chomsky N (2002) Syntactic Structures. Walter De GruyterGoogle Scholar
  11. 11.
    Christensen HI, Sloman A, Kruijff G-J, Wyatt JL (2010) Cognitive systems introduction. In: Christensen HI, Kruijff G-J, Wyatt JL (eds) Cognitive systems. Springer, Berlin, pp 3–50CrossRefGoogle Scholar
  12. 12.
    Clodic A, Alami R, Montreuil V, Li S, Wrede B, Swadzba A (2007) A study of interaction between dialog and decision for human-robot collaborative task achievement. In: Proceedings of the 16th IEEE international symposium on robot and human interactive communication, RO-MAN 2007, Jeju Island, Korea, pp 913–918Google Scholar
  13. 13.
    Cortés U, Annicchiarico R, Urdiales C, Barrué C, Martínez A, Villar A, Caltagirone C (2008) Supported Human Autonomy for Recovery and Enhancement of Cognitive and Motor Abilities Using Agent Technologies. Agent Technology and e-Health. Whitestein Series in Software Agent Technologies and Autonomic Computing, Birkhäuser Basel, pp 117–140CrossRefGoogle Scholar
  14. 14.
    Dahl G, Yu D, Deng L, Acero A (2012) Context-dependent pre-trained deep neural networks for large vocabulary speech recognition. IEEE Trans Audio Speech Lang Process 20(1):30–42CrossRefGoogle Scholar
  15. 15.
    Douglas-Cowie E, Cowie R, Sneddon I, Cox C, Lowry O, McRorie M, Martin JC, Devillers L, Batliner A (2007) The humaine database: addressing the needs of the affective. In: Paiva A, Prada R, Picard R (eds) 2nd international conference on affective computing and intelligent interaction (ACII’, 2007), LNCS, 4738. Lisbon, Portugal, pp 488–500Google Scholar
  16. 16.
    Esposito A, Vinciarelli A, Haykin S, Hussain A, Faundez-Zanuy M (2011) Cognitive behavioural systems. Cognit Comput 3(3):417–418CrossRefGoogle Scholar
  17. 17.
    Esposito A, Fortunati L, Lugano G (2014) Modeling emotion, behavior and context in socially believable robots and ICT interfaces. Cognit Comput 6(4):623–627CrossRefGoogle Scholar
  18. 18.
    Garlan D, Schmerl B (2007) The RADAR architecture for personal cognitive assistance. Int J Softw Eng Knowl Eng 17(2):171–190CrossRefGoogle Scholar
  19. 19.
    Gnjatović M (2014) Therapist-centered design of a robot’s dialogue behavior. Cognit Comput 6(4):775–788CrossRefGoogle Scholar
  20. 20.
    Gnjatović M (2014) Changing concepts of machine dialogue management. In: Proceedings of the 5th IEEE international conference on cognitive infocommunications, cogInfoCom 2014, Vietri sul Mare, Italy, pp 367–372Google Scholar
  21. 21.
    Gnjatović M, Delić V (2012) A cognitively-inspired method for meaning representation in dialogue systems. In: Proceedings of the 3th IEEE international conference on cognitive infocommunications, cogInfoCom 2012, Košice, Slovakia, pp 383–388Google Scholar
  22. 22.
    Gnjatović M, Delić V (2013) Electrophysiologically-inspired evaluation of dialogue act complexity. In: Proceedings of the 4th IEEE international conference on cognitive infocommunications, cogInfoCom 2013, Budapest, Hungary, pp 167–172Google Scholar
  23. 23.
    Gnjatović M, Delić V (2014) Cognitively-inspired representational approach to meaning in machine dialogue. Knowl-Based Syst 71:25–33CrossRefGoogle Scholar
  24. 24.
    Gnjatović M, Janev M, Delić V (2012) Focus tree: modeling attentional information in task-oriented human-machine interaction. Appl Intell 37(3):305–320CrossRefGoogle Scholar
  25. 25.
    Grosz B, Sidner C (1986) Attention, intentions, and the structure of discourse. Comput Linguist 12(3):175–204Google Scholar
  26. 26.
    Hawes N, Wyatt JL, Sridharan M, Jacobsson H, Dearden R, Sloman A, Kruijff G-J (2010) Architecture and representations. In: Christensen HI, Kruijff G-J, Wyatt JL (eds) Cognit Syst. Springer, Berlin, pp 51–93CrossRefGoogle Scholar
  27. 27.
    Hermansky H (2006) A Brief description of DIRAC project. In: Proceedings of the euCognition inaugural meeting: the European network for the advancement of artificial cognitive systems, nice, FranceGoogle Scholar
  28. 28.
    Karreman DE, van Dijk EMAG, Evers V (2012) Using the visitor experiences for mapping the possibilities of implementing a robotic guide in outdoor sites. In: Proceedings of the 21st IEEE international symposium on robot and human interactive communication, RO-MAN 2012. France, Paris, pp 1059–1065Google Scholar
  29. 29.
    Kautz H, Fox D, Etzioni O, Borriello G, Arnstein L (2002) An Overview of the Assisted Cognition Project. In: Proc. of the AAAI (2002) workshop on Automation as Caregiver: The Role of Intelligent Technology in Elder Care. Edmonton, Alberta, Canada, pp 60–65Google Scholar
  30. 30.
    Kutas M, Federmeier KD (2000) Electrophysiology reveals semantic memory use in language comprehension. Trends Cognit Sci 4(12):463–470CrossRefGoogle Scholar
  31. 31.
    Leishman TR, Cook DA (2002) Requirements Risks Can Drown Software Projects. CrossTalk: The Journal of Defense Software Engineering (April 2002), pp 4–8Google Scholar
  32. 32.
    MacLeod CM (2007) The concept of inhibition in cognition. In: Gorfein DS, MacLeod CM (eds) Inhibition in cognition. American Psychological Association, Washington, pp 3–23CrossRefGoogle Scholar
  33. 33.
    Maheswaran R, Tambe M, Varakantham P, Myers K (2004) Adjustable autonomy challenges in personal assistant agents: a position paper. In: Nickles M, Rovatsos M, Weiss G (eds) Agents and computational autonomy. Lecture notes in computer science, vol 2969. Springer, Berlin, pp 187–194Google Scholar
  34. 34.
    Reithinger N, Gebhard P, Löckelt M, Ndiaye A, Pfleger N, Klesen M (2006) VirtualHuman—dialogic and emotional interaction with virtual characters. In: Proceedings of the eighth international conference on multimodal interfaces, (ICMI, 2006), Banff, CanadaGoogle Scholar
  35. 35.
    Rickheit G, Wachsmuth I (1996) Collaborative research centre situated artificial communicators at the University of Bielefeld, Germany. In: Mc Kevitt P (ed) Integration of natural language and vision processing, Springer, Netherlands, pp 165–170Google Scholar
  36. 36.
    Roulet E (1992) On the structure of conversation as negotiation. In: Parret H, Verschueren J (eds) (On) Searle on conversation. John Benjamins Publishing Company, Amsterdam, pp 91–99CrossRefGoogle Scholar
  37. 37.
    Saffran JR (2003) Statistical language learning: mechanisms and constraints. Curr Direct Psychol Sci 12(4):110–114CrossRefGoogle Scholar
  38. 38.
    Schegloff EA (1968) Sequencing in conversational openings. Am Anthropol 70:1075–1095CrossRefGoogle Scholar
  39. 39.
    Searle J (1992) Conversation. In: Parret H, Verschueren J (eds) (On) Searle on conversation. John Benjamins Publishing Company, Amsterdam, pp 7–29CrossRefGoogle Scholar
  40. 40.
    Searle J (2002) Consciousness. Consciousness and language. Cambridge University Press, CambridgeGoogle Scholar
  41. 41.
    Searle J (2002) The problem of consciousness. Consciousness and language. Cambridge University Press, CambridgeGoogle Scholar
  42. 42.
    Siekmann J, Crocker MW (2010) Resource-Adaptive Cognitive Processes. In: Crocker MW, Siekmann J (eds) Resource-adaptive cognitive processes. Springer-Verlag, Berlin Heidelberg, pp 1–10CrossRefGoogle Scholar
  43. 43.
    Sinclair J (2005) Corpus and text—basic principles. In: Wynne M (ed) Developing linguistic corpora: a guide to good practice. Oxbow Books, Oxford, pp 1–16Google Scholar
  44. 44.
    Tognini-Bonelli E (2001) Corpus linguistics at work. John Benjamins, AmsterdamCrossRefGoogle Scholar
  45. 45.
    Tomasello M (2003) Constructing a Language: A Usage-Based Theory of Language Acquisition. Harvard University Press, CambridgeGoogle Scholar
  46. 46.
    Turkle S (2011) Alone together: why we expect more from technology and less from each other. Basic Books, New YorkGoogle Scholar
  47. 47.
    Vergados D, Alevizos A, Mariolis A, Caragiozidis M (2008) Intelligent Services for Assisting Independent Living of Elderly People at Home. In: Proceedings of the international ACM conference on pervasive technologies related to assistive environments (PETRA’08), Athens, Greece, 79:1–4Google Scholar
  48. 48.
    Wahlster W (2003) SmartKom: Symmetric multimodality in an adaptive and reusable dialogue shell. In: Krahl R, Günther D (eds) Proceedings of the human computer interaction status conference 2003, Berlin, Germany, pp 47–62Google Scholar
  49. 49.
    Waibel A, Steusloff H, Stiefelhagen R, Watson K (2009) Computers in the human interaction loop. In: Waibel A, Stiefelhagen R (eds) Computers in the human interaction loop. Springer, London, pp 3–6CrossRefGoogle Scholar
  50. 50.
    Wendemuth A, Biundo S (2012) A companion technology for cognitive technical systems. In: Esposito A, Esposito AM, Vinciarelli A, Hoffmann R, Müller VC (eds) Cognitive behavioural systems, LNCS 7403. Springer, Berlin, pp 89–103CrossRefGoogle Scholar
  51. 51.
    Wendemuth A, Braun J, Michaelis B, Ohl F, Rösner D, Scheich H, Warnemünde R (2008) Neurobiologically inspired, multimodal intention recognition for technical communication systems (NIMITEK). In: Proceedings of the 4th IEEE tutorial and research workshop on perception and interactive technologies for speech-based systems, PIT 2008, Kloster Irsee, Germany, pp 141–144Google Scholar
  52. 52.
    Wilks Y (2007) Is there progress on talking sensibly to machines? Science 318(5852):927–928CrossRefGoogle Scholar
  53. 53.
    Wilks Y (2010) Introducing artificial companions. In: Wilks Y (ed) Close engagements with artificial companions. Key social, psychological, ethical and design issues. John Benjamins: Amsterdam, pp 11–20Google Scholar
  54. 54.
    Winkler I, Denham SL, Nelken I (2009) Modeling the auditory scene: predictive regularity representations and perceptual objects. Trends Cognit Sci 13(12):532–540CrossRefGoogle Scholar
  55. 55.
    Ziemke T (2008) On the role of emotion in biological and robotic autonomy. BioSystems 91:401–408CrossRefGoogle Scholar

Copyright information

© Springer International Publishing Switzerland 2016

Authors and Affiliations

  1. 1.Graduate School of Computer SciencesJohn Naisbitt UniveristyBelgradeSerbia
  2. 2.Faculty of Technical SciencesUniversity of Novi SadNovi SadSerbia
  3. 3.Faculty of Technical SciencesUniversity of Novi SadNovi SadSerbia

Personalised recommendations