Separating Representation, Reasoning, and Implementation for Interaction Management: Lessons from Automated Planning

Foster, Mary Ellen; Petrick, Ronald P. A.

doi:10.1007/978-981-10-2585-3_7

Mary Ellen Foster &
Ronald P. A. Petrick

Chapter
First Online: 25 December 2016

Part of the book series: Lecture Notes in Electrical Engineering (LNEE, volume 427)

Abstract

Numerous toolkits are available for developing speech-based dialogue systems. We survey a range of currently available toolkits, highlighting the different facilities provided by each. Most of these toolkits include not only a method for representing states and actions, but also a mechanism for reasoning about and selecting the actions, often combined with a technical framework designed to simplify the task of creating end-to-end systems. This near-universal tight coupling of representation, reasoning, and implementation in a single toolkit makes it difficult both to compare different approaches to dialogue system design and to analyse the properties of individual techniques. We contrast this situation with the state of the art in a related research area, automated planning, where a set of common representations has been defined and is widely used to enable direct comparison of different reasoning approaches. We argue that adopting a similar separation would greatly benefit the dialogue research community.
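
The separation argued for in the abstract can be illustrated with a minimal sketch. It is not taken from the chapter: the toy drink-ordering domain, the `Action` type, and the functions `bfs_plan`, `dfs_plan`, and `next_action` are all hypothetical names introduced here for illustration. The domain is written once as plain data (states as sets of facts, actions with preconditions and effects), and two interchangeable reasoners operate on that same representation. The `next_action` helper shows one way a planner that produces a complete action sequence might also be used to select only the next system action.

```python
from collections import deque
from typing import Callable, FrozenSet, List, NamedTuple, Optional

# --- Representation (hypothetical toy domain, independent of any reasoner) ---
class Action(NamedTuple):
    name: str
    preconditions: FrozenSet[str]
    add_effects: FrozenSet[str]
    delete_effects: FrozenSet[str]

State = FrozenSet[str]

ACTIONS = [
    Action("greet", frozenset(), frozenset({"greeted"}), frozenset()),
    Action("ask_drink", frozenset({"greeted"}), frozenset({"order_known"}), frozenset()),
    Action("serve", frozenset({"order_known"}), frozenset({"served"}), frozenset()),
]
INITIAL: State = frozenset()
GOAL: State = frozenset({"served"})

def applicable(state: State, action: Action) -> bool:
    return action.preconditions <= state

def apply_action(state: State, action: Action) -> State:
    return (state - action.delete_effects) | action.add_effects

# --- Reasoning (two interchangeable planners over the same representation) ---
def bfs_plan(initial: State, goal: State, actions: List[Action]) -> Optional[List[Action]]:
    """Breadth-first search; returns a shortest plan, or None if the goal is unreachable."""
    frontier = deque([(initial, [])])
    visited = {initial}
    while frontier:
        state, plan = frontier.popleft()
        if goal <= state:
            return plan
        for action in actions:
            if applicable(state, action):
                successor = apply_action(state, action)
                if successor not in visited:
                    visited.add(successor)
                    frontier.append((successor, plan + [action]))
    return None

def dfs_plan(initial: State, goal: State, actions: List[Action]) -> Optional[List[Action]]:
    """Depth-first search; returns some plan (not necessarily shortest), or None."""
    stack = [(initial, [])]
    visited = set()
    while stack:
        state, plan = stack.pop()
        if goal <= state:
            return plan
        if state in visited:
            continue
        visited.add(state)
        for action in actions:
            if applicable(state, action):
                stack.append((apply_action(state, action), plan + [action]))
    return None

Planner = Callable[[State, State, List[Action]], Optional[List[Action]]]

def next_action(planner: Planner, state: State, goal: State, actions: List[Action]) -> Optional[str]:
    """Select only the next action by taking the first step of a complete plan."""
    plan = planner(state, goal, actions)
    return plan[0].name if plan else None

if __name__ == "__main__":
    for planner in (bfs_plan, dfs_plan):
        plan = planner(INITIAL, GOAL, ACTIONS)
        print(planner.__name__, [a.name for a in plan])
    print(next_action(bfs_plan, frozenset({"greeted"}), GOAL, ACTIONS))
```

Because both planners take the same domain description, swapping one for the other requires no change to the representation, which is the kind of direct comparison the abstract attributes to automated planning.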

Notes

1. This differs somewhat from the task of interaction management, where the goal is (usually) to find the next system action, rather than a complete action sequence. However, we note that a system that is able to achieve the latter can also be used in the former context.
2. http://www.icaps-conference.org/
3. http://planning.domains/
4. http://jamesproject.eu/

Acknowledgements

This research has been partially funded by the European Union’s Seventh Framework Programme for research, technological development and demonstration under grant no. 270435 (JAMES, http://james-project.eu/) and grant no. 610917 (STAMINA, http://stamina-robot.eu/), and by the European Union’s Horizon 2020 research and innovation programme under grant no. 688147 (MuMMER, http://mummer-project.eu/).

Author information

Authors and Affiliations

School of Computing Science, University of Glasgow, Glasgow, UK
Mary Ellen Foster
Department of Computer Science, Heriot-Watt University, Edinburgh, UK
Ronald P. A. Petrick

Corresponding author

Correspondence to Mary Ellen Foster.

Editor information

Editors and Affiliations

Institute of Behavioural Sciences, University of Helsinki, Helsinki, Finland
Kristiina Jokinen
University of Helsinki, Helsinki, Finland
Graham Wilcock

About this chapter

Cite this chapter

Foster, M.E., Petrick, R.P.A. (2017). Separating Representation, Reasoning, and Implementation for Interaction Management: Lessons from Automated Planning. In: Jokinen, K., Wilcock, G. (eds) Dialogues with Social Robots. Lecture Notes in Electrical Engineering, vol 427. Springer, Singapore. https://doi.org/10.1007/978-981-10-2585-3_7

DOI: https://doi.org/10.1007/978-981-10-2585-3_7
Published: 25 December 2016
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-2584-6
Online ISBN: 978-981-10-2585-3
eBook Packages: Engineering, Engineering (R0)
