Skip to main content
Log in

Modeling Task-Oriented Dialogue

  • Published:
Computers and the Humanities Aims and scope Submit manuscript

Abstract

A common tool for improving theperformance quality of natural languageprocessing systems is the use of contextualinformation for disambiguation. Here I describethe use of a finite state machine (FSM) todisambiguate speech acts in a machinetranslation system. The FSM has two layers thatmodel, respectively, the global and localstructures found in naturally-occurringconversations. The FSM has been modeled on acorpus of task-oriented dialogues in a travelplanning situation. In the dialogues, one ofthe interactants is a travel agent or hotelclerk, and the other a client requestinginformation or services. A discourse processorbased on the FSM was implemented in order toprocess contextual information in a machinetranslation system. Evaluation results showthat the discourse processor is able todisambiguate and improve the quality of thedialogue translation. Other applicationsinclude human-computer interaction andcomputer-assisted language learning.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Ahlen S. (1997) Enthusiast Data Collection. Language Technologies Institute Technical Report, Carnegie Mellon University and The University of Pittsburgh.

  • Alexandersson J., Engel R., Kipp M., Koch S., Küssner U., Reithinger N., Stede M. (2000a) Modeling Negotiation Dialogs. In Wahlster W. (ed.), Verbmobil: Foundations of Speech-to-Speech Translation. Springer, Berlin, pp. 441–451

    Google Scholar 

  • Alexandersson J., Poller P., Kipp M. (2000b) Generating Multilingual Dialog Summaries and Minutes. In Wahlster W. (ed.), Verbmobil: Foundations of Speech-to-Speech Translation. Springer, Berlin, pp. 507–518.

    Google Scholar 

  • Allen J., Core M. (1997) Draft of DAMSL: Dialog Act Markup in Several Layers. Draft produced by the Multiparty Discourse Group at the Discourse Research Initiative (DRI) meetings at the University of Pennsylvania and at Schloss Dagstuhl. (http://www.georgetown.edu/luperfoy/ Discourse-Treebank/dri-home.html).

  • Baker M. (2000) The Roles of Models in Artificial Intelligence and Education Research: A Prospective View. International Journal of Artificial Intelligence in Education, 11, pp. 122–143.

    Google Scholar 

  • Bakhtin M. (1986) Speech genres and Other Late Essays. University of Texas Press, Austin.

    Google Scholar 

  • Carletta J. (1996) Assessing Agreement on Classification Tasks: The Kappa statistic. Computational Linguistics, 22/2, pp. 249–254.

    Google Scholar 

  • Chafe W. (1994) Discourse, Consciousness and Time: The Flow and Displacement of Conscious Experience in Speaking and Writing. University of Chicago Press, Chicago.

    Google Scholar 

  • Conati C., Klawe M. (2002) Socially Intelligent Agents in Educational Games. In Dautenhahn K., Bond A., Cañamero D. and Edmonds B. (eds.), Socially Intelligent Agents: Creating Relationships with Computers and Robots. Kluwer Academic Publishers, Dordrecht, pp. 213–220.

    Google Scholar 

  • Core M., Ishizaki M., Moore J., Nakatani C., Reithinger N., Traum D., Tutiya S. (1999) Report of The Third Workshop of the Discourse Resource Initiative. Chiba Corpus Project Technical Report No. 3 (CC-TR-99–1), Department of Cognitive and Information Sciences, Chiba University, Japan.

    Google Scholar 

  • Elio R., Haddadi A., Singh A. (2000) Task Models for Agent Conversation Policies. Proceedings of Autonomous Agents-2000, pp. 229–230.

  • Fawcett R., van der Mije A., van Wissen C. (1988) Towards a Systemic Flowchart Model for Local Discourse Structure. In Fawcett R. and Young D. (eds.), New Developments in Systemic Linguistics, Vol. 2, Frances Pinter, London, pp. 116–143.

    Google Scholar 

  • FIPA (2001) Foundation for Intelligent Physical Agents ACL Message Structure Specification. Technical Report XC00061E.

  • Grosz B., Sidner C. (1986) Attentions, Intentions, and the Structure of Discourse. Computational Linguistics, 12/3, pp. 175–204.

    Google Scholar 

  • Halliday M.A.K. (1994) An Introduction to Functional Grammar (2nd edition). Edward Arnold, London.

    Google Scholar 

  • Halliday M.A.K., Martin J. (1993) Writing Science: Literacy and Discoursive Power. The Falmer Press, London.

    Google Scholar 

  • Hansen B., N ovick D., Sutton S. (1996) Systematic Design of Spoken-Dialogue Interfaces. Proceedings, Conference on Human Factors in Computing Systems (CHI'96), pp. 157–164.

  • Jekat S., Klein A., Maier E., Maleck I., Mast M., Quantz J.J. (1995) Dialogue Acts in Verbmobil. Verbmobil Technical Report.

  • Kipp M.J., Alexandersson R. Engel, Reithinger N. (2000) Dialog Processing. In Wahlster W. (ed.), Verbmobil: Foundations of Speech-to-Speech Translation. Springer, Berlin, pp. 452–465.

    Google Scholar 

  • Koch S., Küssner U., Stede M. (2000) Contextual Disambiguation. In Wahlster W. (ed.), Verbmobil: Foundations of Speech-to-Speech Translation. Springer, Berlin, pp. 466–477.

    Google Scholar 

  • Labrou Y., Finnin T., Peng Y. (1999) Agent Communication Languages: The Current Landscape. IEEE Intelligent Systems, 14/2, pp. 45–52.

    Google Scholar 

  • Labrou Y., Finnin T. (1997) A Proposal for a New KQML Specification. Technical Report, Computer Science and Electrical Engineering Department, University of Maryland Baltimore County.

  • Lambert L. (1993) Recognizing Complex Discourse Acts: A Tripartite Plan-Based Model of Dialogue. PhD Thesis, University of Delaware.

  • Lambert L., Carberry S. (1992) Modeling Negotiation Subdialogues. In Proceedings of 32nd Annual Meeting of the ACL.

  • Lavie A. (1995) A Grammar Based Robust Parser for Spontaneous Speech. PhD Thesis, Carnegie Mellon University, Pittsburgh, PA.

    Google Scholar 

  • Lavie A., Tomita M. (1993) GLR?: An Efficient Noise Skipping Parsing Algorithm for Context Free Grammars. Proceedings of the Third International Workshop on Parsing Technologies, IWPT 93, Tilburg, The Netherlands.

    Google Scholar 

  • Lavie A., Gates D., Coccaro N., Levin L. (1996a) Input Segmentation of Spontaneous Speech in JANUS: A Speech-to-Speech Translation System. Proceedings of ECAI 96, Budapest, Hungary.

    Google Scholar 

  • Lavie A., Gates D., Gavaldà M., Mayfield L., Waibel A., Levin L. (1996b) Multi-lingual Translation of Spontaneously Spoken Language in a Limited Domain. In Proceedings of COLING 96, Copenhagen.

  • Lavie A., Levin L., Zhan P., Taboada M., Gates D., Lapata M., Clark C., Broadhead M., Waibel A. (1997) Expanding the Domain of a Multi-lingual Speech-to-Speech Translation System. Proceedings of the Spoken Language Translation Workshop, 35th Annual Meeting of the Association for Computational Linguistics, ACL/EACL '97, Madrid, Spain, pp. 67–72.

  • Levin L., Ries K., Thymé-Gobbel A., Lavie A. (1999) Tagging of Speech Acts and Dialogue Games in Spanish Call Home. Proceedings, ACL '99 Workshop on Discourse Tagging.

  • Litman D., Allen J. (1990) Discourse Processing and Commonsense Plans. In Cohen P.R., Morgan J. and Pollack M.E. (eds.), Intentions in Communication. MIT Press, Cambridge,MA, pp. 365–388.

    Google Scholar 

  • Maier E. (1996) Context Construction as Subtask of Dialogue Processing: The Verbmobil Case. Proceedings of the Eleventh Twente Workshop on Language Technology, TWLT 11.

  • Martin J. (1992) English Text: System and Structure. John Benjamins, Philadelphia/Amsterdam.

    Google Scholar 

  • Mayfield L., Gavaldà M., Seo Y-H., Suhm B., Ward W., Waibel A. (1995) Parsing Real Input in JANUS: A Concept-Based Approach. Proceedings of TMI 95.

  • Moran T., Dourish P. (2001) Introduction, Special Issue on Context-Aware Computing. Human Computer Interaction, 16/(2–4), pp. 87–96.

    Google Scholar 

  • Ney H., Essen U., Kneser R. (1994) On Structuring Probabilistic Dependencies in Stochastic Language Modelling. Computer Speech and Language, 8, pp. 1–38.

    Google Scholar 

  • O'Donnell M. (1990) A Dynamic Model of Exchange. Word, 41/3, pp. 293–327.

    Google Scholar 

  • Qu Y., Di Eugenio B., Lavie A., Levin L., Rosé C.P. (1996a) Minimizing Cumulative Error in Discourse Context. Proceedings of ECAI 96, Budapest, Hungary.

    Google Scholar 

  • Qu Y., Rosé C.P., Di Eugenio B. (1996b) Using Discourse Predictions for Ambiguity Resolution. Proceedings of COLING 96, Copenhagen.

  • Reithinger N., Maier E. (1995) Utilizing Statistical Dialogue Act Processing in Verbmobil. Proceedings of ACL.

  • Reithinger N., Maier E., Alexandersson J. (1995) Treatment of Incomplete Dialogues in a Speech-to-Speech Translation System. Proceedings of the ESCA Workshop on Spoken Dialogue Systems, Denmark.

  • Rosé C.P., Qu Y. (1996) Discourse Information for Disambiguation. Manuscript, Carnegie Mellon University, Pittsburgh, PA.

    Google Scholar 

  • Rosé C.P., Di Eugenio B., Levin L., Van Ess-Dykema C. (1995) Discourse Processing of Dialogues with Multiple Threads. Proceedings of ACL, Boston, MA.

    Google Scholar 

  • Sacks H., Schegloff E., Jefferson G. (1974) A Simplest Systematics for the Organization of Turntaking for Conversation. Language, 50, pp. 696–735.

    Google Scholar 

  • Schegloff E., Sacks H. (1973) Opening up Closings. Semiotica, 7, pp. 289–327.

    Google Scholar 

  • Schmitz B., Quantz J.J. (1995) Dialogue Acts in Automatic Dialogue Interpreting. Proceedings, 6th Conference on Theoretical and Methodological Issues in Machine Translation, pp. 33–47.

  • Searle J. (1996 [1979]) A Taxonomy of Illocutionary Acts. Reprinted in Martinich A. (ed.), The Philosophy of Language (3rd edition). Oxford University Press, New York.

    Google Scholar 

  • Sinclair J., Coulthard M. (1975) Towards an Analysis of Discourse: The English Used by Teachers and Pupils. OUP, Oxford.

    Google Scholar 

  • Stolke A., Ries K., Coccaro N., Shriberg E., Bates R., Jurafsky D., Taylor P., Martin R., Van Ess-Dykema C., Meteer M. (2000) Dialogue Act Modeling for Automatic Tagging and Recognition of Conversational Speech. Computational Linguistics, 26/3, pp. 339–373.

    Google Scholar 

  • Taboada M. (1997) Discourse Information for Disambiguation: The Phoenix Approach in Janus. M.Sc. Thesis, Carnegie Mellon University, Pittsburgh, PA.

    Google Scholar 

  • Ventola E. (1987) The Structure of Social Interaction: A Systemic Approach to the Semiotics of Service Encounters. Pinter Publishers, London.

    Google Scholar 

  • Waibel A. (1996) Interactive Translation of Conversational Speech. IEEE Computer Society, 29/7.

  • Ward W. (1991) Understanding Spontaneous Speech: the Phoenix System. Proceedings of ICASSP.

  • Ward W. (1994) Extracting Information in Spontaneous Speech. Proceedings of ICSLP.

  • Yngve V. (1970) On Getting a Word in Edgewise. Papers from the Sixth Regional Meeting of the Chicago Linguistics Society. Chicago Linguistics Society, Chicago.

    Google Scholar 

  • Ziegler J. (2002) Modeling Cooperative Work Processes: A Multiple Perspectives Framework. International Journal of Human-Computer Interaction, 14/2, pp. 139–157.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Taboada, M. Modeling Task-Oriented Dialogue. Computers and the Humanities 37, 431–454 (2003). https://doi.org/10.1023/A:1025729107628

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1023/A:1025729107628

Navigation