Modeling Task-Oriented Dialogue

Taboada, Maite

doi:10.1023/A:1025729107628

Published: November 2003

Computers and the Humanities, Volume 37, pages 431–454 (2003)

Abstract

A common tool for improving the performance quality of natural language processing systems is the use of contextual information for disambiguation. Here I describe the use of a finite state machine (FSM) to disambiguate speech acts in a machine translation system. The FSM has two layers that model, respectively, the global and local structures found in naturally-occurring conversations. The FSM has been modeled on a corpus of task-oriented dialogues in a travel planning situation. In the dialogues, one of the interactants is a travel agent or hotel clerk, and the other a client requesting information or services. A discourse processor based on the FSM was implemented in order to process contextual information in a machine translation system. Evaluation results show that the discourse processor is able to disambiguate and improve the quality of the dialogue translation. Other applications include human-computer interaction and computer-assisted language learning.
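
The abstract does not list the FSM's states or transitions, so the following Python sketch is only an illustration of the general idea: a global layer that tracks the phase of the conversation and a local layer that constrains which speech act may follow which. The phase names, the speech-act labels, the transition tables, and the `DialogueFSM` class are all hypothetical and are not taken from the article. Given a ranked list of candidate speech acts for an utterance, the sketch picks the first candidate that the FSM licenses in the current context.

```python
from typing import Dict, List, Optional, Set

START = "<start>"  # marker for "no speech act seen yet in this phase"

# Hypothetical global layer: phases of a travel-planning dialogue and
# the phases that may be active next (stay in place or move forward).
GLOBAL_NEXT: Dict[str, List[str]] = {
    "opening": ["opening", "task"],
    "task": ["task", "closing"],
    "closing": ["closing"],
}

# Hypothetical local layer: inside each phase, which speech acts may
# follow the previous one.
LOCAL_NEXT: Dict[str, Dict[str, Set[str]]] = {
    "opening": {START: {"greet"}, "greet": {"greet"}},
    "task": {
        START: {"request-info", "offer"},
        "request-info": {"give-info"},
        "give-info": {"accept", "reject", "request-info"},
        "offer": {"accept", "reject"},
        "accept": {"request-info", "offer"},
        "reject": {"request-info", "offer"},
    },
    "closing": {START: {"thank", "bye"}, "thank": {"thank", "bye"}, "bye": {"bye"}},
}


class DialogueFSM:
    """Illustrative two-layer FSM used to choose among candidate speech acts."""

    def __init__(self) -> None:
        self.phase = "opening"
        self.last_act = START

    def _licensing_phase(self, act: str) -> Optional[str]:
        """Return the phase in which `act` may occur next, or None."""
        for phase in GLOBAL_NEXT[self.phase]:
            # Entering a new phase restarts the local layer.
            prev = self.last_act if phase == self.phase else START
            if act in LOCAL_NEXT[phase].get(prev, set()):
                return phase
        return None

    def disambiguate(self, candidates: List[str]) -> str:
        """Pick the first candidate the FSM licenses and advance the state.

        If no candidate is licensed, fall back to the first candidate
        and leave the FSM state unchanged.
        """
        if not candidates:
            raise ValueError("at least one candidate speech act is required")
        for act in candidates:
            phase = self._licensing_phase(act)
            if phase is not None:
                self.phase, self.last_act = phase, act
                return act
        return candidates[0]
```

In this sketch, an utterance that a parser labels ambiguously as `accept` or `request-info` right after a greeting would be resolved to `request-info`, because the hypothetical tables do not allow an acceptance before anything has been offered.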

Author information

Authors and Affiliations

Department of Linguistics, Simon Fraser University, Burnaby, B.C., V5A 1S6, Canada
Maite Taboada

About this article

Cite this article

Taboada, M. Modeling Task-Oriented Dialogue. Computers and the Humanities 37, 431–454 (2003). https://doi.org/10.1023/A:1025729107628

Issue Date: November 2003
DOI: https://doi.org/10.1023/A:1025729107628
