Abstract
A common tool for improving theperformance quality of natural languageprocessing systems is the use of contextualinformation for disambiguation. Here I describethe use of a finite state machine (FSM) todisambiguate speech acts in a machinetranslation system. The FSM has two layers thatmodel, respectively, the global and localstructures found in naturally-occurringconversations. The FSM has been modeled on acorpus of task-oriented dialogues in a travelplanning situation. In the dialogues, one ofthe interactants is a travel agent or hotelclerk, and the other a client requestinginformation or services. A discourse processorbased on the FSM was implemented in order toprocess contextual information in a machinetranslation system. Evaluation results showthat the discourse processor is able todisambiguate and improve the quality of thedialogue translation. Other applicationsinclude human-computer interaction andcomputer-assisted language learning.
Similar content being viewed by others
References
Ahlen S. (1997) Enthusiast Data Collection. Language Technologies Institute Technical Report, Carnegie Mellon University and The University of Pittsburgh.
Alexandersson J., Engel R., Kipp M., Koch S., Küssner U., Reithinger N., Stede M. (2000a) Modeling Negotiation Dialogs. In Wahlster W. (ed.), Verbmobil: Foundations of Speech-to-Speech Translation. Springer, Berlin, pp. 441–451
Alexandersson J., Poller P., Kipp M. (2000b) Generating Multilingual Dialog Summaries and Minutes. In Wahlster W. (ed.), Verbmobil: Foundations of Speech-to-Speech Translation. Springer, Berlin, pp. 507–518.
Allen J., Core M. (1997) Draft of DAMSL: Dialog Act Markup in Several Layers. Draft produced by the Multiparty Discourse Group at the Discourse Research Initiative (DRI) meetings at the University of Pennsylvania and at Schloss Dagstuhl. (http://www.georgetown.edu/luperfoy/ Discourse-Treebank/dri-home.html).
Baker M. (2000) The Roles of Models in Artificial Intelligence and Education Research: A Prospective View. International Journal of Artificial Intelligence in Education, 11, pp. 122–143.
Bakhtin M. (1986) Speech genres and Other Late Essays. University of Texas Press, Austin.
Carletta J. (1996) Assessing Agreement on Classification Tasks: The Kappa statistic. Computational Linguistics, 22/2, pp. 249–254.
Chafe W. (1994) Discourse, Consciousness and Time: The Flow and Displacement of Conscious Experience in Speaking and Writing. University of Chicago Press, Chicago.
Conati C., Klawe M. (2002) Socially Intelligent Agents in Educational Games. In Dautenhahn K., Bond A., Cañamero D. and Edmonds B. (eds.), Socially Intelligent Agents: Creating Relationships with Computers and Robots. Kluwer Academic Publishers, Dordrecht, pp. 213–220.
Core M., Ishizaki M., Moore J., Nakatani C., Reithinger N., Traum D., Tutiya S. (1999) Report of The Third Workshop of the Discourse Resource Initiative. Chiba Corpus Project Technical Report No. 3 (CC-TR-99–1), Department of Cognitive and Information Sciences, Chiba University, Japan.
Elio R., Haddadi A., Singh A. (2000) Task Models for Agent Conversation Policies. Proceedings of Autonomous Agents-2000, pp. 229–230.
Fawcett R., van der Mije A., van Wissen C. (1988) Towards a Systemic Flowchart Model for Local Discourse Structure. In Fawcett R. and Young D. (eds.), New Developments in Systemic Linguistics, Vol. 2, Frances Pinter, London, pp. 116–143.
FIPA (2001) Foundation for Intelligent Physical Agents ACL Message Structure Specification. Technical Report XC00061E.
Grosz B., Sidner C. (1986) Attentions, Intentions, and the Structure of Discourse. Computational Linguistics, 12/3, pp. 175–204.
Halliday M.A.K. (1994) An Introduction to Functional Grammar (2nd edition). Edward Arnold, London.
Halliday M.A.K., Martin J. (1993) Writing Science: Literacy and Discoursive Power. The Falmer Press, London.
Hansen B., N ovick D., Sutton S. (1996) Systematic Design of Spoken-Dialogue Interfaces. Proceedings, Conference on Human Factors in Computing Systems (CHI'96), pp. 157–164.
Jekat S., Klein A., Maier E., Maleck I., Mast M., Quantz J.J. (1995) Dialogue Acts in Verbmobil. Verbmobil Technical Report.
Kipp M.J., Alexandersson R. Engel, Reithinger N. (2000) Dialog Processing. In Wahlster W. (ed.), Verbmobil: Foundations of Speech-to-Speech Translation. Springer, Berlin, pp. 452–465.
Koch S., Küssner U., Stede M. (2000) Contextual Disambiguation. In Wahlster W. (ed.), Verbmobil: Foundations of Speech-to-Speech Translation. Springer, Berlin, pp. 466–477.
Labrou Y., Finnin T., Peng Y. (1999) Agent Communication Languages: The Current Landscape. IEEE Intelligent Systems, 14/2, pp. 45–52.
Labrou Y., Finnin T. (1997) A Proposal for a New KQML Specification. Technical Report, Computer Science and Electrical Engineering Department, University of Maryland Baltimore County.
Lambert L. (1993) Recognizing Complex Discourse Acts: A Tripartite Plan-Based Model of Dialogue. PhD Thesis, University of Delaware.
Lambert L., Carberry S. (1992) Modeling Negotiation Subdialogues. In Proceedings of 32nd Annual Meeting of the ACL.
Lavie A. (1995) A Grammar Based Robust Parser for Spontaneous Speech. PhD Thesis, Carnegie Mellon University, Pittsburgh, PA.
Lavie A., Tomita M. (1993) GLR?: An Efficient Noise Skipping Parsing Algorithm for Context Free Grammars. Proceedings of the Third International Workshop on Parsing Technologies, IWPT 93, Tilburg, The Netherlands.
Lavie A., Gates D., Coccaro N., Levin L. (1996a) Input Segmentation of Spontaneous Speech in JANUS: A Speech-to-Speech Translation System. Proceedings of ECAI 96, Budapest, Hungary.
Lavie A., Gates D., Gavaldà M., Mayfield L., Waibel A., Levin L. (1996b) Multi-lingual Translation of Spontaneously Spoken Language in a Limited Domain. In Proceedings of COLING 96, Copenhagen.
Lavie A., Levin L., Zhan P., Taboada M., Gates D., Lapata M., Clark C., Broadhead M., Waibel A. (1997) Expanding the Domain of a Multi-lingual Speech-to-Speech Translation System. Proceedings of the Spoken Language Translation Workshop, 35th Annual Meeting of the Association for Computational Linguistics, ACL/EACL '97, Madrid, Spain, pp. 67–72.
Levin L., Ries K., Thymé-Gobbel A., Lavie A. (1999) Tagging of Speech Acts and Dialogue Games in Spanish Call Home. Proceedings, ACL '99 Workshop on Discourse Tagging.
Litman D., Allen J. (1990) Discourse Processing and Commonsense Plans. In Cohen P.R., Morgan J. and Pollack M.E. (eds.), Intentions in Communication. MIT Press, Cambridge,MA, pp. 365–388.
Maier E. (1996) Context Construction as Subtask of Dialogue Processing: The Verbmobil Case. Proceedings of the Eleventh Twente Workshop on Language Technology, TWLT 11.
Martin J. (1992) English Text: System and Structure. John Benjamins, Philadelphia/Amsterdam.
Mayfield L., Gavaldà M., Seo Y-H., Suhm B., Ward W., Waibel A. (1995) Parsing Real Input in JANUS: A Concept-Based Approach. Proceedings of TMI 95.
Moran T., Dourish P. (2001) Introduction, Special Issue on Context-Aware Computing. Human Computer Interaction, 16/(2–4), pp. 87–96.
Ney H., Essen U., Kneser R. (1994) On Structuring Probabilistic Dependencies in Stochastic Language Modelling. Computer Speech and Language, 8, pp. 1–38.
O'Donnell M. (1990) A Dynamic Model of Exchange. Word, 41/3, pp. 293–327.
Qu Y., Di Eugenio B., Lavie A., Levin L., Rosé C.P. (1996a) Minimizing Cumulative Error in Discourse Context. Proceedings of ECAI 96, Budapest, Hungary.
Qu Y., Rosé C.P., Di Eugenio B. (1996b) Using Discourse Predictions for Ambiguity Resolution. Proceedings of COLING 96, Copenhagen.
Reithinger N., Maier E. (1995) Utilizing Statistical Dialogue Act Processing in Verbmobil. Proceedings of ACL.
Reithinger N., Maier E., Alexandersson J. (1995) Treatment of Incomplete Dialogues in a Speech-to-Speech Translation System. Proceedings of the ESCA Workshop on Spoken Dialogue Systems, Denmark.
Rosé C.P., Qu Y. (1996) Discourse Information for Disambiguation. Manuscript, Carnegie Mellon University, Pittsburgh, PA.
Rosé C.P., Di Eugenio B., Levin L., Van Ess-Dykema C. (1995) Discourse Processing of Dialogues with Multiple Threads. Proceedings of ACL, Boston, MA.
Sacks H., Schegloff E., Jefferson G. (1974) A Simplest Systematics for the Organization of Turntaking for Conversation. Language, 50, pp. 696–735.
Schegloff E., Sacks H. (1973) Opening up Closings. Semiotica, 7, pp. 289–327.
Schmitz B., Quantz J.J. (1995) Dialogue Acts in Automatic Dialogue Interpreting. Proceedings, 6th Conference on Theoretical and Methodological Issues in Machine Translation, pp. 33–47.
Searle J. (1996 [1979]) A Taxonomy of Illocutionary Acts. Reprinted in Martinich A. (ed.), The Philosophy of Language (3rd edition). Oxford University Press, New York.
Sinclair J., Coulthard M. (1975) Towards an Analysis of Discourse: The English Used by Teachers and Pupils. OUP, Oxford.
Stolke A., Ries K., Coccaro N., Shriberg E., Bates R., Jurafsky D., Taylor P., Martin R., Van Ess-Dykema C., Meteer M. (2000) Dialogue Act Modeling for Automatic Tagging and Recognition of Conversational Speech. Computational Linguistics, 26/3, pp. 339–373.
Taboada M. (1997) Discourse Information for Disambiguation: The Phoenix Approach in Janus. M.Sc. Thesis, Carnegie Mellon University, Pittsburgh, PA.
Ventola E. (1987) The Structure of Social Interaction: A Systemic Approach to the Semiotics of Service Encounters. Pinter Publishers, London.
Waibel A. (1996) Interactive Translation of Conversational Speech. IEEE Computer Society, 29/7.
Ward W. (1991) Understanding Spontaneous Speech: the Phoenix System. Proceedings of ICASSP.
Ward W. (1994) Extracting Information in Spontaneous Speech. Proceedings of ICSLP.
Yngve V. (1970) On Getting a Word in Edgewise. Papers from the Sixth Regional Meeting of the Chicago Linguistics Society. Chicago Linguistics Society, Chicago.
Ziegler J. (2002) Modeling Cooperative Work Processes: A Multiple Perspectives Framework. International Journal of Human-Computer Interaction, 14/2, pp. 139–157.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Taboada, M. Modeling Task-Oriented Dialogue. Computers and the Humanities 37, 431–454 (2003). https://doi.org/10.1023/A:1025729107628
Issue Date:
DOI: https://doi.org/10.1023/A:1025729107628