Abstract
This article compares one-dimensional and multi-dimensional dialogue act tagsets used for automatic labeling of utterances. The influence of tagset dimensionality on tagging accuracy is first discussed theoretically, then based on empirical data from human and automatic annotations of large scale resources, using four existing tagsets: damsl, swbd-damsl, icsi-mrda and maltus. The Dominant Function Approximation proposes that automatic dialogue act taggers could focus initially on finding the main dialogue function of each utterance, which is empirically acceptable and has significant practical relevance.
References
AMI Project. (2005). Guidelines for dialogue act and addressee annotation. Augmented Multiparty Interaction Project Document, v. 1.0, 13 October 2005. http://www.corpus.amiproject.org.
Bunt, H. (2000). Dynamic interpretation and dialogue theory. In M. M. Taylor, F. Néel, & D. G. Bouwhuis (Eds.), The structure of multimodal dialogue II (pp. 139–166). Amsterdam: John Benjamins.
Bunt, H. (2005). A framework for dialogue act specification. In Fourth Workshop on Multimodal Semantic Representation. Tilburg.
Carletta, J., Isard, A., Isard, S., Kowtko, J. C., Doherty-Sneddon, G., & Anderson, A. H. (1997). The reliability of a dialogue structure coding scheme. Computational Linguistics, 23(1), 13–32.
Clark, A., & Popescu-Belis, A. (2004). Multi-level dialogue act tags. In Fifth SIGdial Workshop on Discourse and Dialogue (pp. 163–170). Cambridge, MA.
Core, M. G., & Allen, J. F. (1997). Coding dialogues with the DAMSL annotation scheme. In D. R. Traum (Ed.), Working Notes: AAAI Fall Symposium on Communicative Action in Humans and Machines (pp. 28–35). Menlo Park, CA.
Di Eugenio, B., & Glass, M. (2004). The kappa statistic: A second look. Computational Linguistics, 30(1), 95–101.
Di Eugenio, B., Jordan, P. W., Thomason, R. H., & Moore, J. D. (2000). The agreement process: An empirical investigation of human-human computer-mediated collaborative dialogues. International Journal of Human Computer Studies, 53(6), 1017–1076.
Doran, C., Aberdeen, J., Damianos, L., & Hirschman, L. (2003). Comparing several aspects of human-computer and human-human dialogues. In J. van Kuppevelt & R. W. Smith (Eds.), Current and new directions in discourse and dialogue (pp. 133–159). Dordrecht: Kluwer.
Geertzen, J., & Bunt, H. (2006). Measuring annotator agreement in a complex hierarchical dialogue act annotation scheme. In Seventh SIGdial Workshop on Discourse and Dialogue (pp. 126–133). Sydney.
Jurafsky, D., Shriberg, E., Fox, B., & Curl, T. (1998). Lexical, prosodic, and syntactic cues for dialog acts. In Coling-ACL 1998 Workshop on Discourse Relations and Discourse Markers (pp. 114–120). Montreal.
Lesch, S., Kleinbauer, T., & Alexandersson, J. (2005). Towards a decent recognition rate for the automatic classification of a multidimensional dialogue act tagset. In Fourth Workshop on Knowledge and Reasoning in Practical Dialogue Systems (pp. 46–53). Edinburgh.
Manning, C. D., & Schütze, H. (1999). Foundations of statistical natural language processing. Cambridge, MA: The MIT Press.
Popescu-Belis, A. (2005). Dialogue act tagsets: One or more dimensions? ISSCO Working Paper 62, University of Geneva.
Samuel, K. (1999). Discourse learning: An investigation of dialogue act tagging using transformation-based learning. Ph.D. thesis, University of Delaware, Department of Computer and Information Sciences.
Shriberg, E., Dhillon, R., Bhagat, S., Ang, J., & Carvey, H. (2004). The ICSI Meeting Recorder Dialog Act (MRDA) Corpus. In Fifth SIGdial Workshop on Discourse and Dialogue (pp. 97–100). Cambridge, MA.
Traum, D. R. (2000). 20 Questions for dialogue act taxonomies. Journal of Semantics, 17(1), 7–30.
Traum, D. R., & Hinkelman, E. A. (1992). Conversation acts in task-oriented spoken dialogue. Computational Intelligence, 8(3), 575–599.
Webb, N., Hepple, M., & Wilks, Y. (2005). Dialogue act classification based on intra-utterance features. In Fifth AAAI Workshop on Spoken Language Understanding. Pittsburgh, PA.
Acknowledgements
This work has been supported by the Swiss National Science Foundation within the IM2 NCCR on Interactive Multimodal Information Management \(({\tt www.im2.ch}).\) The author would like to thank Jean Carletta, Hannah Carvey, Alex Clark, Liz Shriberg, Sandrine Zufferey, and the anonymous reviewers for their helpful comments on this paper.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Popescu-Belis, A. Dimensionality of dialogue act tagsets. Lang Resources & Evaluation 42, 99–107 (2008). https://doi.org/10.1007/s10579-008-9063-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10579-008-9063-y