Abstract
In the present work we used a word clustering algorithm based on the perplexity criterion, in a Dialogue Act detection framework in order to model the structure of the speech of a user at a dialogue system. Specifically, we constructed an n-gram based model for each target Dialogue Act, computed over the word classes. Then we evaluated the performance of our dialogue system on ten different types of dialogue acts, using an annotated database which contains 1,403,985 unique words. The results were very promising since we achieved about 70% of accuracy using trigram based models.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Gianchin, E., Mc Glashan, S.: Corpus-based Methods in Speech Processing. Kluwer Academic, Dordrecht (1997)
Stolke, A., Coccaro, N., Bates, R., Taylor, P., van Ess-Dykema, C., Ries, K., Shriberg, E., Jurafsky, D., Martin, R., Meteer, M.: Dialogue Act Modeling for Automatic Tagging and Recognition of Conversational Speech, Computational Linguistics 26(3)
Alshawi, H.: Effective Utterance Classification with Unsupervised Phonotactic Models. In: Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology, Edmonton, Canada, vol. 1, pp. 1–7 (2003)
Grau, S., Sanchis, E., Castro, M.J., Vilar, D.: Dialogue Act Classification Using a Bayesian Approach. In: Proceedings of the 9th International Conference Speech and Computer, pp. 495–499 (2004)
Nagata, M., Morimoto, T.: First steps toward statistical modeling of dialogue to predict the speech act type of the next utterance, Speech Communication, 15 (1994)
Fernandez, R., Ginzburg, J., Lappin, S.: Using Machine Learning for Non-Sentential Utterance Classification. In: Proceedings of the 6th SIGdial Workshop on Discourse and Dialogue, Lisbon, Portugal, pp. 77–86 (2005)
Reithinger, N., Engel, R., Kipp, M., Klesen, M.: Predicting Dialogue Acts for a speech to speech translation system. In: Proceedings of the International Conference on Spoken Language Processing, Philadelphia, vol. 2, pp. 654–657 (1996)
Lendvai, P., van den Bosch, A., Krahmer, E.: Machine Learning for Shallow Interpretation of User Utterances in Spoken Dialogue Systems. In: Proceedings of the EACL-03 Workshop on Dialogue Systems: Interaction, Adaptation and Styles of Management, Budapest, Hungary, pp. 69–78 (2003)
Nagata, M.: Using pragmatics to rule out recognition errors in cooperative task-oriented dialogues. In: Proceedings of the International Conference on Spoken Language Processing, Banff, Canada, vol. 1, pp. 647–650 (1992)
Yoshimura, T., Hayamizu, S., Ohmura, H., Tanaka, K.: Pitch pattern clustering of user utterances in human-machine dialogue. In: Proceedings of the International Conference on Spoken Language Processing, Philadelphia, vol. 2, pp. 837–840 (1996)
Martin, S., Liermann, J., Ney, H.: Algorithms for bigram and trigram word clustering. Speech Communication 24 (1998)
Prasad, R., Walker, M.: 2000 Communicator Dialogue Act Tagged. Linguistic Data Consortium, Philadelphia (2002)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Mporas, I., Lyras, D.P., Sgarbas, K.N., Fakotakis, N. (2007). Detection of Dialogue Acts Using Perplexity-Based Word Clustering. In: Matoušek, V., Mautner, P. (eds) Text, Speech and Dialogue. TSD 2007. Lecture Notes in Computer Science(), vol 4629. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74628-7_82
Download citation
DOI: https://doi.org/10.1007/978-3-540-74628-7_82
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74627-0
Online ISBN: 978-3-540-74628-7
eBook Packages: Computer ScienceComputer Science (R0)