Abstract
In ad hoc information retrieval (IR), some information need (e.g., find the advantages and disadvantages of smoking) requires the explicit identification of information related to the discourse type (e.g., advantages/ disadvantages) as well as to the topic (e.g., smoking). Such information need is not uncommon and may not be satisfied by using conventional retrieval methods. We extend existing retrieval models by adding a re-ranking strategy based on a novel graph-based retrieval model using document contexts that are called information units (IU). For evaluation, we focused on a discourse type that appeared in a subset of TREC topics where the retrieval effectiveness achieved by our conventional retrieval models for those topics was low. We showed that our approach is able to enhance the retrieval effectiveness for the selected TREC topics. This shows that our preliminary investigation is promising and deserves further investigation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Johnstone, B.: Discourse Analysis. Blackwell Publishing Ltd., Malden (2002)
Voorhees, E.: The TREC robust retrieval track. ACM SIGIR Forum 39(1), 11–20 (2005)
Li, X., Roth, D.: Learning question classifiers. In: Proceedings of COLING, pp. 556–562 (2002)
Kwong, Y., Luk, R., Lam, W., Ho, K., Chung, F.: Passage-based retrieval based on parameterized fuzzy operators. In: ACM SIGIR Workshop on Mathematical/Formal Methods for IR (2004)
Wu, H.C., Luk, R.W.P., Wong, K.F., Kwok, K.L., Li, W.J.: A retrospective study of probabilistic context-based retrieval. In: Proceedings of the 28th ACM SIGIR, pp. 663–664 (2005)
Brooks, H.M., Belkin, N.J.: Using discourse analysis for the design of information retrieval interaction mechanisms. In: Proceedings of the 6th ACM SIGIR, pp. 31–47 (1983)
Webber, B., Stone, M., Joshi, A., Knott, A.: Anaphora and Discourse Structure. Computational Linguistics 29(4), 545–587 (2003)
Knott, A.: A data-driven methodology for motivating a set of coherence relations. PhD thesis, University of Edinburgh (1996)
Hutchinson, B.: The Automatic Acquisition of Knowledge about Discourse Connectives. PhD thesis, University of Edinburgh (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wang, D.Y., Luk, R.W.P., Wong, K.F., Kwok, K.L. (2006). An Information Retrieval Approach Based on Discourse Type. In: Kop, C., Fliedl, G., Mayr, H.C., Métais, E. (eds) Natural Language Processing and Information Systems. NLDB 2006. Lecture Notes in Computer Science, vol 3999. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11765448_18
Download citation
DOI: https://doi.org/10.1007/11765448_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-34616-6
Online ISBN: 978-3-540-34617-3
eBook Packages: Computer ScienceComputer Science (R0)