Model Tree Learning for Query Term Weighting in Question Answering

Monz, Christof

doi:10.1007/978-3-540-71496-5_55

Christof Monz¹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4425))

Included in the following conference series:

European Conference on Information Retrieval

2063 Accesses
1 Citations

Abstract

Question answering systems rely on retrieval components to identify documents that contain an answer to a user’s question. The formulation of queries that are used for retrieving those documents has a strong impact on the effectiveness of the retrieval component. Here, we focus on predicting the importance of terms from the original question. We use model tree machine learning techniques in order to assign weights to query terms according to their usefulness for identifying documents that contain an answer. Incorporating the learned weights into a state-of-the-art retrieval system results in statistically significant improvements.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Agichtein, E., Lawrence, S., Gravano, L.: Learning to find answers to questions on the web. ACM Transactions on Internet Technology 4(2), 129–162 (2004)
Article Google Scholar
Brill, E., Dumais, S., Banko, M.: An analysis of the AskMSR question-answering system. In: Proceedings of Emperical Methods in Natural Language Processing (EMNLP 2002), pp. 257–264 (2002)
Google Scholar
Chen, H., et al.: A machine learning approach to inductive query by examples: An experiment using relevance feedback, ID3, genetic algorithms, and simulated annealing. Journal of the American Society for Information Science 49(8), 693–705 (1998)
Article Google Scholar
Cooper, W., Chen, A., Gey, F.: Full text retrieval based on probalistic equations with coefficients fitted by logistic regression. In: Proc. of the 2nd Text REtrieval Conference, pp. 57–66 (1993)
Google Scholar
Efron, B.: Bootstrap methods: Another look at the jackknife. Annals of Statistics 7(1), 1–26 (1979)
Article MATH MathSciNet Google Scholar
Frank, E., et al.: Naive bayes for regression. Machine Learning 41(1), 5–25 (2000)
Article Google Scholar
Lin, D.: Dependency-based evaluation of minipar. In: Proceedings of the Workshop on the Evaluation of Parsing Systems (1998)
Google Scholar
Lita, L.V., Carbonell, J.: Unsupervised question answering data aquisition from local corpora. In: Proceedings of the Thirteenth Conference on Information and Knowledge Management (CIKM 2004), pp. 607–614 (2004)
Google Scholar
Mayfield, J., McNamee, P.: JHU/APL at TREC 2005: QA retrieval and robust tracks. In: Voorhees, E.M., Buckland, L.P. (eds.) Proceedings of the Fourteenth Text REtrieval Conference (TREC 2005). NIST Special Publication: SP 500-266 (2005)
Google Scholar
Monz, C.: Document retrieval in the context of question answering. In: Sebastiani, F. (ed.) ECIR 2003. LNCS, vol. 2633, pp. 571–579. Springer, Heidelberg (2003)
Chapter Google Scholar
Paşca, M.: High-Performance Open-Domain Question Answering from Large Text Collections. PhD thesis, Southern Methodist University (2001)
Google Scholar
Quinlan, J.R.: Learning with continuous classes. In: Proceedings of the 5th Australian Joint Conference on Artificial Intelligence, pp. 343–348 (1992)
Google Scholar
Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann, San Francisco (1993)
Google Scholar
Ravichandran, D., Hovy, E.: Learning surface text patterns for a question answering system. In: Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, pp. 41–47 (2002)
Google Scholar
Wang, Y., Witten, I.H.: Induction of model trees for predicting continuous classes. In: Proceedings of the Poster Papers of the European Conference on Machine Learning (ECML), pp. 128–137 (1997)
Google Scholar
Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations. Morgan Kaufmann, San Francisco (1999)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, Queen Mary, University of London, Mile End Road, London E1 4NS, United Kingdom
Christof Monz

Authors

Christof Monz
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Giambattista Amati Claudio Carpineto Giovanni Romano

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Monz, C. (2007). Model Tree Learning for Query Term Weighting in Question Answering. In: Amati, G., Carpineto, C., Romano, G. (eds) Advances in Information Retrieval. ECIR 2007. Lecture Notes in Computer Science, vol 4425. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71496-5_55

Download citation

DOI: https://doi.org/10.1007/978-3-540-71496-5_55
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-71494-1
Online ISBN: 978-3-540-71496-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics