Abstract
The User-over-Ranking hypothesis states that the best retrieval performance can be achieved with queries returning about as many results as can be considered at user site[21]. We apply this hypothesis to Lee et al.’s problem of automatically formulating a single promising query from a given set of keywords[16]. Lee et al.’s original approach requires unrestricted access to the retrieval system’s index and manual parameter tuning for each keyword set. Their approach is not applicable on larger scale, not to mention web search scenarios. By applying the User-over-Ranking hypothesis we overcome this restriction and present a fully automatic user-site heuristic for web query formulation from given keywords. Substantial performance gains of up to 60% runtime improvement over previous approaches for similar problems underpin the value of our approach.
Extended version of a paper presented at the TIR 2010 workshop[7].
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Balasubramanian, N., Kumaran, G., Carvalho, V.R.: Exploring reductions for long web queries. In: Proceedings of SIGIR 2010, pp. 571–578 (2010)
Bar-Yossef, Z., Gurevich, M.: Random sampling from a search engine’s index. Journal of the ACM 55(5) (2008)
Barker, K., Cornacchia, N.: Using noun phrase heads to extract document keyphrases. In: Proceedings of AI 2000, pp. 40–52 (2000)
Bendersky, M., Croft, W.B.: Discovering key concepts in verbose queries. In: Proceedings of SIGIR 2008, pp. 491–498 (2008)
Carmel, D., Yom-Tov, E., Darlow, A., Pelleg, D.: What makes a query difficult? In: Proceedings of SIGIR 2006, pp. 390–397 (2006)
Cronen-Townsend, S., Zhou, Y., Croft, W.B.: Predicting query performance. In: Proceedings of SIGIR 2002, pp. 299–306 (2002)
Hagen, M., Stein, B.: Search strategies for keyword-based queries. In: Proceedings of DEXA 2010 Workshop TIR 2010, pp. 37–41 (2010)
Hagen, M., Rüb, T., Stein, B.: Query session detection as a cascade. In: Proceedings of ECIR 2011 Workshop SIR 2011 (2011)
Hauff, C., Hiemstra, D., de Jong, F.: A survey of pre-retrieval query performance predictors. In: Proceedings of CIKM 2008, pp. 1419–1420 (2008)
He, B., Ounis, I.: Inferring query performance using pre-retrieval predictors. In: Apostolico, A., Melucci, M. (eds.) SPIRE 2004. LNCS, vol. 3246, pp. 43–54. Springer, Heidelberg (2004)
Huston, S., Croft, W.B.: Evaluating verbose query processing techniques. In: Proceedings of SIGIR 2010, pp. 291–298 (2010)
Kumaran, G., Allan, J.: Adapting information retrieval systems to user queries. Information Processing and Management 44(6), 1838–1862 (2008)
Kumaran, G., Allan, J.: Effective and efficient user interaction for long queries. In: Proceedings of SIGIR 2008, pp. 11–18 (2008)
Kumaran, G., Carvalho, V.R.: Reducing long queries using query quality predictors. In: Proceedings of SIGIR 2009, pp. 564–571 (2009)
Lease, M., Allan, J., Croft, W.B.: Regression rank: Learning to meet the opportunity of descriptive queries. In: Boughanem, M., Berrut, C., Mothe, J., Soule-Dupuy, C. (eds.) ECIR 2009. LNCS, vol. 5478, pp. 90–101. Springer, Heidelberg (2009)
Lee, C.-J., Chen, R.-C., Kao, S.-H., Cheng, P.-J.: A term dependency-based approach for query terms ranking. In: Proceedings of CIKM 2009, pp. 1267–1276 (2009)
Luo, G., Tang, C., Yang, H., Wei, X.: MedSearch: a specialized search engine for medical information retrieval. In: Proceedings of CIKM 2008, pp. 143–152 (2008)
Pass, G., Chowdhury, A., Torgeson, C.: A picture of search. In: Proceedings of Infoscale, paper. 1 (2006)
Shapiro, J., Taksa, I.: Constructing web search queries from the user’s information need expressed in a natural language. In: Proceedings of SAC 2003, pp. 1157–1162 (2003)
Stein, B., Hagen, M.: Making the most of a web search session. In: Proceedings of WI-IAT 2010, pp. 90–97 (2010)
Stein, B., Hagen, M.: Introducing the user-over-ranking hypothesis. In: Clough, P., Foley, C., Gurrin, C., Jones, G.J.F., Kraaij, W., Lee, H., Mudoch, V. (eds.) ECIR 2011. LNCS, vol. 6611, pp. 503–509. Springer, Heidelberg (2011)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hagen, M., Stein, B. (2011). Applying the User-over-Ranking Hypothesis to Query Formulation. In: Amati, G., Crestani, F. (eds) Advances in Information Retrieval Theory. ICTIR 2011. Lecture Notes in Computer Science, vol 6931. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23318-0_21
Download citation
DOI: https://doi.org/10.1007/978-3-642-23318-0_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23317-3
Online ISBN: 978-3-642-23318-0
eBook Packages: Computer ScienceComputer Science (R0)