Explaining Query Modifications

Hollink, Vera; He, Jiyin; de Vries, Arjen

doi:10.1007/978-3-642-28997-2_1

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 7224)

Included in the following conference series:

European Conference on Information Retrieval

Abstract

In the course of a search session, searchers often modify their queries several times. In most previous work analyzing search logs, the addition of terms to a query is identified with query specification and the removal of terms with query generalization. By analyzing the result sets that motivated searchers to make modifications, we show that this interpretation is not always correct. In fact, our experiments indicate that in the majority of cases the modifications have the opposite functions. Terms are often removed to get rid of irrelevant results matching only part of the query and thus to make the result set more specific. Similarly, terms are often added to retrieve more diverse results. We propose an alternative interpretation of term additions and removals and show that it explains the deviant modification behavior that was observed.
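The distinction drawn in the abstract can be illustrated with a small sketch. It is not the authors' method: the function names, the toy documents, and the partial-match retrieval rule (a document is returned if it contains at least one query term) are all assumptions made for illustration. Under that assumed rule, removing a term shrinks the result set, which is the kind of behavior the abstract describes.

```python
def classify_modification(previous_query: str, next_query: str) -> str:
    """Label a query change by comparing the term sets of two consecutive queries."""
    prev_terms = set(previous_query.lower().split())
    next_terms = set(next_query.lower().split())
    if prev_terms == next_terms:
        return "unchanged"
    if prev_terms < next_terms:   # every old term kept, at least one new term
        return "addition"
    if next_terms < prev_terms:   # no new terms, at least one old term dropped
        return "removal"
    return "other"                # substitution or an unrelated query


def partial_match(query: str, documents: dict[str, str]) -> set[str]:
    """Toy retrieval (assumed): ids of documents containing at least one query term."""
    terms = set(query.lower().split())
    return {
        doc_id
        for doc_id, text in documents.items()
        if terms & set(text.lower().split())
    }


# Hypothetical three-document collection.
documents = {
    "d1": "jaguar car dealer",
    "d2": "jaguar animal habitat",
    "d3": "used car prices",
}

before = partial_match("jaguar car", documents)  # d3 matches only the term "car"
after = partial_match("jaguar", documents)       # the partial match d3 is gone

print(classify_modification("jaguar car", "jaguar"), sorted(before), sorted(after))
```

With a strictly conjunctive retrieval rule the effect would be reversed, so the direction of the change depends on how the search engine treats partial matches.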

Author information

Authors and Affiliations

Centrum Wiskunde en Informatica, Science Park 123, 1098 XG, Amsterdam, The Netherlands
Vera Hollink, Jiyin He & Arjen de Vries

Editor information

Editors and Affiliations

Yahoo! Research, Diagonal 177, 08018, Barcelona, Spain
Ricardo Baeza-Yates & B. Barla Cambazoglu
Centrum Wiskunde & Informatica, Science Park 123, Amsterdam, The Netherlands
Arjen P. de Vries
Websays, Nàpols 294 7-4, 08025, Barcelona, Spain
Hugo Zaragoza
Yahoo! Research, Diagonal 177, 08018, Barcelona, Spain
Vanessa Murdock
Yahoo! Labs, Tower 3, Matam Park, 31905, Haifa, Israel
Ronny Lempel
ISTI-CNR, via G. Moruzzi, 1, 56124, Pisa, Italy
Fabrizio Silvestri

About this paper

Cite this paper

Hollink, V., He, J., de Vries, A. (2012). Explaining Query Modifications. In: Baeza-Yates, R., et al. Advances in Information Retrieval. ECIR 2012. Lecture Notes in Computer Science, vol 7224. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28997-2_1

DOI: https://doi.org/10.1007/978-3-642-28997-2_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-28996-5
Online ISBN: 978-3-642-28997-2
eBook Packages: Computer Science, Computer Science (R0)
