Skip to main content

Query Intent Understanding

  • Chapter
  • First Online:
Query Understanding for Search Engines

Part of the book series: The Information Retrieval Series ((INRE,volume 46))

Abstract

Search engines aim at helping users find relevant results from the Web. Understanding the underlying intent of queries issued to search engines is a critical step toward this goal. Till now, it is still a challenge to have a scientific definition of query intent. Existing approaches attempting to understand query intents can be classified into two categories: (1) query intent classification: mapping queries into categories and (2) query intent mining: finding subtopics covered by the queries. For the first group of work, the mapping between queries and categories can be conducted in various ways, including classifying based on navigational, informational, or transactional intent, based on geographic locality, temporal intent, topical categories, or available vertical services. For query intent mining, the output can be a list of explicit subqueries, or some implicit representation of subintent, such as a list of document clusters, a list of entities, etc. In this chapter, we will introduce these query intent prediction approaches in detail.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Rakesh Agrawal, Sreenivas Gollapudi, Alan Halverson, and Samuel Ieong. Diversifying search results. In Proceedings of the Second International Conference on Web Search and Data Mining9, pages 5–14, 2009.

    Google Scholar 

  2. Jaime Arguello, Fernando Diaz, Jamie Callan, and Jean-François Crespo. Sources of evidence for vertical selection. In Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 315–322, 2009.

    Google Scholar 

  3. Jaime Arguello, Fernando Diaz, and Jean-François Paiement. Vertical selection in the presence of unlabeled verticals. In Proceeding of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 691–698, 2010.

    Google Scholar 

  4. Ricardo A. Baeza-Yates, Liliana Calderón-Benavides, and Cristina N. González-Caro. The intention behind web queries. In Proceedings of the 13th International Conference on String Processing and Information Retrieval, pages 98–109, 2006.

    Google Scholar 

  5. Steven M. Beitzel, Eric C. Jensen, Ophir Frieder, David D. Lewis, Abdur Chowdhury, and Aleksander Kolcz. Improving automatic query classification via semi-supervised learning. In Proceedings of the 5th IEEE International Conference on Data Mining, pages 42–49, 2005.

    Google Scholar 

  6. Steven M. Beitzel, Eric C. Jensen, Abdur Chowdhury, and Ophir Frieder. Varying approaches to topical web query classification. In Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 783–784, 2007.

    Google Scholar 

  7. Jiang Bian, Tie-Yan Liu, Tao Qin, and Hongyuan Zha. Ranking with query-dependent loss for web search. In Proceedings of the Third International Conference on Web Search and Data Mining, pages 141–150, 2010.

    Google Scholar 

  8. David J. Brenes and Daniel Gayo-Avello. Automatic detection of navigational queries according to behavioural characteristics. In Proceedings of the LWA 2008 - Workshop-Woche: Lernen, Wissen & Adaptivität, pages 41–48, 2008.

    Google Scholar 

  9. David J. Brenes, Daniel Gayo-Avello, and Kilian Pérez-González. Survey and evaluation of query intent detection methods. In Proceedings of the 2009 workshop on Web Search Click Data, pages 1–7, 2009.

    Google Scholar 

  10. Andrei Z. Broder. A taxonomy of web search. SIGIR Forum, 36 (2): 3–10, 2002.

    Article  Google Scholar 

  11. Andrei Z. Broder, Marcus Fontoura, Evgeniy Gabrilovich, Amruta Joshi, Vanja Josifovski, and Tong Zhang. Robust classification of rare queries using web knowledge. In Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 231–238, 2007.

    Google Scholar 

  12. Jaime G. Carbonell and Jade Goldstein. The use of MMR, diversity-based reranking for reordering documents and producing summaries. In Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 335–336, 1998.

    Google Scholar 

  13. Harr Chen and David R. Karger. Less is more: probabilistic models for retrieving fewer relevant documents. In Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 429–436, 2006.

    Google Scholar 

  14. Charles L. A. Clarke, Nick Craswell, and Ian Soboroff. Overview of the TREC 2009 web track. In Proceedings of The Eighteenth Text REtrieval Conference, volume 500–278, 2009.

    Google Scholar 

  15. ClueWeb09. The clueweb09 dataset. http://boston.lti.cs.cmu.edu/Data/clueweb09/.

  16. Stephen Cronen-Townsend, Yun Zhou, and W. Bruce Croft. Predicting query performance. In Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 299–306, 2002.

    Google Scholar 

  17. Van Dang and W. Bruce Croft. Diversity by proportionality: an election-based approach to search result diversification. In Proceedings of the 35th International ACM SIGIR conference on research and development in Information Retrieval, pages 65–74, 2012.

    Google Scholar 

  18. Vincenzo Deufemia, Massimiliano Giordano, Giuseppe Polese, and Luigi Marco Simonetti. Exploiting interaction features in user intent understanding. In Proceedings of the 15th Asia-Pacific Web Conference, pages 506–517, 2013.

    Google Scholar 

  19. Zhicheng Dou, Ruihua Song, and Ji-Rong Wen. A large-scale evaluation and analysis of personalized search strategies. In Proceedings of the 16th International Conference on World Wide Web, pages 581–590, 2007.

    Google Scholar 

  20. Zhicheng Dou, Ruihua Song, Ji-Rong Wen, and Xiaojie Yuan. Evaluating the effectiveness of personalized web search. IEEE Trans. Knowl. Data Eng., 21 (8): 1178–1190, 2009.

    Article  Google Scholar 

  21. Zhicheng Dou, Sha Hu, Kun Chen, Ruihua Song, and Ji-Rong Wen. Multi-dimensional search result diversification. In Proceedings of the Forth International Conference on Web Search and Data Mining, pages 475–484, 2011.

    Google Scholar 

  22. Qi Guo and Eugene Agichtein. Exploring mouse movements for inferring query intent. In Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 707–708, 2008.

    Google Scholar 

  23. Mohammed Hasanuzzaman, Sriparna Saha, Gaël Dias, and Stéphane Ferrari. Understanding temporal query intent. In Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 823–826, 2015.

    Google Scholar 

  24. Mauro Rojas Herrera, Edleno Silva de Moura, Marco Cristo, Thomaz Philippe C. Silva, and Altigran Soares da Silva. Exploring features for the automatic identification of user goals in web search. Inf. Process. Manage., 46 (2): 131–142, 2010.

    Google Scholar 

  25. Jian Hu, Gang Wang, Frederick H. Lochovsky, Jian-Tao Sun, and Zheng Chen. Understanding user’s query intent with Wikipedia. In Proceedings of the 18th International Conference on World Wide Web, pages 471–480, 2009.

    Google Scholar 

  26. Sha Hu, Zhicheng Dou, Xiao-Jie Wang, Tetsuya Sakai, and Ji-Rong Wen. Search result diversification based on hierarchical intents. In Proceedings of the 24th ACM International Conference on Information and Knowledge Management, pages 63–72, 2015.

    Google Scholar 

  27. Yunhua Hu, Ya-nan Qian, Hang Li, Daxin Jiang, Jian Pei, and Qinghua Zheng. Mining query subtopics from search log data. In Proceedings of the 35th International ACM SIGIR conference on research and development in Information Retrieval, pages 305–314, 2012.

    Google Scholar 

  28. Bernard J. Jansen, Amanda Spink, and Tefko Saracevic. Real life, real users, and real needs: a study and analysis of user queries on the web. Inf. Process. Manag., 36 (2): 207–227, 2000.

    Article  Google Scholar 

  29. Bernard J. Jansen, Danielle L. Booth, and Amanda Spink. Determining the user intent of web search engine queries. In Proceedings of the 16th International Conference on World Wide Web, pages 1149–1150, 2007.

    Google Scholar 

  30. Bernard J. Jansen, Danielle L. Booth, and Amanda Spink. Determining the informational, navigational, and transactional intent of web queries. Inf. Process. Manag., 44 (3): 1251–1266, 2008.

    Article  Google Scholar 

  31. In-Ho Kang. Transactional query identification in web search. In Proceedings of the Second Asia Information Retrieval Symposium, pages 221–232, 2005.

    Google Scholar 

  32. In-Ho Kang and Gil-Chang Kim. Query type classification for web document retrieval. In Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 64–71, 2003.

    Google Scholar 

  33. Nattiya Kanhabua, Tu Ngoc Nguyen, and Wolfgang Nejdl. Learning to detect event-related queries for web search. In Proceedings of the 24th International Conference on World Wide Web, pages 1339–1344, 2015.

    Google Scholar 

  34. Uichin Lee, Zhenyu Liu, and Junghoo Cho. Automatic identification of user goals in web search. In Proceedings of the 14th international conference on World Wide Web, pages 391–400, 2005.

    Google Scholar 

  35. Xiao Li, Ye-Yi Wang, and Alex Acero. Learning query intent from regularized click graphs. In Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 339–346, 2008.

    Google Scholar 

  36. Xiao Li, Ye-Yi Wang, Dou Shen, and Alex Acero. Learning with click graph for query intent classification. ACM Trans. Inf. Syst., 28 (3): 12:1–12:20, 2010.

    Google Scholar 

  37. Yiqun Liu, Min Zhang, Liyun Ru, and Shaoping Ma. Automatic query type identification based on click through information. In Proceedings of the Third Asia Information Retrieval Symposium, pages 593–600, 2006.

    Google Scholar 

  38. Yiqun Liu, Ruihua Song, Min Zhang, Zhicheng Dou, Takehiro Yamamoto, Makoto P. Kato, Hiroaki Ohshima, and Ke Zhou. Overview of the NTCIR-11 imine task. In Proceedings of the 11th NTCIR Conference on Evaluation of Information Access Technologies, 2014.

    Google Scholar 

  39. Yuchen Liu, Xiaochuan Ni, Jian-Tao Sun, and Zheng Chen. Unsupervised transactional query classification based on webpage form understanding. In Proceedings of the 20th ACM Conference on Information and Knowledge Management, pages 57–66, 2011.

    Google Scholar 

  40. Yumao Lu, Fuchun Peng, Xin Li, and Nawaaz Ahmed. Coupling feature selection and machine learning methods for navigational query identification. In Proceedings of the 2006 ACM CIKM International Conference on Information and Knowledge Management, pages 682–689, 2006.

    Google Scholar 

  41. Marcelo Mendoza and Juan Zamora. Identifying the intent of a user query using support vector machines. In Proceedings of the 16th International Symposium on String Processing and Information Retrieval, pages 131–142, 2009.

    Google Scholar 

  42. David Nettleton, Liliana Calderón-benavides, and Ricardo Baeza-yates. Analysis of web search engine query sessions. In Proceedings of WebKDD 2006: KDD Workshop on Web Mining and Web Usage Analysis, in conjunction with the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2006.

    Google Scholar 

  43. Filip Radlinski and Susan T. Dumais. Improving personalized web search using result diversification. In Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 691–692, 2006.

    Google Scholar 

  44. Filip Radlinski, Martin Szummer, and Nick Craswell. Inferring query intent from reformulations and clicks. In Proceedings of the 19th International Conference on World Wide Web, pages 1171–1172, 2010.

    Google Scholar 

  45. Davood Rafiei, Krishna Bharat, and Anand Shukla. Diversifying web search results. In Proceedings of the 19th International Conference on World Wide Web, pages 781–790, 2010.

    Google Scholar 

  46. Karthik Raman, Paul N. Bennett, and Kevyn Collins-Thompson. Toward whole-session relevance: exploring intrinsic diversity in web search. In Proceedings of the 36th International ACM SIGIR conference on research and development in Information Retrieval, pages 463–472, 2013.

    Google Scholar 

  47. Daniel E. Rose and Danny Levinson. Understanding user goals in web search. In Proceedings of the 13th international conference on World Wide Web, pages 13–19, 2004.

    Google Scholar 

  48. Tetsuya Sakai, Zhicheng Dou, Takehiro Yamamoto, Yiqun Liu, Min Zhang, and Ruihua Song. Overview of the NTCIR-10 INTENT-2 task. In Proceedings of the 10th NTCIR Conference on Evaluation of Information Access Technologies, 2013.

    Google Scholar 

  49. Rodrygo L. T. Santos, Jie Peng, Craig Macdonald, and Iadh Ounis. Explicit search result diversification through sub-queries. In Proceedings of the 32nd European Conference on IR Research, pages 87–99, 2010.

    Google Scholar 

  50. Dou Shen, Jian-Tao Sun, Qiang Yang, and Zheng Chen. Building bridges for web query classification. In Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 131–138, 2006.

    Google Scholar 

  51. Luo Si and James P. Callan. Relevant document distribution estimation method for resource selection. In Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 298–305, 2003.

    Google Scholar 

  52. Craig Silverstein, Monika Rauch Henzinger, Hannes Marais, and Michael Moricz. Analysis of a very large web search engine query log. SIGIR Forum, 33 (1): 6–12, 1999.

    Google Scholar 

  53. Ruihua Song, Zhenxiao Luo, Jian-Yun Nie, Yong Yu, and Hsiao-Wuen Hon. Identification of ambiguous queries in web search. Inf. Process. Manag., 45 (2): 216–229, 2009.

    Article  Google Scholar 

  54. Ruihua Song, Min Zhang, Tetsuya Sakai, Makoto P. Kato, Yiqun Liu, Miho Sugimoto, Qinglei Wang, and Naoki Orii. Overview of the NTCIR-9 INTENT task. In Proceedings of the 9th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, Question Answering and Cross-Lingual Information Access, 2011.

    Google Scholar 

  55. Markus Strohmaier, Mark Kröll, and Christian Körner. Intentional query suggestion: making user goals more explicit during search. In Proceedings of the 2009 workshop on Web Search Click Data, pages 68–74, 2009.

    Google Scholar 

  56. Gilad Tsur, Yuval Pinter, Idan Szpektor, and David Carmel. Identifying web queries with question intent. In Proceedings of the 25th International Conference on World Wide Web, pages 783–793, 2016.

    Google Scholar 

  57. Qinglei Wang, Ya-nan Qian, Ruihua Song, Zhicheng Dou, Fan Zhang, Tetsuya Sakai, and Qinghua Zheng. Mining subtopics from text fragments for a web query. Inf. Retr., 16 (4): 484–503, 2013.

    Article  Google Scholar 

  58. Xiao-Jie Wang, Ji-Rong Wen, Zhicheng Dou, Tetsuya Sakai, and Rui Zhang. Search result diversity evaluation based on intent hierarchies. IEEE Trans. Knowl. Data Eng., 30 (1): 156–169, 2018.

    Article  Google Scholar 

  59. Xuanhui Wang and ChengXiang Zhai. Learn from web search logs to organize search results. In Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 87–94, 2007.

    Google Scholar 

  60. Yu Wang and Eugene Agichtein. Query ambiguity revisited: Clickthrough measures for distinguishing informational and ambiguous queries. In Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, pages 361–364, 2010.

    Google Scholar 

  61. Takehiro Yamamoto, Yiqun Liu, Min Zhang, Zhicheng Dou, Ke Zhou, Ilya Markov, Makoto P. Kato, Hiroaki Ohshima, and Sumio Fujita. Overview of the NTCIR-12 imine-2 task. In Proceedings of the 12th NTCIR Conference on Evaluation of Information Access Technologies, 2016.

    Google Scholar 

  62. Xing Yi, Hema Raghavan, and Chris Leggetter. Discovering users’ specific geo intention in web search. In Proceedings of the 18th International Conference on World Wide Web, pages 481–490, 2009.

    Google Scholar 

  63. Yisong Yue and Thorsten Joachims. Predicting diverse subsets using structural SVMs. In Proceedings of the Twenty-Fifth International Conference on Machine Learning, pages 1224–1231, 2008.

    Google Scholar 

  64. Juan Zamora, Marcelo Mendoza, and Héctor Allende. Query intent detection based on query log mining. J. Web Eng., 13 (1&2): 24–52, 2014.

    Google Scholar 

  65. Hua-Jun Zeng, Qi-Cai He, Zheng Chen, Wei-Ying Ma, and Jinwen Ma. Learning to cluster web search results. In Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 210–217, 2004.

    Google Scholar 

  66. Yue Zhao and Claudia Hauff. Temporal query intent disambiguation using time-series data. In Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, pages 1017–1020, 2016.

    Google Scholar 

  67. Ke Zhou, Ronan Cummins, Martin Halvey, Mounia Lalmas, and Joemon M. Jose. Assessing and predicting vertical intent for web queries. In Proceedings of the 34th European Conference on IR Research, pages 499–502, 2012.

    Google Scholar 

  68. Xiaojin Zhu, Andrew B. Goldberg, Jurgen Van Gael, and David Andrzejewski. Improving diversity in ranking using absorbing random walks. In Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, pages 97–104, 2007.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zhicheng Dou .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Dou, Z., Guo, J. (2020). Query Intent Understanding. In: Chang, Y., Deng, H. (eds) Query Understanding for Search Engines. The Information Retrieval Series, vol 46. Springer, Cham. https://doi.org/10.1007/978-3-030-58334-7_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-58334-7_4

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-58333-0

  • Online ISBN: 978-3-030-58334-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics