Query Ambiguity Identification Based on User Behavior Information

  • Cheng Luo
  • Yiqun Liu
  • Min Zhang
  • Shaoping Ma
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8870)


Query ambiguity identification is of vital importance for Web search related studies such as personalized search or diversified ranking. Different from existing solutions which usually require a supervised topic classification process, we propose a query ambiguity identification framework which takes user behavior features collected from click-through logs into consideration. Especially, besides the features collected from query level, we focus on how to tell the differences between clear and ambiguous queries via features extracted from multi-query sessions. Inspired by recent progresses in word representation researches, we propose a query representation approach named “query2vec” which constructs representations from the distributions of queries in query log context. Experiment results based on large scale commercial search engine logs show effectiveness of the proposed framework as well as the corresponding representation approach.


Search Engine User Session Personalized Search Query Intent Query Representation 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Clarke, C.L.A., Kolla, M., Vechtomova, O.: An Effectiveness Measure for Ambiguous and Underspecified Queries. In: Azzopardi, L., Kazai, G., Robertson, S., Rüger, S., Shokouhi, M., Song, D., Yilmaz, E. (eds.) ICTIR 2009. LNCS, vol. 5766, pp. 188–199. Springer, Heidelberg (2009)CrossRefGoogle Scholar
  2. 2.
    Mirco, S., Gauch S.: Personalized Search based on User Search Histories. In: The 2005 IEEE/WIC/ACM International Conference on IEEE, Web Intelligence, pp. 622–628 (2005)Google Scholar
  3. 3.
    Song, R., et al.: Identification of Ambiguous Queries in Web Search. Information Processing & Management 45(2), 216–229 (2009)CrossRefGoogle Scholar
  4. 4.
    Song, R., et al.: Learning Query Ambiguity Models by using Search Logs. Journal of Computer Science and Technology 25(4), 728–738 (2010)CrossRefMathSciNetGoogle Scholar
  5. 5.
    Mikolov, T., et al.: Distributed Representations of Words and Phrases and their Compositionality. Advances in Neural Information Processing Systems (2013)Google Scholar
  6. 6.
    Mikolov, T., et al.: Efficient Estimation of Word Representations in Vector Space. arXiv preprint arXiv:1301.3781 (2013)Google Scholar
  7. 7.
    Broder, A.: A Taxonomy of Web Search. SIGIR Forum 36(2) (2002)Google Scholar
  8. 8.
    Rose, D.E., Levinson, D.: Understanding User Goals in Web Search. In: Proceedings of the 13th International Conference on World Wide Web, pp. 13–19. ACM (2004)Google Scholar
  9. 9.
    Zhai, C.X., Cohen, W.W., Lafferty, J.: Beyond Independent Relevance: Methods and Evaluation Metrics for Subtopic Retrieval. In: Proceedings of SIGIR 2013, pp. 10–17. ACM (2003)Google Scholar
  10. 10.
    Chirita, P.A., et al.: Using ODP Metadata to Personalize Search. In: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 178–185. ACM (2005)Google Scholar
  11. 11.
    Carolyn Theresa, H., Jansen, B.J.: Understanding the Specificity of Web Search Queries. In: CHI 2013 Extended Abstracts on Human Factors in Computing Systems, pp. 1827–1832. ACM (2013)Google Scholar
  12. 12.
    Li, Y., Zheng, Z., Dai, H.K.: KDD CUP-2005 report: Facing a Great Challenge. ACM SIGKDD Explorations Newsletter 7(2), 91–99 (2005)CrossRefGoogle Scholar
  13. 13.
    Shen, D., et al.: Building Bridges for Web Query Classification. In: Proceedings of SIGIR 2006, pp. 131–138. ACM (2006)Google Scholar
  14. 14.
    Beitzel, S.M., et al.: Varying Approaches to Topical Web Query Classification. In: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 783–784. ACM (2007)Google Scholar
  15. 15.
    Westerveld, T., Kraaij, W., Hiemstra, D.: Retrieving Web Pages using Content, Links, Urls and Anchors, pp. 663–672 (2002)Google Scholar
  16. 16.
    Brenes, D.J., Gayo-Avello, D.: Automatic Detection of Navigational Queries according to Behavioural Characteristics, pp. 41–48. LWA (2008)Google Scholar
  17. 17.
    Agichtein, E., Zheng, Z.: Identifying Best Bet Web Search Results by Mining Past User Behavior. In: Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 902–908. ACM (2006)Google Scholar
  18. 18.
    Liu, Y., Zhang, M., Ru, L., Ma, S.: Automatic Query Type Identification based on Click through Information. In: Ng, H.T., Leong, M.-K., Kan, M.-Y., Ji, D. (eds.) AIRS 2006. LNCS, vol. 4182, pp. 593–600. Springer, Heidelberg (2006)CrossRefGoogle Scholar
  19. 19.
    Wang, Y., Agichtein, E.: Query Ambiguity Revisited: Clickthrough Measures for Distinguishing Informational and Ambiguous Queries. In: Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics. ACL (2010)Google Scholar
  20. 20.
    Lee, U., Liu, Z., Cho, J.: Automatic Identification of User Goals in Web Search. In: Proceedings of the 14th International Conference on World Wide Web, pp. 391–400. ACM (2005)Google Scholar

Copyright information

© Springer International Publishing Switzerland 2014

Authors and Affiliations

  • Cheng Luo
    • 1
  • Yiqun Liu
    • 1
  • Min Zhang
    • 1
  • Shaoping Ma
    • 1
  1. 1.State Key Laboratory of Intelligent Technology and Systems,Tsinghua National Laboratory for Information Science and Technology,Department of Computer Science and TechnologyTsinghua UniversityBeijingChina

Personalised recommendations