A Multi-faceted Approach to Query Intent Classification

  • Cristina González-Caro
  • Ricardo Baeza-Yates
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7024)


In this paper we report results for automatic classification of queries in a wide set of facets that are useful to the identification of query intent. Our hypothesis is that the performance of single-faceted classification of queries can be improved by introducing information of multi-faceted training samples into the learning process. We test our hypothesis by performing a multi-faceted classification of queries based on the combination of correlated facets. Our experimental results show that this idea can significantly improve the quality of the classification. Since most of previous works in query intent classification are oriented to the study of single facets, these results are a first step to an integrated query intent classification model.


Spatial Sensitivity Genre Category Automatic Prediction Query Topic Query Intent 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Baeza-Yates, R., Calderón-Benavides, L., González-Caro, C.N.: The Intention Behind Web Queries. In: Crestani, F., Ferragina, P., Sanderson, M. (eds.) SPIRE 2006. LNCS, vol. 4209, pp. 98–109. Springer, Heidelberg (2006)CrossRefGoogle Scholar
  2. 2.
    Beitzel, S.M., Jensen, E.C., Frieder, O., Lewis, D.D., Chowdhury, A., Kolcz, A.: Improving automatic query classification via semi-supervised learning. In: ICDM 2005, pp. 42–49. IEEE, Los Alamitos (2005)Google Scholar
  3. 3.
    Broder, A.: A taxonomy of web search. SIGIR Forum 36, 3–10 (2002)CrossRefzbMATHGoogle Scholar
  4. 4.
    chung Chang, C., Lin, C.J.: Libsvm: a library for support vector machines (2001)Google Scholar
  5. 5.
    Cool, C., Belkin, N.J.: A classification of interactions with information. In: Proceedings of the Fourth International Conference on Conceptions of Library and Information Science, pp. 1–15. Libraries Unlimited, Greenwood Village (2002)Google Scholar
  6. 6.
    Herrera, M.R., de Moura, E.S., Cristo, M., Silva, T.P., da Silva, A.S.: Exploring features for the automatic identification of user goals in web search. Inf. Process. Manage. 46, 131–142 (2010)CrossRefGoogle Scholar
  7. 7.
    Jansen, B.J., Booth, D.L., Spink, A.: Determining the informational, navigational, and transactional intent of web queries. Inf. Process. Manage. 44(3), 1251–1266 (2008)CrossRefGoogle Scholar
  8. 8.
    Jones, R., Zhang, W.V., Rey, B., Jhala, P., Stipp, E.: Geographic intention and modification in web search. Int. J. Geogr. Inf. Sci. 22, 229–246 (2008)CrossRefGoogle Scholar
  9. 9.
    Lee, U., Liu, Z., Cho, J.: Automatic identification of user goals in web search. In: WWW 2005, pp. 391–400. ACM, New York (2005)Google Scholar
  10. 10.
    Li, X., Wang, Y.Y., Shen, D., Acero, A.: Learning with click graph for query intent classification. ACM Trans. Inf. Syst. 28, 12:1–12:20 (2010)CrossRefGoogle Scholar
  11. 11.
    Liu, Y., Zhang, M., Ru, L., Ma, S.: Automatic query type identification based on click through information. In: Ng, H.T., Leong, M.-K., Kan, M.-Y., Ji, D. (eds.) AIRS 2006. LNCS, vol. 4182, pp. 593–600. Springer, Heidelberg (2006)CrossRefGoogle Scholar
  12. 12.
    Metzler, D., Jones, R., Peng, F., Zhang, R.: Improving search relevance for implicitly temporal queries. In: SIGIR 2009, pp. 700–701. ACM, New York (2009)Google Scholar
  13. 13.
    Nguyen, V.B., Kan, M.Y.: Functional faceted web query analysis. In: Amitay, E., Murray, C.G., Teevan, J. (eds.) Query Log Analysis: Social And Technological Challenges. A Workshop at WWW 2007 (2007)Google Scholar
  14. 14.
    Qin, Y.-p., Wang, X.-k.: Study on multi-label text classification based on svm. In: FSKD 2009, vol. 01, pp. 300–304. IEEE Computer Society, Los Alamitos (2009)Google Scholar
  15. 15.
    Rafiei, D., Bharat, K., Shukla, A.: Diversifying web search results. In: WWW 2010, pp. 781–790. ACM, New York (2010)Google Scholar
  16. 16.
    Rose, D.E., Levinson, D.: Understanding user goals in web search. In: WWW 2004, pp. 13–19. ACM, New York (2004)Google Scholar
  17. 17.
    Shen, D., Sun, J.T., Yang, Q., Chen, Z.: Building bridges for web query classification. In: SIGIR 2006, pp. 131–138. ACM, New York (2006)Google Scholar
  18. 18.
    Song, R., Luo, Z., Nie, J.Y., Yu, Y., Hon, H.W.: Identification of ambiguous queries in web search. Inf. Process. Manage. 45, 216–229 (2009)CrossRefGoogle Scholar
  19. 19.
    Teevan, J., Dumais, S.T., Liebling, D.J.: To personalize or not to personalize: modeling queries with variation in user intent. In: SIGIR 2008, pp. 163–170. ACM, New York (2008)Google Scholar
  20. 20.
    Tsoumakas, G., Katakis, I.: Multi-label classification: An overview. Int. J. Data Warehousing and Mining 2007, 1–13 (2007)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Cristina González-Caro
    • 1
    • 3
  • Ricardo Baeza-Yates
    • 2
    • 3
  1. 1.Universidad Autónoma de BucaramangaBucaramangaColombia
  2. 2.Yahoo! Research BarcelonaBarcelonaSpain
  3. 3.Web Research Group Dept. of Information and CommunicationTechnologies Universitat Pompeu FabraBarcelonaSpain

Personalised recommendations