Page Sets as Web Search Answers

  • Takayuki Yumoto
  • Katsumi Tanaka
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4312)


Conventional Web search engines rank their searched results page by page. That is, conventionally, the information unit for both searching and ranking is a single Web page. There are, however, cases where a set of searched pages shows a better similarity (relevance) to a given (keyword) query than each individually searched page. This is because the information a user wishes to have is sometimes distributed on multiple Web pages. In such cases, the information unit used for ranking should be a set of pages rather than a single page. In this paper, we propose the notion of a “page set ranking”, which is to rank each pertinent set of searched Web pages. We describe our new algorithm of the page set ranking to efficiently construct and rank page sets. We present some experimental results and the effectiveness of our approach.


Ranking Score Evaluation Vector Query Keyword Total Feature Wind Power Generation 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Cutting, D.R., Pedersen, J.O., Karger, D., Tukey, J.W.: Scatter/gather: A cluster-based approach to browsing large document collections. In: Proceedings of the Fifteenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 318–329 (1992)Google Scholar
  2. 2.,
  3. 3.
  4. 4.
    Yumoto, T., Tanaka, K.: Finding pertinent page-pairs from web search results. In: Fox, E.A., Neuhold, E.J., Premsmit, P., Wuwongse, V. (eds.) ICADL 2005. LNCS, vol. 3815, pp. 301–310. Springer, Heidelberg (2005)CrossRefGoogle Scholar
  5. 5.
    Sun, J.T., Wang, X., Shen, D., Zeng, H.J., Chen, Z.: CWS: a comparative web search system. In: WWW 2006: Proceedings of the 15th international conference on World Wide Web, pp. 467–476. ACM Press, New York (2006)CrossRefGoogle Scholar
  6. 6.
    Salton, G.: Developments in automatic text retrieval. Science 253, 974–979 (1991)CrossRefMathSciNetGoogle Scholar
  7. 7.
    Oyama, S., Tanaka, K.: Query modification by discovering topics from web page structures. In: Yu, J.X., Lin, X., Lu, H., Zhang, Y. (eds.) APWeb 2004. LNCS, vol. 3007, pp. 553–564. Springer, Heidelberg (2004)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Takayuki Yumoto
    • 1
  • Katsumi Tanaka
    • 1
  1. 1.Dept. of Social Informatics, Graduate School of InformaticsKyoto UniversityKyotoJapan

Personalised recommendations