Using Web Archive for Improving Search Engine Results

  • Adam Jatowt
  • Yukiko Kawai
  • Katsumi Tanaka
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3841)

Abstract

Search engines affect page popularity by making it difficult for currently unpopular pages to reach the top ranks in the search results. This is because people tend to visit and create links to the top-ranked pages. We have addressed this problem by analyzing the previous content of web pages. Our approach is based on the observation that the quality of this content greatly affects link accumulation and hence the final rank of the page. We propose detecting the content that has the greatest impact on the link accumulation process of top-ranked pages and using it for detecting high quality but unpopular web pages. Such pages would have higher ranks assigned.

Keywords

Search Engine Page Content Page Ranking User Attention Query Topic 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Amitay, E., Carmel, D., Herscovici, M., Lempel, R., Soffer, A.: Trend Detection Through Temporal Link Analysis. Journal of The American Society for Information Science and Technology 55, 1–12 (2004)CrossRefGoogle Scholar
  2. 2.
    Baeza-Yates, R., Saint-Jean, F., Castillo, C.: Web Structure, Age and Page Quality. In: Laender, A.H.F., Oliveira, A.L. (eds.) SPIRE 2002. LNCS, vol. 2476, pp. 117–130. Springer, Heidelberg (2002)CrossRefGoogle Scholar
  3. 3.
    Cho, J., Roy, S.: Impact of search engines on page popularity. In: Proceedings of the 13th International World Wide Web Conference, New York, USA (2004)Google Scholar
  4. 4.
    Cho, J., Roy, S., Adams, R.: Page quality: In search of an unbiased web ranking. In: Proceedings of SIGMOD 2005, Baltimore, Maryland, USA (2005)Google Scholar
  5. 5.
    Page, L., Brin, S., Motwani, R., Winograd, T.: The PageRank citation ranking: Bringing order to the Web. Technical report, Stanford Digital Library Technologies Project (1998)Google Scholar
  6. 6.
  7. 7.

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Adam Jatowt
    • 1
  • Yukiko Kawai
    • 1
  • Katsumi Tanaka
    • 1
    • 2
  1. 1.National Institute of Information and Communications TechnologyKyotoJapan
  2. 2.Graduate School of InformaticsKyoto UniversityKyotoJapan

Personalised recommendations