Advertisement

Web Structure, Dynamics and Page Quality

  • Ricardo Baeza-Yates
  • Felipe Saint-Jean
  • Carlos Castillo
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2476)

Abstract

This paper is aimed at the study of quantitative measures of the relation between Web structure, page recency, and quality of Web pages. Quality is studied using different link-based metrics considering their relationship with the structure of the Web and the last modification time of a page. We show that, as expected, Pagerank is biased against new pages. As a subproduct we propose a Pagerank variant that includes page recency into account and we obtain information on how recency is related with Web structure.

Keywords

Search Engine Ranking Algorithm Main Page Authority Rank Random Surfer 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. [1]
    Akwan search engine: Main page. http://www.akwan.com, 1999.
  2. [2]
    Baeza-Yates, R., AND Castillo, C. Relating web characteristics with link analysis. In String Processing and Information Retrieval (2001), IEEE Computer Science Press.Google Scholar
  3. [3]
    Brewington, B., Cybenko, G., Stata, R., Bharat, K., AND Maghoul, F. How dynamic is the web? In 9th World Wide Web Conference (2000).Google Scholar
  4. [4]
    Broder, A., Kumar, R., Maghoul, F., Raghavan, P., Rajagopalan, S., Stata, R., AND Tomkins, A. Graph structure in the Web: Experiments and models. In 9th World Wide Web Conference (2000).Google Scholar
  5. [5]
    Cho, J., AND Garcia-Molina, H. The evolution of the Web and implications for an incremental crawler. In The VLDB Journal (2000).Google Scholar
  6. [6]
    Douglas, F., Feldmann, A., Krishnamurthy, B., AND Mogul, J. Rate of change and other metrics: a live study of the World Wide Web. In USENIX Symposium on Internet Technologies and Systems (1997).Google Scholar
  7. [7]
    Google search engine: Main page. http://www.google.com/, 1998.
  8. [8]
    Kleinberg, J. Authoritative sources in a hyperlinked environment. In 9th Symposium on discrete algorithms (1998).Google Scholar
  9. [9]
    Levene, M., AND Poulovassilis, A. Report on International Workshop on Web Dynamics, London, January 2001.Google Scholar
  10. [10]
    Netcraft web server survey. http://www.netcraft.com/survey/, June 2002.
  11. [11]
    Nua internet-how many online. http://www.nua.ie/surveys/howmanyonline/, February 2002.
  12. [12]
    Page, L., Brin, S., Motwani, R., AND Winograd, T. The Pagerank citation algorithm: bringing order to the Web. Tech. rep., Dept. of Computer Science, Stanford University, 1999.Google Scholar
  13. [13]
    TodoCL search engine: Main page. http://www.todocl.cl/, 2000.

Copyright information

© Springer-Verlag Berlin Heidelberg 2002

Authors and Affiliations

  • Ricardo Baeza-Yates
    • 1
  • Felipe Saint-Jean
    • 1
  • Carlos Castillo
    • 1
  1. 1.Computer Science DepartmentUniversity of ChileSantiagoChile

Personalised recommendations