Skip to main content

Using SiteRank for Decentralized Computation of Web Document Ranking

  • Conference paper
Adaptive Hypermedia and Adaptive Web-Based Systems (AH 2004)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3137))

Abstract

The PageRank algorithm demonstrates the significance of the computation of document ranking of general importance or authority in Web information retrieval. However, doing a PageRank computation for the whole Web graph is both time-consuming and costly. State of the art Web crawler based search engines also suffer from the latency in retrieving a complete Web graph for the computation of PageRank. We look into the problem of computing PageRank in a decentralized and timely fashion by making use of SiteRank and aggregating rankings from multiple sites. A SiteRank is basically the ranking generated by applying the classical PageRank algorithm to the graph of Web sites, i.e., the Web graph at the granularity of Web sites instead of Web pages. Our empirical results show that SiteRank also follows a power-law distribution. Our experimental results demonstrate that the decomposition of global Web document ranking computation by making use of SiteRank is a very promising approach for computing global document rankings in a decentralized Web search system. In particular, by sharing SiteRank among member servers, such a search system also obtains a new means to fight link spamming.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Aberer, K., Wu, J.: A framework for decentralized ranking in web information retrieval. In: Zhou, X., Zhang, Y., Orlowska, M.E. (eds.) APWeb 2003. LNCS, vol. 2642, pp. 213–226. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  2. Abiteboul, S., Preda, M., Cobena, G.: Adaptive on-line page importance computation. In: Proceedings of World Wide Wed Conference 2003 (WWW 2003), Budapest, Hungary, May 2003, May 20-24 (2003)

    Google Scholar 

  3. Bharat, K., Chang, B.-W., Henzinger, M., Ruhl, M.: Who links to whom: Mining linkage between web sites. In: Proceedings of the IEEE International Conference on Data Mining (ICDM 2001), San Jose, USA (November 2001)

    Google Scholar 

  4. Faloutsos, M., Faloutsos, P., Faloutsos, C.: On power-law relationships of the internet topology. In: SIGCOMM, pp. 251–262 (1999)

    Google Scholar 

  5. Harchol-Balter, M., Leighton, T., Lewin, D.: Resource discovery in distributed networks. In: Proceedings of the eighteenth annual ACM symposium on Principles of distributed computing, pp. 229–237. ACM Press, New York (1999)

    Chapter  Google Scholar 

  6. Sepandar, D., Kamvar, T.H., Haveliwala, C.D.: Manning, and Gene H. Golub. Exploiting the block structure of theweb for computing pagerank. Technical report, Stanford University (March 2003) (submitted on 4th of March 2003)

    Google Scholar 

  7. Kleinberg, J.: Authoritative sources in a hyperlinked environment. In: Proceedings of the ACM-SIAM Symposium on Discrete Algorithms (1998)

    Google Scholar 

  8. Page, L., Brin, S., Motwani, R., Winograd, T.: The pagerank citation ranking: Bringing order to the web. Technical report, Stanford University (January 1998)

    Google Scholar 

  9. Pandurangan, G., Raghavan, P., Upfal, E.: Using pagerank to characterize web structure. In: Ibarra, O.H., Zhang, L. (eds.) COCOON 2002. LNCS, vol. 2387, p. 330. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  10. Wu, J., Aberer, K.: Using siterank in p2p information retrieval. Technical Report IC/2004/31, Swiss Federal Institute of Technology, Lausanne, Switzerland (March 2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Wu, J., Aberer, K. (2004). Using SiteRank for Decentralized Computation of Web Document Ranking. In: De Bra, P.M.E., Nejdl, W. (eds) Adaptive Hypermedia and Adaptive Web-Based Systems. AH 2004. Lecture Notes in Computer Science, vol 3137. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-27780-4_30

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-27780-4_30

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-22895-0

  • Online ISBN: 978-3-540-27780-4

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics