Peer-to-Peer Systems II

Volume 2735 of the series Lecture Notes in Computer Science pp 207-215

On the Feasibility of Peer-to-Peer Web Indexing and Search

  • Jinyang LiAffiliated withMIT Lab for Computer Science
  • , Boon Thau LooAffiliated withUC
  • , Joseph M. HellersteinAffiliated withUC
  • , M. Frans KaashoekAffiliated withMIT Lab for Computer Science
  • , David R. KargerAffiliated withMIT Lab for Computer Science
  • , Robert MorrisAffiliated withMIT Lab for Computer Science

* Final gross prices may vary according to local VAT.

Get Access


This paper discusses the feasibility of peer-to-peer full-text keyword search of the Web. Two classes of keyword search techniques are in use or have been proposed: flooding of queries over an overlay network (as in Gnutella), and intersection of index lists stored in a distributed hash table. We present a simple feasibility analysis based on the resource constraints and search workload. Our study suggests that the peer-to-peer network does not have enough capacity to make naive use of either of search techniques attractive for Web search. The paper presents a number of existing and novel optimizations for P2P search based on distributed hash tables, estimates their effects on performance, and concludes that in combination these optimizations would bring the problem to within an order of magnitude of feasibility. The paper suggests a number of compromises that might achieve the last order of magnitude.