Federated Search of Text-Based Digital Libraries in Hierarchical Peer-to-Peer Networks

  • Conference paper
Advances in Information Retrieval (ECIR 2005)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 3408)

Abstract

Peer-to-peer architectures are a potentially powerful model for developing large-scale networks of text-based digital libraries, but peer-to-peer networks have so far provided very limited support for text-based federated search of digital libraries using relevance-based ranking. This paper addresses the problems of resource representation, resource ranking and selection, and result merging for federated search of text-based digital libraries in hierarchical peer-to-peer networks. Existing approaches to text-based federated search are adapted and new methods are developed for resource representation and resource selection according to the unique characteristics of hierarchical peer-to-peer networks. Experimental results demonstrate that the proposed approaches offer a better combination of accuracy and efficiency than more common alternatives for federated search in peer-to-peer networks.

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Lu, J., Callan, J. (2005). Federated Search of Text-Based Digital Libraries in Hierarchical Peer-to-Peer Networks. In: Losada, D.E., Fernández-Luna, J.M. (eds) Advances in Information Retrieval. ECIR 2005. Lecture Notes in Computer Science, vol 3408. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-31865-1_5

  • DOI: https://doi.org/10.1007/978-3-540-31865-1_5

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-25295-5

  • Online ISBN: 978-3-540-31865-1

  • eBook Packages: Computer Science, Computer Science (R0)
