Abstract
Peer-to-peer architectures are a potentially powerful model for developing large-scale networks of text-based digital libraries, but peer-to-peer networks have so far provided very limited support for text-based federated search of digital libraries using relevance-based ranking. This paper addresses the problems of resource representation, resource ranking and selection, and result merging for federated search of text-based digital libraries in hierarchical peer-to-peer networks. Existing approaches to text-based federated search are adapted and new methods are developed for resource representation and resource selection according to the unique characteristics of hierarchical peer-to-peer networks. Experimental results demonstrate that the proposed approaches offer a better combination of accuracy and efficiency than more common alternatives for federated search in peer-to-peer networks.
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lu, J., Callan, J. (2005). Federated Search of Text-Based Digital Libraries in Hierarchical Peer-to-Peer Networks. In: Losada, D.E., Fernández-Luna, J.M. (eds) Advances in Information Retrieval. ECIR 2005. Lecture Notes in Computer Science, vol 3408. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-31865-1_5
DOI: https://doi.org/10.1007/978-3-540-31865-1_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-25295-5
Online ISBN: 978-3-540-31865-1
eBook Packages: Computer Science, Computer Science (R0)