Abstract
Traditional approaches to similarity-based retrieval of structured data, such as Case-Based Reasoning (CBR), have largely been implemented on centralized storage systems. When cases in such systems contain both numeric and free-text attributes, similarity-based retrieval cannot exploit standard speedup techniques based on multi-dimensional indexing, so retrieval is implemented as an exhaustive comparison of the case to be solved against every stored case. In this work, we review current research on Peer-to-Peer (P2P) and distributed CBR techniques and propose a novel approach that stores the case-base in a decentralized P2P environment using the notion of Unspecified Ontology, with the aim of building CBR systems that scale up to large case-bases. We develop an algorithm for efficient retrieval of approximately most-similar cases, which exploits inherent characteristics of the unspecified ontology to improve the performance of the case retrieval stage of the CBR problem-solving cycle. The experiments show that the algorithm retrieves cases close to the most-similar cases while reducing the number of cases that must be compared, and hence improves the performance of the retrieval stage. Moreover, the distributed nature of our approach eliminates the computational bottleneck and single point of failure of centralized storage systems.
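To make the centralized baseline concrete, the following is a minimal sketch of exhaustive retrieval over cases that mix numeric and free-text attributes. It is an illustration only and not the algorithm developed in the article: the names `attribute_similarity`, `case_similarity` and `retrieve_exhaustive`, the range-normalized numeric similarity, and the exact-match rule for text are all assumptions chosen for brevity.

```python
from typing import Dict, List, Tuple, Union

Value = Union[float, str]
Case = Dict[str, Value]


def attribute_similarity(a: Value, b: Value, value_range: float) -> float:
    """Local similarity in [0, 1] for a single attribute (assumed measure)."""
    if isinstance(a, str) or isinstance(b, str):
        # Free-text values: case-insensitive exact match only (an assumption).
        return 1.0 if str(a).strip().lower() == str(b).strip().lower() else 0.0
    if value_range <= 0:
        # No known range for this attribute: fall back to equality.
        return 1.0 if a == b else 0.0
    # Numeric values: distance normalized by the attribute's value range.
    return max(0.0, 1.0 - abs(a - b) / value_range)


def case_similarity(query: Case, case: Case, ranges: Dict[str, float]) -> float:
    """Average local similarity over the query's attributes.

    Attributes missing from the stored case contribute 0.
    """
    if not query:
        return 0.0
    total = 0.0
    for name, value in query.items():
        if name in case:
            total += attribute_similarity(value, case[name], ranges.get(name, 0.0))
    return total / len(query)


def retrieve_exhaustive(
    query: Case, case_base: List[Case], ranges: Dict[str, float], k: int = 1
) -> List[Tuple[float, Case]]:
    """Compare the query with every stored case and return the k best matches."""
    scored = [(case_similarity(query, case, ranges), case) for case in case_base]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:k]


if __name__ == "__main__":
    case_base = [
        {"price": 100.0, "color": "red"},
        {"price": 200.0, "color": "Red"},
        {"price": 110.0, "color": "blue"},
    ]
    query = {"price": 105.0, "color": "red"}
    print(retrieve_exhaustive(query, case_base, {"price": 100.0}, k=2))
```

Every query touches every stored case, so the cost grows linearly with the size of the case-base; this is the cost that an approximate, distributed retrieval scheme aims to reduce.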
Cite this article
Berkovsky, S., Kuflik, T. & Ricci, F. P2P case storage and retrieval with an unspecified ontology. Artif Intell Rev 28, 227–255 (2007). https://doi.org/10.1007/s10462-009-9100-2
DOI: https://doi.org/10.1007/s10462-009-9100-2