Advertisement

Artificial Intelligence Review

, Volume 28, Issue 3, pp 227–255 | Cite as

P2P case storage and retrieval with an unspecified ontology

  • Shlomo BerkovskyEmail author
  • Tsvi Kuflik
  • Francesco Ricci
Article
  • 69 Downloads

Abstract

Traditional approaches for similarity-based retrieval of structured data, such as Case-Based Reasoning (CBR), have been largely implemented using centralized storage systems. In such systems, when the cases contain both numeric and free-text attributes, similarity-based retrieval cannot exploit standard speedup techniques based on multi-dimensional indexing, and the retrieval is implemented by an exhaustive comparison of the case to be solved with the whole set of stored cases. In this work, we review current research on Peer-to-Peer (P2P) and distributed CBR techniques and propose a novel approach for storage of the case-base in a decentralized Peer-to-Peer environment using the notion of Unspecified Ontology to improve the performance of the case retrieval stage and build CBR systems that can scale up to large case-bases. We develop an algorithm for efficient retrieval of approximated most-similar cases, which exploits inherent characteristics of the unspecified ontology in order to improve the performance of the case retrieval stage in the CBR problem solving cycle. The experiments show that the algorithm successfully retrieves cases close to the most-similar cases, while reducing the number of cases to be compared. Hence, it improves the performance of the retrieval stage. Moreover, the distributed nature of our approach eliminates the computational bottleneck and single point of failure of the centralized storage systems.

Keywords

Case-Based Reasoning Similarity-based retrieval Peer-to-Peer Unspecified ontology 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Aamodt A, Plaza E (1994) Case-based reasoning: foundational issues, methodological variations, and system approaches.. AI Commun 7(1): 39–59Google Scholar
  2. Adar E, Huberman B (2000) Free riding on Gnutella. Technical report, Xerox PARCGoogle Scholar
  3. Androutsellis-Theotokis S, Spinellis D (2004) A survey of peer-to-peer content distribution technologies. ACM Comput Surv 36(4): 335–371CrossRefGoogle Scholar
  4. Bawa M, Condie T, Ganesan P (2005) LSH forest: self-tuning indexes for similarity search. In: Proceedings of the international conference on World Wide Web, JapanGoogle Scholar
  5. Ben-Asher Y, Berkovsky S (2006) UNSO: unspecified ontologies for peer-to-peer E-commerce applications. J Data Semant 6: 115–142CrossRefGoogle Scholar
  6. Bernstein A, Kaufmann E, Buerki C, Klein M (2005) How similar is it? Towards personalized similarity measures in ontologies. In: Proceedings of the internationale Tagung Wirtschaftsinformatik, GermanyGoogle Scholar
  7. Bogaerts S, Leake D (2004) Facilitating CBR for incompletely-described cases: distance metrics for partial problem descriptions. In: Proceedings of the European conference on case-based reasoning, SpainGoogle Scholar
  8. Bonifacio M, Bouquet P, Mameli G, Nori M (2003) Peer-mediated distributed knowledge management. Agent-Mediated Knowledge Management, SpringerGoogle Scholar
  9. Branting K, Aha DW (1995) Stratified case-based reasoning: reusing hierarchical problem solving episodes. In: Proceedings of the international joint conference on artificial intelligence, CAGoogle Scholar
  10. Clarke I, Sandberg O, Wiley B, Hong T (2000) Freenet: a distributed anonymous information storage and retrieval system. In: Proceedings of the workshop on design issues in anonymity and unobservability, CAGoogle Scholar
  11. Coyle L, Doyle D, Cunningham P (2004) Representing similarity for CBR in XML. In: Proceedings of the European conference on advances in case-based reasoning, SpainGoogle Scholar
  12. de Mantaras RL, McSherry D, Bridge D, Leake D, Smyth B, Craw S, Faltings B, Maher ML, Cox MT, Forbus K, Keane M, Aamodt A, Watson I (2005) Retrieval, reuse, revision and retention in case-based reasoning. Knowl Eng Rev 20(3): 215–240CrossRefGoogle Scholar
  13. Fellbaum C (1998) WordNet–an electronic lexical database. MIT Press PublishersGoogle Scholar
  14. Friedman JH, Bentley JH, Finkel RA (1977) An algorithm for finding best matches in logarithmic expected time. ACM Trans Math Softw 3(3): 209–226zbMATHCrossRefGoogle Scholar
  15. Gruber TR (1993) A translation approach to portable ontology specifications. Knowl Acquis J 6(2): 199–220CrossRefGoogle Scholar
  16. Harren M, Hellerstein JM, Huebsch R, Loo BT, Shenker S, Stoica I (2002) Complex queries in DHT-based peer-to-peer networks. In: Proceedings of the international workshop on peer-to-peer systems, MAGoogle Scholar
  17. Herlocker JL, Konstan JA, Borchers A, Riedl J (1999) An algorithmic framework for performing collaborative filtering. In: Proceedings of the international SIGIR conference on research and development in information retrieval, CAGoogle Scholar
  18. Kalnis P, Ng WS, Ooi BC, Tan KL (2006) Answering similarity queries in peer-to-peer networks. Inf Syst J 31(1): 57–72CrossRefGoogle Scholar
  19. Leake DB, Sooriamurthi R (2001) When two case bases are better than one: exploiting multiple case bases. In: Proceedings of the international conference on case-based reasoning, CanadaGoogle Scholar
  20. Leake DB, Sooriamurthi R (2003) Dispatching cases versus merging case-bases: when MCBR matters. In: Proceedings of the international Florida artificial intelligence research society conference, FLGoogle Scholar
  21. McGinty L, Smyth B (2001) Collaborative case-based reasoning: applications in personalised route planning. In: Proceedings of the international conference on case-based reasoning, CanadaGoogle Scholar
  22. Milojicic DS, Kalogeraki V, Lukose R, Nagaraja K, Pruyne J, Richard B, Rollins S, Xu Z (2002) Peer-to-peer computing. Technical report HPL-2002-57, HP LabsGoogle Scholar
  23. Nagendra Prasad MV, Lesser V, Lander S (1996) Retrieval and reasoning in distributed case bases. J Vis Commun Image Represent, Special Issue on Digital Libraries 7(1): 74–87Google Scholar
  24. Napster Inc, The Napster homepage. http://www.napster.com
  25. Plaxton C, Rajaraman R, Richa A (1997) Accessing nearby copies of replicated objects in a distributed environment. In: Proceedings of the symposium on parallel algorithms and architectures, RIGoogle Scholar
  26. Plaza E, McGinty L (2005) Distributed case-based reasoning. Knowl Eng Rev 20(3): 261–265CrossRefGoogle Scholar
  27. Plaza E, Arcos JL, Martin F (1996) Cooperative case-based reasoning. In: Proceedings of the workshop distributed artificial intelligence meets machine learning, HungaryGoogle Scholar
  28. Pouwelse J, van Slobbe M, Wang J, Reinders MJT, Sips H (2005) P2P-based PVR recommendation using friends, taste buddies and superpeers. In: Proceedings of the beyond personalization workshop, CAGoogle Scholar
  29. Ratnasamy S, Francis P, Handley M, Karp R, Shenker S (2001) A scalable content-addressable network. In: Proceedings of the conference of the special interest group on data communication, CAGoogle Scholar
  30. Richter MM (1992) Classification and learning of similarity measure. In: Proceedings of the annual conference of the German society for classification, GermanyGoogle Scholar
  31. Rowstron A, Druschel P (2001) Pastry: scalable, distributed object location and routing for large-scale peer-to-peer systems. In: Proceedings of the international conference on distributed systems platforms, GermanyGoogle Scholar
  32. Salton G, McGill M (1983) Introduction to modern information retrieval. McGraw-Hill PublishingGoogle Scholar
  33. Schlosser M, Sintek M, Decker S, Nejdl W (2002) A scalable and ontology-based P2P infrastructure for semantic web services. In: Proceedings of the international conference on peer-to-peer computing, SwedenGoogle Scholar
  34. Smyth B, Cunningham P (1996) The utility problem analysed: a case-based reasoning perspective. In: Proceedings of the European workshop on case-based reasoning, SwitzerlandGoogle Scholar
  35. Tanin E, Nayar D, Samet H (2005) An efficient nearest neighbor algorithm for P2P settings. In: Proceedings of the national conference on digital government research, GAGoogle Scholar
  36. Tran DA (2005) Hierarchical semantic overlay approach to P2P similarity search. In: Proceedings of the USENIX annual technical conference, CAGoogle Scholar
  37. Watson I (1997) Applying case-based reasoning: techniques for enterprise systems. Morgan Kaufmann PublishersGoogle Scholar
  38. Watson I, Gardingen D (1999) A distributed case-based reasoning application for engineering sales support. In: Proceedings of the international joint conference on artificial intelligence, CAGoogle Scholar
  39. Wess S, Althoff KD, Derwand G (1993) Using K-d trees to improve the retrieval step in case-based reasoning. In: Proceedings of the European workshop on case-based reasoning, GermanyGoogle Scholar
  40. Wilson DR, Martinez TR (1997) Improved Heterogeneous distance functions. J Artif Intell Res 6: 3–21MathSciNetGoogle Scholar
  41. Wooldridge M (2002) An introduction to multi-agent systems. John Wiley PublishersGoogle Scholar

Copyright information

© Springer Science+Business Media B.V. 2009

Authors and Affiliations

  • Shlomo Berkovsky
    • 1
    Email author
  • Tsvi Kuflik
    • 2
  • Francesco Ricci
    • 3
  1. 1.CSIRO, ICT CentreHobartAustralia
  2. 2.Management Information Systems DepartmentUniversity of HaifaHaifaIsrael
  3. 3.Faculty of Computer ScienceFree University of Bozen-BolzanoBolzanoItaly

Personalised recommendations