ECCBR 2008: Advances in Case-Based Reasoning pp 240-254 | Cite as

Supporting Case-Based Retrieval by Similarity Skylines: Basic Concepts and Extensions

  • Eyke Hüllermeier
  • Ilya Vladimirskiy
  • Belén Prados Suárez
  • Eva Stauch
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5239)

Abstract

Conventional approaches to similarity search and case-based retrieval, such as nearest neighbor search, require the specification of a global similarity measure which is typically expressed as an aggregation of local measures pertaining to different aspects of a case. Since the proper aggregation of local measures is often quite difficult, we propose a novel concept called similarity skyline. Roughly speaking, the similarity skyline of a case base is defined by the subset of cases that are most similar to a given query in a Pareto sense. Thus, the idea is to proceed from a d-dimensional comparison between cases in terms of d (local) distance measures and to identify those cases that are maximally similar in the sense of the Pareto dominance relation [2]. To refine the retrieval result, we propose a method for computing maximally diverse subsets of a similarity skyline. Moreover, we propose a generalization of similarity skylines which is able to deal with uncertain data described in terms of interval or fuzzy attribute values. The method is applied to similarity search over uncertain archaeological data.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Vladimirskiy, I., Hüllermeier, E., Stauch, E.: Similarity search over uncertain archaeological data using a modified skyline operator. In: Wilson, D., Khemani, D. (eds.) Workshop Proceedings of ICCBR 2007, Belfast, Northern Ireland, pp. 31–40 (2007)Google Scholar
  2. 2.
    Aizerman, M., Aleskerov, F.: Theory of Choice. North-Holland, Amsterdam (1995)MATHGoogle Scholar
  3. 3.
    Daniels, J., Rissland, E.: A case-based approach to intelligent information retrieval. In: Proc. 18th International ACM SIGIR Conference, Seattle, Washington, US, pp. 238–245 (1995)Google Scholar
  4. 4.
    Roussopoulos, N., Kelley, S., Vincent, F.: Nearest neighbor queries. In: Proc. SIGMOD 1995, New York, NY, USA, pp. 71–79 (1995)Google Scholar
  5. 5.
    Weber, R., Schek, H., Blott, S.: A quantitative analysis and performance study for similarity-search methods in high-dimensional spaces. In: Proc. VLDB 1998, San Francisco, CA, USA, pp. 194–205 (1998)Google Scholar
  6. 6.
    Richter, M.: Foundations of similarity and utility. In: Proc. FLAIRS-20, The 20th International FLAIRS Conference, Key West, Florida (2007)Google Scholar
  7. 7.
    Cunningham, P.: A taxonomy of similarity mechanisms for case-based reasoning. Technical Report UCD-CSI-2008-01, University College Dublin (2008)Google Scholar
  8. 8.
    Borzsony, S., Kossmann, D., Stocker, K.: The skyline operator. In: Proc. 17th International Conference on Data Engineering, San Jose, California, USA, pp. 421–430 (2001)Google Scholar
  9. 9.
    Papadias, D., Tao, Y., Fu, G., Seeger, B.: Progressive skyline computation in database systems. ACM Transactions on Database Systems 30(1), 41–82 (2005)CrossRefGoogle Scholar
  10. 10.
    McSherry, D.: Diversity-conscious retrieval. In: Craw, S., Preece, A.D. (eds.) ECCBR 2002. LNCS (LNAI), vol. 2416, pp. 219–233. Springer, Heidelberg (2002)CrossRefGoogle Scholar
  11. 11.
    McSherry, D.: Increasing recommendation diversity without loss of similarity. Expert Update 5, 17–26 (2002)MathSciNetGoogle Scholar
  12. 12.
    Kukkonen, S., Lampinen, J.: Ranking-dominance and many-objective optimization. In: IEEE Congress on Evolutionary Computation, Singapore, pp. 3983–3990 (2007)Google Scholar
  13. 13.
    Pardalos, P., Xue, J.: The maximum clique problem. Journal of Global Optimization 4(3), 301–328 (1994)MATHCrossRefMathSciNetGoogle Scholar
  14. 14.
    Tomita, E., Kameda, T.: An efficient branch-and-bound algorithm for finding a maximum clique with computational experiments. Journal of Global Optimization 37(1), 95–111 (2007)MATHCrossRefMathSciNetGoogle Scholar
  15. 15.
    Zadeh, L.: The concept of a linguistic variable and its applications in approximate reasoning. Information Science 8, 199–251 (1975)CrossRefMathSciNetGoogle Scholar
  16. 16.
    Klement, E., Mesiar, R., Pap, E.: Triangular Norms. Kluwer Academic Publishers, Dordrecht (2002)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2008

Authors and Affiliations

  • Eyke Hüllermeier
    • 1
  • Ilya Vladimirskiy
    • 1
  • Belén Prados Suárez
    • 2
  • Eva Stauch
    • 3
  1. 1.Philipps-Universität, FB InformatikMarburgGermany
  2. 2.Department of Computer Science and Artificial IntelligenceUniversity of GranadaSpain
  3. 3.Westfälische Wilhelms-Universität, Historisches SeminarMünsterGermany

Personalised recommendations