ECCBR 2008: Advances in Case-Based Reasoning pp 240-254 | Cite as
Supporting Case-Based Retrieval by Similarity Skylines: Basic Concepts and Extensions
Abstract
Conventional approaches to similarity search and case-based retrieval, such as nearest neighbor search, require the specification of a global similarity measure which is typically expressed as an aggregation of local measures pertaining to different aspects of a case. Since the proper aggregation of local measures is often quite difficult, we propose a novel concept called similarity skyline. Roughly speaking, the similarity skyline of a case base is defined by the subset of cases that are most similar to a given query in a Pareto sense. Thus, the idea is to proceed from a d-dimensional comparison between cases in terms of d (local) distance measures and to identify those cases that are maximally similar in the sense of the Pareto dominance relation [2]. To refine the retrieval result, we propose a method for computing maximally diverse subsets of a similarity skyline. Moreover, we propose a generalization of similarity skylines which is able to deal with uncertain data described in terms of interval or fuzzy attribute values. The method is applied to similarity search over uncertain archaeological data.
Preview
Unable to display preview. Download preview PDF.
References
- 1.Vladimirskiy, I., Hüllermeier, E., Stauch, E.: Similarity search over uncertain archaeological data using a modified skyline operator. In: Wilson, D., Khemani, D. (eds.) Workshop Proceedings of ICCBR 2007, Belfast, Northern Ireland, pp. 31–40 (2007)Google Scholar
- 2.Aizerman, M., Aleskerov, F.: Theory of Choice. North-Holland, Amsterdam (1995)MATHGoogle Scholar
- 3.Daniels, J., Rissland, E.: A case-based approach to intelligent information retrieval. In: Proc. 18th International ACM SIGIR Conference, Seattle, Washington, US, pp. 238–245 (1995)Google Scholar
- 4.Roussopoulos, N., Kelley, S., Vincent, F.: Nearest neighbor queries. In: Proc. SIGMOD 1995, New York, NY, USA, pp. 71–79 (1995)Google Scholar
- 5.Weber, R., Schek, H., Blott, S.: A quantitative analysis and performance study for similarity-search methods in high-dimensional spaces. In: Proc. VLDB 1998, San Francisco, CA, USA, pp. 194–205 (1998)Google Scholar
- 6.Richter, M.: Foundations of similarity and utility. In: Proc. FLAIRS-20, The 20th International FLAIRS Conference, Key West, Florida (2007)Google Scholar
- 7.Cunningham, P.: A taxonomy of similarity mechanisms for case-based reasoning. Technical Report UCD-CSI-2008-01, University College Dublin (2008)Google Scholar
- 8.Borzsony, S., Kossmann, D., Stocker, K.: The skyline operator. In: Proc. 17th International Conference on Data Engineering, San Jose, California, USA, pp. 421–430 (2001)Google Scholar
- 9.Papadias, D., Tao, Y., Fu, G., Seeger, B.: Progressive skyline computation in database systems. ACM Transactions on Database Systems 30(1), 41–82 (2005)CrossRefGoogle Scholar
- 10.McSherry, D.: Diversity-conscious retrieval. In: Craw, S., Preece, A.D. (eds.) ECCBR 2002. LNCS (LNAI), vol. 2416, pp. 219–233. Springer, Heidelberg (2002)CrossRefGoogle Scholar
- 11.McSherry, D.: Increasing recommendation diversity without loss of similarity. Expert Update 5, 17–26 (2002)MathSciNetGoogle Scholar
- 12.Kukkonen, S., Lampinen, J.: Ranking-dominance and many-objective optimization. In: IEEE Congress on Evolutionary Computation, Singapore, pp. 3983–3990 (2007)Google Scholar
- 13.Pardalos, P., Xue, J.: The maximum clique problem. Journal of Global Optimization 4(3), 301–328 (1994)MATHCrossRefMathSciNetGoogle Scholar
- 14.Tomita, E., Kameda, T.: An efficient branch-and-bound algorithm for finding a maximum clique with computational experiments. Journal of Global Optimization 37(1), 95–111 (2007)MATHCrossRefMathSciNetGoogle Scholar
- 15.Zadeh, L.: The concept of a linguistic variable and its applications in approximate reasoning. Information Science 8, 199–251 (1975)CrossRefMathSciNetGoogle Scholar
- 16.Klement, E., Mesiar, R., Pap, E.: Triangular Norms. Kluwer Academic Publishers, Dordrecht (2002)Google Scholar