Skip to main content
Log in

Classification of dissimilarity data with a new flexible Mahalanobis-like metric

  • Theoretical Advances
  • Published:
Pattern Analysis and Applications Aims and scope Submit manuscript

Abstract

Statistical pattern recognition traditionally relies on feature-based representation. For many applications, such vector representation is not available and we only possess proximity data (distance, dissimilarity, similarity, ranks, etc.). In this paper, we consider a particular point of view on discriminant analysis from dissimilarity data. Our approach is inspired by the Gaussian classifier and we defined decision rules to mimic the behavior of a linear or a quadratic classifier. The number of parameters is limited (two per class). Numerical experiments on artificial and real data show interesting behavior compared to Support Vector Machines and to kNN classifier: (a) lower or equivalent error rate, (b) equivalent CPU time, (c) more robustness with sparse dissimilarity data.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14

Similar content being viewed by others

References

  1. Bahlmann C, Haasdonk B, Burkhardt H (2002) On-line handwriting recognition with support vector machines—a kernel approach. In: Eighth international workshop on frontiers in handwriting recognition, ON, Canada

  2. Borg I, Groenen P (1997) Modern multidimensional scaling: theory and applications. Springer, New York

    MATH  Google Scholar 

  3. Celeux G, Govaert G (1995) Parsimonious Gaussian models in cluster analysis. Pattern Recognit 28:781–793

    Article  Google Scholar 

  4. Dubuisson MP, Jain AK (1994) Modified Hausdorff distance for object matching. In: 12th international conference on pattern recognition, vol 1, pp 566–568

  5. Duch W (2000) Similarity-based methods: a general framework for classification, approximation and association. Control Cybern 29(4):937–968

    MATH  MathSciNet  Google Scholar 

  6. Garris M, Blue J, Candela G, Grother P, Janet S, Wilson C (1997) NIST Form-Based Handprint Recognition System (Release 2.0), internal report

  7. Guérin-Dugué A, Oliva A (2000) Classification of scene photographs from local orientations features. Pattern Recognit Lett 11:1135–1140

    Article  Google Scholar 

  8. Guérin-Dugué A, Celeux G (2001) Discriminant analysis on dissimilarity data: a new fast Gaussian-like algorithm. In: AISTATS’20001, Fl, USA, pp 202–207

  9. Haasdonk B, Bahlmann C (2004) Learning with distance substitution kernels, pattern recognition. In: Proceedings of the 26th DAGM symposium, Tübingen, Germany

  10. Haasdonk B, Keysers D (2002) Tangent distance kernels for support vector machines. In: International conference on pattern recognition, QC, Canada

  11. Ho PT, Guérin-Dugué A (2007) A new adaptation of self-organizing map for dissimilarity data. In: Lecture notes in computer sciences, IWANN’2007, 20–22 June 2007, San Sebastian, Spain, pp 219–226

  12. Jain AK, Dubes RC (1988) Algorithms for clustering data. Prentice-Hall, Englewood Cliffs

    MATH  Google Scholar 

  13. Hammer B, Vilmann T (2005) Classification using non-standard metrics. In: Esann’2005, 27–29 April, Bruges, Belgium, pp 303–316

  14. Kohonen T, Somervuo PJ (1998) Self-organizing maps for symbol strings, Neurocomputing 21:19–30

    Article  MATH  Google Scholar 

  15. Kohonen T, Somervuo PJ (2002) How to make large self-organizing maps for non vectorial data, Neural Netw 21(8):945–952

    Article  Google Scholar 

  16. Levenshtein VI (1966) Binary codes capable of correcting deletions, insertions and reversals. Sov Phys Dokl 10(8):707–710

    MathSciNet  Google Scholar 

  17. Lozano M, Sotoca JM, Sànchez JS, Pla F, Pekalska E, Duin RPW (2006) Experimental study on prototype-based classifier in vector spaces. Pattern Recognit 39:1827–1838

    Article  MATH  Google Scholar 

  18. Moreno P, Ho P, Vasconcelos N (2003) A Kullback–Leibler divergence based kernel for SVM classification in multimedia applications. In: Neural information processing system. Whistler, Canada

  19. Pekalska E, Paclik P, Duin RPW (2001) A generalizes kernel approach to dissimilarity-based classification. J Mach Learn Res 2:175–211

    Article  MathSciNet  Google Scholar 

  20. Pekalska E (2005) Dissimilarity representations in pattern recognition: concept, theory and applications. Phd Thesis, ISBN 90-9019021-X

  21. Schlökopf B (2000) The kernel trick for distances. In: Neural information processing system, Vancouver, Canada, pp 301–307

  22. Simard P, LeCun Y, Denker J (1993) Efficient pattern recognition using a new transformation distance. In: Hanson S, Cowan J, Giles L (eds) Advances in neural information processing systems, vol 5. Morgan Kaufmann Publisher

  23. Typke R, Giannopoulos P, Veltkamp RC, Wiering F, van Oostrum R (2003) Using transportation distances for measuring melodic similarity, 4th international conference on music information retrieval, ISMIR, pp 107–114

  24. Van Cutsen B (ed) (1994) Classification and dissimilarity analysis. In: Lecture notes in statistics, vol 95. Springer, Heidelberg

Download references

Acknowledgments

The authors would like to sincerely thank Gilles Celeux, who inspired this work and for all the fruitful and valuable scientific discussions, which allowed this study to be carried out.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Agata Manolova.

Additional information

This work is supported by grants of the “Fonds National pour la Science”, from the program “ACI Masse de Données” and the project “DataHighDim”.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Manolova, A., Guérin-Dugué, A. Classification of dissimilarity data with a new flexible Mahalanobis-like metric. Pattern Anal Applic 11, 337–351 (2008). https://doi.org/10.1007/s10044-008-0101-6

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10044-008-0101-6

Keywords

Navigation