Skip to main content
Log in

Constructing dissimilarity measures

  • Published:
Journal of Classification Aims and scope Submit manuscript

Abstract

The paper addresses the problem of specifying differential weights for variables in the construction of a measure of dissimilarity. An assessor is required to provide subjective judgments of the pairwise dissimilarities within a training set of objects, and these dissimilarities are then modeled as a function of the recorded differences between the objects on each of the variables. The aim is to make explicit the relative importance that assessors attach to each of the variables, and thus obtain guidance on how these variables should be combined into a relevant dissimilarity matrix. The methodology is illustrated by application to some archaeological data.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • ANDERBERG, M. R. (1973),Cluster Analysis for Applications, New York: Academic Press.

    Google Scholar 

  • ANDERSON, E. (1960) “A Semigraphical Method for the Analysis of Complex Problems,”Technometrics, 2, 387–391.

    Google Scholar 

  • BERTIN, J. (1983),Semiology of Graphics: Diagrams, Networks, Maps (translated by W.J. Berg), Madison: University of Wisconsin Press.

    Google Scholar 

  • CARROLL, J. D., and CHANG, J.-J. (1970), “Analysis of Individual Differences in Multidimensional Scaling Via an N-Way Generalization of ‘Eckhart-Young’ Decomposition,”Psychometrika, 35, 283–319.

    Google Scholar 

  • CHERNOFF, H. (1973), “The Use of Faces to Represent Points in k-Dimensional Space Graphically,”Journal of the American Statistical Association, 68, 361–368.

    Google Scholar 

  • DE SOETE, G., DESARBO, W. S., and CARROLL, J. D. (1985), “Optimal Variable Weighting for Hierarchical Clustering: An Alternating Least-Squares Algorithm,”Journal of Classification, 2, 173–192.

    Google Scholar 

  • DESARBO, W. S., CARROLL, J. D., CLARK, L. A., and GREEN, P. E. (1984), “Synthesized Clustering: A Method for Amalgamating Alternative Clustering Bases with Differential Weighting of Variables,”Psychometrika, 49, 57–78.

    Google Scholar 

  • DESARBO, W. S., and MAHAJAN, V. (1984), “Constrained Classification: The Use of A Priori Information in Cluster Analysis,”Psychometrika, 49, 187–215.

    Google Scholar 

  • DIDAY, E., and MOREAU, J. V. (1984), “Learning Hierarchical Clustering from Examples — Application to the Adaptive Construction of Dissimilarity Indices,”Pattern Recognition Letters, 2, 365–378.

    Google Scholar 

  • DIDAY, E., and SIMON, J. C. (1976), “Clustering Analysis,” inCommunication and Cybernetics 10: Digital Pattern Recognition, ed. K. S. Fu, Berlin: Springer-Verlag, 47–94.

    Google Scholar 

  • DRAPER, N. R., and SMITH, H. (1981),Applied Regression Analysis (2nd ed.), New York: Wiley.

    Google Scholar 

  • FOWLKES, E. B., GNANADESIKAN, R., and KETTENRING, J. R. (1988), “Variable Selection in Clustering,”Journal of Classification, 5, 205–228.

    Google Scholar 

  • FRIEDMAN, J. H., and TUKEY, J. W. (1974), “A Projection Pursuit Algorithm for Exploratory Data Analysis,”IEEE Transactions on Computers, C-23, 881–890.

    Google Scholar 

  • GORDON, A. D. (1981),Classification: Methods for the Exploratory Analysis of Multivariate Data, London: Chapman and Hall.

    Google Scholar 

  • GOWER, J. C. (1971), “A General Coefficient of Similarity and Some of its Properties,”Biometrics, 27, 857–874.

    Google Scholar 

  • GOWER, J. C. (1985), “Measures of Similarity, Dissimilarity, and Distance,” inEncyclopedia of Statistical Sciences, Volume 5, eds. S. Kotz, N. L. Johnson and C. B. Read, New York: Wiley, 397–405.

    Google Scholar 

  • GOWER, J. C., and LEGENDRE, P. (1986), “Metric and Euclidean Properties of Dissimilarity Coefficients,”Journal of Classification, 3, 5–48.

    Google Scholar 

  • GREEN, P. E., and DESARBO, W.S. (1978), “Additive Decomposition of Perceptions Data Via Conjoint Analysis,”Journal of Consumer Research, 5, 58–65.

    Google Scholar 

  • HANSOHM, J., and TURULSKI, N. (1985), “Analysis of the Classification of Cetacea: Whales, Porpoises and Dolphins,” inData Analysis in Real Life Environment: Ins and Outs of Solving Problems, eds. J.-F. Marcotorchino, J.-M. Proth, and J. Janssen, Amsterdam: North-Holland, 55–67.

    Google Scholar 

  • JAMBU, M. (1978),Classification Automatique pour L'Analyse des Données, Tome 1, Paris: Dunod.

    Google Scholar 

  • JAMBU, M., and LEBEAUX, M.-O. (1983),Cluster Analysis and Data Analysis, Amsterdam: North Holland Publishing Company.

    Google Scholar 

  • KLEINER, B., and HARTIGAN, J. A. (1981), “Representing Points in Many Dimensions by Trees and Castles,”Journal of the American Statistical Association, 76, 260–276.

    Google Scholar 

  • KRUSKAL, J. B. (1964a), “Multidimensional Scaling by Optimizing Goodness of Fit to a Nonmetric Hypothesis,”Psychometrika, 29, 1–27.

    Google Scholar 

  • KRUSKAL, J. B. (1964b), “Nonmetric Multidimensional Scaling: A Numerical Method,”Psychometrika, 27, 115–129.

    Google Scholar 

  • KRUSKAL, J.B. (1972), “Linear Transformation of Multivariate Data to Reveal Clustering,” inMultidimensional Scaling. Theory and Applications in the Behavioral Sciences, Volume 1 Theory, eds. R. N. Shepard, A. K. Romney, and S. B. Nerlove, New York: Seminar Press, 179–191.

    Google Scholar 

  • KRZANOWSKI, W. J. (1988),Principles of Multivariate Analysis. A User's Perspective, Oxford: Clarendon Press.

    Google Scholar 

  • LACHENBRUCH, P. A. (1975),Discriminant Analysis, New York: Hafner.

    Google Scholar 

  • LANCE, G. N., and WILLIAMS, W. T. (1967), “A General Theory of Classificatory Sorting Strategies I. Hierarchical Systems,”Computer Journal, 9, 373–380.

    Google Scholar 

  • MILLIGAN, G. W. (1980), “An Examination of the Effect of Six Types of Error Perturbation on Fifteen Clustering Algorithms,”Psychometrika, 45, 325–342.

    Google Scholar 

  • MILLIGAN, G. W., and COOPER, M. C. (1988), “A Study of Standardization of Variables in Cluster Analysis,”Journal of Classification, 5, 181–204.

    Google Scholar 

  • SIBSON, R. (1972), “Order Invariant Methods for Data Analysis (with Discussion),”Journal of the Royal Statistical Society, Series B, 34, 311–349.

    Google Scholar 

  • SNEATH, P. H. A., and SOKAL, R. R. (1973),Numerical Taxonomy, San Francisco: Freeman.

    Google Scholar 

  • SNIJDERS, T. A. B., DORMAAR, M., VAN SCHUUR, W. H., DIJKMAN-CAES, C., and DRIESSEN, G. (1990), “Distribution of Some Similarity Coefficients for Dyadic Binary Data in the Case of Associated Attributes,”Journal of Classification, 7, 5–31.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Gordon, A.D. Constructing dissimilarity measures. Journal of Classification 7, 257–269 (1990). https://doi.org/10.1007/BF01908719

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF01908719

Keywords

Navigation