Skip to main content
Log in

On the Indeterminacy of Resemblance Measures for Binary (Presence/Absence) Data

  • Published:
Journal of Classification Aims and scope Submit manuscript

Abstract

Many similarity coefficients for binary data are defined as fractions. For certain resemblance measures the denominator may become zero. If the denominator is zero the value of the coefficient is indeterminate. It is shown that the seriousness of the indeterminacy problem differs with the resemblance measures. Following Batagelj and Bren (1995) we remove the indeterminacies by defining appropriate values in critical cases.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • BARONI-URBANI, C. and BUSER, M.W. (1976), “Similarity of Binary Data,” Systematic Zoology, 25, 251–259.

    Article  Google Scholar 

  • BATAGELJ, V. and BREN, M. (1995), “Comparing Resemblance Measures,” Journal of Classification, 12, 73–90.

    Article  MATH  MathSciNet  Google Scholar 

  • BAULIEU, F.B. (1989), “A Classification of Presence/Absence Based Dissimilarity Coefficients,” Journal of Classification, 6, 233–246.

    Article  MATH  MathSciNet  Google Scholar 

  • BRAUN-BLANQUET, J. (1932), Plant Sociology: The Study of Plant Communities, Authorized English translation of Pflanzensoziologie, New York: McGraw-Hill.

    Google Scholar 

  • COHEN, J. (1960), “A Coefficient of Agreement for Nominal Scales,” Educational and Psychological Measurement, 20, 37–46.

    Article  Google Scholar 

  • DICE, L.R. (1945), “Measures of the Amount of Ecologic Association Between Species,” Ecology, 26, 297–302.

    Article  Google Scholar 

  • FLEISS, J.L. (1975), “Measuring Agreement between Two Judges on the Presence or Absence of a Trait,” Biometrics, 31, 651–659.

    Article  MathSciNet  Google Scholar 

  • GOODMAN, L.A. and KRUSKAL, W.H. (1954), “Measures of Association for Cross Classifications,” Journal of the American Statistical Association, 49, 732–764.

    Article  MATH  Google Scholar 

  • GOWER, J.C. and LEGENDRE, P. (1986), “Metric and Euclidean Properties of Dissimilarity Coefficients,” Journal of Classification, 3, 5–48.

    Article  MATH  MathSciNet  Google Scholar 

  • HAMANN, U. (1961), “Merkmalsbestand und Verwandtschaftsbeziehungen der Farinose. Ein Betrag zum System der Monokotyledonen,” Willdenowia, 2, 639–768.

    Google Scholar 

  • HAWKINS, R.P. and DOTSON, V.A. (1968), “Reliability Scores That Delude: An Alice in Wonderland Trip Through Misleading Characteristics of Interobserver Agreement Scores in Interval Recording”, in Behavior Analysis: Areas of Research and Application, eds. E. Ramp and G. Semb, Englewood Cliffs, N. J.: Prentice-Hall.

    Google Scholar 

  • JACCARD, P. (1912), “The Distribution of the Flora in the Alpine Zone,” The New Phytologist, 11, 37–50.

    Article  Google Scholar 

  • KULCZYŃSKI, S. (1927), “Die Pflanzenassociationen der Pienenen,” Bulletin International de L’Académie Polonaise des Sciences et des Letters, classe des sciences mathematiques et naturelles, Serie B, Suppl´ement II, 2, 57–203.

    Google Scholar 

  • LOEVINGER, J.A. (1948), “The Technique of Homogeneous Tests Compared with Some Aspects of Scale Analysis and Factor Analysis,” Psychological Bulletin, 45, 507–530.

    Article  Google Scholar 

  • MAXWELL, A.E. and PILLINER, A. E. G. (1968), “Deriving Coefficients of Reliability and Agreement for Ratings,” British Journal of Mathematical and Statistical Psychology, 21, 105–116.

    Google Scholar 

  • MCCONNAUGHEY, B.H. (1964), “The Determination and Analysis of Plankton Communities,” Marine Research, Special No., Indonesia, 1–40.

  • MICHAEL, E.L. (1920), “Marine Ecology and the Coefficient of Association: A Plea in Behalf of Quantitative Biology,” The Journal of Ecology, 8, 54–59.

    Article  Google Scholar 

  • OCHIAI, A. (1957), “Zoogeographic Studies on the Soleoid Fishes Found in Japan and Its Neighboring Regions,” Bulletin of the Japanese Society for Fish Science, 22, 526–530.

    Google Scholar 

  • ROGERS, D.J. and TANIMOTO, T.T. (1960), “A Computer Program for Classifying Plants,” Science, 132, 1115–1118.

    Article  Google Scholar 

  • RUSSEL, P.F. and RAO, T.R. (1940), “On Habitat and Association of Species of Anopheline Larvae in South-Eastern Madras,” Journal of Malaria Institute India, 3, 153–178.

    Google Scholar 

  • SCOTT,W.A. (1955), “Reliability of Content Analysis: The Case of Nominal Scale Coding,” Public Opinion Quarterly, 19, 321–325.

    Article  Google Scholar 

  • SIMPSON, G.G. (1943), “Mammals and the Nature of Continents,” American Journal of Science, 241, 1–31.

    Google Scholar 

  • SOKAL, R.R. and MICHENER, C.D. (1958), “A Statistical Method for Evaluating Systematic Relationships,” University of Kansas Science Bulletin, 38, 1409–1438.

    Google Scholar 

  • SOKAL, R.R. and SNEATH, R.H. (1963), Principles of Numerical Taxonomy, San Francisco: W. H. Freeman and Company.

    Google Scholar 

  • SØRENSON, T. (1948), “A Method of Stabilizing Groups of Equivalent Amplitude in Plant Sociology Based on the Similarity of Species Content and Its Application to Analyses of the Vegetation on Danish Commons,” Kongelige Danske Videnskabernes Selskab Biologiske Skrifter, 5, 1–34.

    Google Scholar 

  • SORGENFREI, T. (1958), Molluscan Assemblages from the Marine MiddleMiocene of South Jutland and Their Environments, Copenhagen: Reitzel.

    Google Scholar 

  • YULE, G.U. (1900), “On the Association of Attributes in Statistics,” Philosophical Transactions of the Royal Society of London, 194, 257–319.

    Article  Google Scholar 

  • YULE, G.U. (1912), “On the Methods of Measuring the Association between Two Attributes,” Journal of the Royal Statistical Society, 75, 579–652.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Matthijs J. Warrens.

Additional information

The author would like to thank three anonymous reviewers for their helpful comments and valuable suggestions on earlier versions of this article.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Warrens, M.J. On the Indeterminacy of Resemblance Measures for Binary (Presence/Absence) Data. J Classif 25, 125–136 (2008). https://doi.org/10.1007/s00357-008-9006-8

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00357-008-9006-8

Keywords

Navigation