Abstract
Many similarity coefficients for binary data are defined as fractions. For certain resemblance measures the denominator can be zero, in which case the value of the coefficient is indeterminate. It is shown that the severity of this indeterminacy problem differs across resemblance measures. Following Batagelj and Bren (1995), we remove the indeterminacies by defining appropriate values for the critical cases.
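As a rough illustration of the problem described in the abstract, the sketch below computes two well-known coefficients from a pair of 0/1 vectors. It assumes the usual 2×2 notation, which the abstract does not spell out: a counts attributes present in both objects, b and c count mismatches, and d counts attributes absent from both. The function names and the `if_undefined` parameter are hypothetical. The fallback value is left to the caller and is not taken from the article or from Batagelj and Bren (1995).

```python
from typing import Optional, Sequence, Tuple


def counts(x: Sequence[int], y: Sequence[int]) -> Tuple[int, int, int, int]:
    """Return (a, b, c, d) for two equal-length 0/1 vectors (assumed notation)."""
    if len(x) != len(y) or len(x) == 0:
        raise ValueError("vectors must be non-empty and of equal length")
    a = sum(1 for xi, yi in zip(x, y) if xi == 1 and yi == 1)  # both present
    b = sum(1 for xi, yi in zip(x, y) if xi == 1 and yi == 0)  # only in x
    c = sum(1 for xi, yi in zip(x, y) if xi == 0 and yi == 1)  # only in y
    d = sum(1 for xi, yi in zip(x, y) if xi == 0 and yi == 0)  # both absent
    return a, b, c, d


def jaccard(x: Sequence[int], y: Sequence[int],
            if_undefined: Optional[float] = None) -> Optional[float]:
    """Jaccard coefficient a / (a + b + c).

    The denominator is zero when neither object has any attribute.
    In that case the caller-supplied value is returned (an assumption
    of this sketch, not a value prescribed by the article).
    """
    a, b, c, _ = counts(x, y)
    denominator = a + b + c
    if denominator == 0:
        return if_undefined
    return a / denominator


def simple_matching(x: Sequence[int], y: Sequence[int]) -> float:
    """Simple matching coefficient (a + d) / (a + b + c + d).

    The denominator equals the vector length, so it is never zero
    for non-empty input and no critical case arises.
    """
    a, b, c, d = counts(x, y)
    return (a + d) / (a + b + c + d)
```

For two all-zero vectors, `jaccard` has a zero denominator and returns whatever the caller supplied, while `simple_matching` is always defined. This shows one way in which the severity of the problem can differ between measures.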
References
BARONI-URBANI, C. and BUSER, M.W. (1976), “Similarity of Binary Data,” Systematic Zoology, 25, 251–259.
BATAGELJ, V. and BREN, M. (1995), “Comparing Resemblance Measures,” Journal of Classification, 12, 73–90.
BAULIEU, F.B. (1989), “A Classification of Presence/Absence Based Dissimilarity Coefficients,” Journal of Classification, 6, 233–246.
BRAUN-BLANQUET, J. (1932), Plant Sociology: The Study of Plant Communities, Authorized English translation of Pflanzensoziologie, New York: McGraw-Hill.
COHEN, J. (1960), “A Coefficient of Agreement for Nominal Scales,” Educational and Psychological Measurement, 20, 37–46.
DICE, L.R. (1945), “Measures of the Amount of Ecologic Association Between Species,” Ecology, 26, 297–302.
FLEISS, J.L. (1975), “Measuring Agreement between Two Judges on the Presence or Absence of a Trait,” Biometrics, 31, 651–659.
GOODMAN, L.A. and KRUSKAL, W.H. (1954), “Measures of Association for Cross Classifications,” Journal of the American Statistical Association, 49, 732–764.
GOWER, J.C. and LEGENDRE, P. (1986), “Metric and Euclidean Properties of Dissimilarity Coefficients,” Journal of Classification, 3, 5–48.
HAMANN, U. (1961), “Merkmalsbestand und Verwandtschaftsbeziehungen der Farinosae. Ein Beitrag zum System der Monokotyledonen,” Willdenowia, 2, 639–768.
HAWKINS, R.P. and DOTSON, V.A. (1968), “Reliability Scores That Delude: An Alice in Wonderland Trip Through Misleading Characteristics of Interobserver Agreement Scores in Interval Recording,” in Behavior Analysis: Areas of Research and Application, eds. E. Ramp and G. Semb, Englewood Cliffs, NJ: Prentice-Hall.
JACCARD, P. (1912), “The Distribution of the Flora in the Alpine Zone,” The New Phytologist, 11, 37–50.
KULCZYŃSKI, S. (1927), “Die Pflanzenassoziationen der Pieninen,” Bulletin International de l’Académie Polonaise des Sciences et des Lettres, Classe des Sciences Mathématiques et Naturelles, Série B, Supplément II, 2, 57–203.
LOEVINGER, J.A. (1948), “The Technique of Homogeneous Tests Compared with Some Aspects of Scale Analysis and Factor Analysis,” Psychological Bulletin, 45, 507–530.
MAXWELL, A.E. and PILLINER, A.E.G. (1968), “Deriving Coefficients of Reliability and Agreement for Ratings,” British Journal of Mathematical and Statistical Psychology, 21, 105–116.
MCCONNAUGHEY, B.H. (1964), “The Determination and Analysis of Plankton Communities,” Marine Research, Special No., Indonesia, 1–40.
MICHAEL, E.L. (1920), “Marine Ecology and the Coefficient of Association: A Plea in Behalf of Quantitative Biology,” The Journal of Ecology, 8, 54–59.
OCHIAI, A. (1957), “Zoogeographic Studies on the Soleoid Fishes Found in Japan and Its Neighboring Regions,” Bulletin of the Japanese Society for Fish Science, 22, 526–530.
ROGERS, D.J. and TANIMOTO, T.T. (1960), “A Computer Program for Classifying Plants,” Science, 132, 1115–1118.
RUSSELL, P.F. and RAO, T.R. (1940), “On Habitat and Association of Species of Anopheline Larvae in South-Eastern Madras,” Journal of the Malaria Institute of India, 3, 153–178.
SCOTT, W.A. (1955), “Reliability of Content Analysis: The Case of Nominal Scale Coding,” Public Opinion Quarterly, 19, 321–325.
SIMPSON, G.G. (1943), “Mammals and the Nature of Continents,” American Journal of Science, 241, 1–31.
SOKAL, R.R. and MICHENER, C.D. (1958), “A Statistical Method for Evaluating Systematic Relationships,” University of Kansas Science Bulletin, 38, 1409–1438.
SOKAL, R.R. and SNEATH, P.H.A. (1963), Principles of Numerical Taxonomy, San Francisco: W. H. Freeman and Company.
SØRENSEN, T. (1948), “A Method of Establishing Groups of Equal Amplitude in Plant Sociology Based on Similarity of Species Content and Its Application to Analyses of the Vegetation on Danish Commons,” Kongelige Danske Videnskabernes Selskab Biologiske Skrifter, 5, 1–34.
SORGENFREI, T. (1958), Molluscan Assemblages from the Marine Middle Miocene of South Jutland and Their Environments, Copenhagen: Reitzel.
YULE, G.U. (1900), “On the Association of Attributes in Statistics,” Philosophical Transactions of the Royal Society of London, 194, 257–319.
YULE, G.U. (1912), “On the Methods of Measuring the Association between Two Attributes,” Journal of the Royal Statistical Society, 75, 579–652.
Additional information
The author would like to thank three anonymous reviewers for their helpful comments and valuable suggestions on earlier versions of this article.
About this article
Cite this article
Warrens, M.J. On the Indeterminacy of Resemblance Measures for Binary (Presence/Absence) Data. J Classif 25, 125–136 (2008). https://doi.org/10.1007/s00357-008-9006-8
DOI: https://doi.org/10.1007/s00357-008-9006-8