Abstract
We consider the problem of estimating the accuracy of quantitative similarity coefficients. For this purpose, we introduce a new concept of the similarity measure for the corresponding coefficient. We show that only frequency forms of quantitative similarity coefficients represent consistent estimates of their similarity measures. We obtain asymptotic confidence intervals for the Ružička and Bray-Curtis similarity measures based on the coefficients with the same names. We also propose a test for homogeneity of two populations based on the above-mentioned coefficients.
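As an illustration of the coefficients named in the abstract, the following minimal sketch computes the Ružička and Bray-Curtis similarity coefficients from two abundance vectors after converting them to relative frequencies. It uses the commonly cited textbook definitions (sum of minima over sum of maxima, and twice the sum of minima over the sum of totals); whether these coincide exactly with the frequency forms studied in the paper is an assumption, and the function names and sample counts are hypothetical.

```python
from typing import List, Sequence


def to_frequencies(counts: Sequence[float]) -> List[float]:
    """Convert raw abundances to relative frequencies that sum to 1."""
    total = sum(counts)
    if total <= 0:
        raise ValueError("counts must have a positive total")
    return [c / total for c in counts]


def ruzicka(x: Sequence[float], y: Sequence[float]) -> float:
    """Ruzicka coefficient: sum of minima divided by sum of maxima."""
    if len(x) != len(y):
        raise ValueError("vectors must have the same length")
    numerator = sum(min(a, b) for a, b in zip(x, y))
    denominator = sum(max(a, b) for a, b in zip(x, y))
    return numerator / denominator


def bray_curtis(x: Sequence[float], y: Sequence[float]) -> float:
    """Bray-Curtis similarity: twice the sum of minima over the sum of totals."""
    if len(x) != len(y):
        raise ValueError("vectors must have the same length")
    numerator = 2.0 * sum(min(a, b) for a, b in zip(x, y))
    denominator = sum(x) + sum(y)
    return numerator / denominator


# Hypothetical species counts for two samples (illustrative only).
sample_a = [2, 3, 5]
sample_b = [5, 3, 2]

p = to_frequencies(sample_a)
q = to_frequencies(sample_b)

r = ruzicka(p, q)
bc = bray_curtis(p, q)
print(f"Ruzicka: {r:.4f}, Bray-Curtis: {bc:.4f}")
```

Under these definitions the two coefficients are linked by R = BC / (2 - BC), so either one can be recovered from the other.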
Acknowledgments
The work of I.V. Rodionov in Sections 1 and 2 was supported by the Russian Science Foundation, project no. 19-11-00290 provided by the Steklov Mathematical Institute of the RAS.
Additional information
This paper was recommended for publication by E.Ya. Rubinovich, a member of the Editorial Board.
Russian Text © The Author(s), 2020, published in Avtomatika i Telemekhanika, 2020, No. 2, pp. 157–172.
About this article
Cite this article
Rodionov, I.V., Sozontov, A.N. On Confidence Estimation Based on Quantitative Similarity Coefficients. Autom Remote Control 81, 320–332 (2020). https://doi.org/10.1134/S0005117920020101
DOI: https://doi.org/10.1134/S0005117920020101