Abstract
Correlation is one of the most common statistical tools and one of the most basic methods for discovery from data. Over the past century, correlation has led to an extremely high number of discoveries in virtually all fields of science and is currently one of the basic building blocks that enable scientific research. In most cases, correlation is detected between numerical variables using tools such as Pearson or Spearman correlation. This paper describes a method that can identify rank correlation between two sequences of images. That is, instead of identifying a rank correlation between two numerical variables, the method identifies rank correlations between two variables such that each variable is an image. Experimental results show that the method is able to detect rank correlations between sequences of synthetic images, as well as sequences of natural images. The method can provide useful information for profiling visual data, and example discoveries that can be made with the method are discussed. Given the prevalence of correlation in data-driven discovery, combined with the increasing size and availability of databases of visual data, identifying rank correlations in image data can provide useful insights and new knowledge from the data. The method is nonparametric and can be applied to various types of image data.
Similar content being viewed by others
References
Pearson, K.: Notes on the history of correlation. Biometrika 13(1), 25–45 (1920)
Stigler, S.M.: Francis galton’s account of the invention of correlation. Statis. Sci., 73–79 (1989)
Speed, T.: A correlation for the 21st century. Science 334(6062), 1502–1503 (2011)
Swinscow, T.D.V., Campbell, M.J., et al.: Statistics at Square One. BMJ London, London, UK (2002)
Hinkle, D.E., Wiersma, W., Jurs, S.G., et al.: Applied statistics for the behavioral sciences (1988)
Bolboaca, S.-D., Jäntschi, L.: Pearson versus Spearman, Kendall’s tau correlation analysis on structure-activity relationships of biologic active compounds. Leonardo J. Sci. 5(9), 179–200 (2006)
Mukaka, M.M.: A guide to appropriate use of correlation coefficient in medical research. Malawi Med. J. 24(3), 69–71 (2012)
Zhou, S.K., Georgescu, B., Zhou, X.S., Comaniciu, D.: Image based regression using boosting method. In: Tenth IEEE International Conference on Computer Vision, vol. 1, pp. 541–548 (2005). IEEE
Guo, G., Fu, Y., Dyer, C.R., Huang, T.S.: Image-based human age estimation by manifold learning and locally adjusted robust regression. IEEE Trans. Image Process. 17(7), 1178–1188 (2008)
LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)
Yan, L.C., Yoshua, B., Geoffrey, H.: Deep learning. Nature 521(7553), 436–444 (2015)
Hu, W., Huang, Y., Wei, L., Zhang, F., Li, H.: Deep convolutional neural networks for hyperspectral image classification. J. Sens. 2015, 1–12 (2015)
Yuan, Z.-W., Zhang, J.: Feature extraction and image retrieval based on alexnet. In: Eighth International Conference on Digital Image Processing (ICDIP 2016), vol. 10033, pp. 65–69 (2016). SPIE
Haber, E., Ruthotto, L.: Stable architectures for deep neural networks. Inverse Probl. 34(1), 014004 (2017)
Sharma, N., Jain, V., Mishra, A.: An analysis of convolutional neural networks for image classification. Proc. Comput. Sci. 132, 377–384 (2018)
Cichy, R.M., Kaiser, D.: Deep neural networks as scientific models. Trends Cogn. Sci. 23(4), 305–317 (2019)
Bau, D., Zhu, J.-Y., Strobelt, H., Lapedriza, A., Zhou, B., Torralba, A.: Understanding the role of individual units in a deep neural network. Proc. Natl. Acad. Sci. 117(48), 30071–30078 (2020)
Ju, Z., Xue, Y.: Fish species recognition using an improved alexnet model. Optik 223, 165499 (2020)
Shamir, L., Orlov, N., Eckley, D.M., Macura, T., Johnston, J., Goldberg, I.G.: Wndchrm-an open source utility for biological image analysis. Source Code Biol. Med. 3(1), 13 (2008)
Orlov, N., Shamir, L., Macura, T., Johnston, J., Eckley, D.M., Goldberg, I.G.: Wnd-charm: multi-purpose image classification using compound image transforms. Pattern Recogn. Lett. 29(11), 1684–1693 (2008)
Shamir, L.: Evaluation of face datasets as tools for assessing the performance of face recognition methods. Int. J. Comput. Vision 79(3), 225 (2008)
George, J., Shamir, L.: Computer analysis of similarities between albums in popular music. Pattern Recogn. Lett. 45, 78–84 (2014)
Shamir, L.: Morphology-based query for galaxy image databases. Public. Astron. Soc. Pacific 129(972), 024003 (2016)
Shamir, L., Orlov, N., Goldberg, I.G.: Evaluation of the informativeness of multi-order image transforms. In: IPCV, pp. 37–42 (2009)
Shamir, L.: A computer analysis method for correlating knee x-rays with continuous indicators. Int. J. Comput. Assist. Radiol. Surg. 6(5), 699–704 (2011)
Guyon, I., Gunn, S., Nikravesh, M., Zadeh, L.A.: Feature Extraction: Foundations and Applications vol. 207. Springer, (2008)
Shamir, L., Wolkow, C.A., Goldberg, I.G.: Quantitative measurement of aging using image texture entropy. Bioinformatics 25(23), 3060–3063 (2009)
Kellgren, J.H., Lawrence, J.: Radiological assessment of osteo-arthrosis. Ann. Rheum. Dis. 16(4), 494 (1957)
Kohn, M.D., Sassoon, A.A., Fernando, N.D.: Classifications in brief: Kellgren-lawrence classification of osteoarthritis. Clin. Orthopaed. Related Res. 474, 1886–1893 (2016)
Fisher, R.: On the probable error of a coefficient of correlation deduced from a small sample. Metron 1(4), 3–32 (1921)
Aspden, R.M., Scheven, B.A., Hutchison, J.D.: Osteoarthritis as a systemic disorder including stromal cell differentiation and lipid metabolism. The Lancet 357(9262), 1118–1120 (2001)
Visser, A., De Mutsert, R., Le Cessie, S., Den Heijer, M., Rosendaal, F., Kloppenburg, M.: The relative contribution of mechanical stress and systemic processes in different types of osteoarthritis: the neo study. Ann. Rheum. Dis. 74(10), 1842–1847 (2015)
Acknowledgements
I would like to thank the two knowledgeable reviewers for the insightful comments. This work is supported in part by NSF grants AST-1903823 and IIS-1546079.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by Qiang Zhu.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Shamir, L. Automatic identification of rank correlation between image sequences. Int J Data Sci Anal 17, 1–11 (2024). https://doi.org/10.1007/s41060-023-00450-4
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s41060-023-00450-4