Abstract
We use a simple example to show that Pearson’s correlation matrix R can underestimate the true dependence between two variables when nonlinearities are present by as much as 83%, compared to the newer and easy to compute \(R^*\) in Vinod (Commun Statist Simul Comput 46(6):4513–4534, 2017, https://doi.org/10.1080/03610918.2015.1122048). We include intuitive expository discussion of nonparametric kernel methods needed by \(R^*\) with graphs and examples. We illustrate how partial correlation coefficients based on R can underestimate the nonlinear effect of a confounding variable, compared to those from the newer \(R^*\). This paper develops an entirely new generalization of Hotelling’s canonical correlations based on nonlinear nonparametric pairwise dependencies of \(R^*\). An example illustrates how traditional methods can underestimate the joint dependence by 266%.
Similar content being viewed by others
Change history
20 March 2022
A Correction to this paper has been published: https://doi.org/10.1007/s10614-022-10236-8
References
Allen, D. E., & Hooper, V. (2018). Generalized correlation measures of causality and forecasts of the VIX using non-linear models. Sustainability, 10(8), 1–15, 2695. https://doi.org/10.3390/su10082695.
Garcia-Medina, A., & Gonzalez, F. G. (2020). Transfer entropy as a variable selection methodology of cryptocurrencies in the framework of a high dimensional predictive model. PLoS ONE, 15(1), e0227269. https://doi.org/10.1371/journal.pone.0227269.
Harrell, F. E., et al. (2010). Hmisc: Harrell Miscellaneous. http://CRAN.R-project.org/package=Hmisc. R package version 3.8-2
Hayfield, T., & Racine, J. S. (2008). Nonparametric econometrics: The np package. Journal of Statistical Software, 27(5), 1–32.
Hotelling, H. (1936). Relations between two sets of variables. Biometrika, 28, 321–327.
Kendall, M., & Stuart, A. (1977). The advanced theory of statistics (4th ed., Vol. 1). New York: Macmillan Publishing Co.
Li, Q., & Racine, J. S. (2007). Nonparametric econometrics. Princeton University Press.
Lister, B. C., & Garcia, A. (2018). Climate-driven declines in arthropod abundance restructure a rainforest food web. Proceedings of the National Academy of Sciences, 15, 1–10.
Raveh, A. (1985). On the use of the inverse of the correlation matrix in multivariate data analysis. The American Statistician, 39(1), 39–42.
Vinod, H. D. (1976). Canonical ridge and econometrics of joint production. Journal of Econometrics, 4, 147–166.
Vinod, H. D. (2017). Generalized correlation and kernel causality with applications in development economics. Communications in Statistics-Simulation and Computation, 46(6), 4513–4534. https://doi.org/10.1080/03610918.2015.1122048.
Vinod, H. D. (2018). generalCorr: Generalized Correlations and Initial Causal Path (2016). https://CRAN.R-project.org/package=generalCorr. Fordham University, New York, R package version 1.1.2, has 3 vignettes
Vinod, H. D. (2019). New exogeneity tests and causal paths. In H. D. Vinod & C. R. Rao (Eds.), Handbook of Statistics: Conceptual Econometrics Using R, chap. 2 (Vol. 41, pp. 33–64). Elsevier. https://doi.org/10.1016/bs.host.2018.11.011.
Zheng, S., Shi, N. Z., & Zhang, Z. (2012). Generalized measures of correlation for asymmetry, nonlinearity, and beyond. Journal of the American Statistical Association, 107(499), 1239–1252.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
In the original online version of this article, equations 27 to 30 were updated.
Rights and permissions
About this article
Cite this article
Vinod, H.D. Generalized, Partial and Canonical Correlation Coefficients. Comput Econ 60, 1479–1506 (2022). https://doi.org/10.1007/s10614-021-10190-x
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10614-021-10190-x