Vine copula regression for observational studies
If explanatory variables and a response variable of interest are simultaneously observed, then fitting a joint multivariate density to all variables enables prediction via conditional distributions. Regular vines or vine copulas with arbitrary univariate margins provide a rich and flexible class of multivariate densities for Gaussian or non-Gaussian dependence structures. The density enables calculation of all regression functions for any subset of variables conditional on any disjoint set of variables, thereby avoiding issues of transformations, heteroscedasticity, interactions, and higher-order terms. Only the question of finding an adequate vine copula remains. Heteroscedastic prediction inferences based on vine copulas are illustrated with two data sets, including one from the National Longitudinal Study of Youth relating breastfeeding to IQ. Some commonly used methods based on linear and quadratic regression equations are shown to lead to undesirable inferences.
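The idea of predicting from a conditional distribution can be sketched in the simplest special case: a bivariate Gaussian copula with one non-Gaussian margin. This is not the vine copula procedure of the paper. The function names, the standard normal margin for the explanatory variable, the unit-rate exponential margin for the response, and the correlation value 0.7 are all assumptions chosen for illustration. The sketch shows that conditional quantiles of the response follow directly from the joint model, and that the width of the prediction interval changes with the explanatory variable without any explicit variance model.

```python
import math
from statistics import NormalDist

STD_NORMAL = NormalDist()  # standard normal, used for the copula scale


def exp_quantile(u, rate=1.0):
    """Quantile function of an exponential margin (assumed response margin)."""
    return -math.log1p(-u) / rate


def conditional_quantile(x, p, rho, x_dist=STD_NORMAL, y_quantile=exp_quantile):
    """p-th quantile of Y given X = x under a bivariate Gaussian copula.

    On the normal-score scale, Z_y | Z_x = z is N(rho * z, 1 - rho**2),
    so the conditional quantile is obtained by mapping back through the
    response margin.
    """
    z_x = STD_NORMAL.inv_cdf(x_dist.cdf(x))            # normal score of x
    z_y = rho * z_x + math.sqrt(1.0 - rho ** 2) * STD_NORMAL.inv_cdf(p)
    return y_quantile(STD_NORMAL.cdf(z_y))             # back to the Y scale


def prediction_interval(x, rho=0.7, level=0.5):
    """Central prediction interval for Y given X = x (hypothetical helper)."""
    tail = (1.0 - level) / 2.0
    return (conditional_quantile(x, tail, rho),
            conditional_quantile(x, 1.0 - tail, rho))


if __name__ == "__main__":
    for x in (-1.0, 0.0, 1.0):
        lo, hi = prediction_interval(x)
        median = conditional_quantile(x, 0.5, 0.7)
        print(f"x={x:+.1f}  median={median:.3f}  50% interval=({lo:.3f}, {hi:.3f})")
```

With positive dependence, both the conditional median and the interval width increase with x in this sketch, which is the kind of heteroscedastic prediction that a fixed-variance linear regression cannot express. A vine copula extends the same construction to several explanatory variables by chaining bivariate conditional distributions.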
Keywords: Regular vine, Gaussian copula, Heteroscedasticity, National Longitudinal Study of Youth, Breastfeeding, IQ, Pitfalls of regression inference
Harry Joe is supported by NSERC Discovery Grant 8698. Roger Cooke acknowledges financial support from the Bill and Melinda Gates Foundation through the University of Virginia for related work on the NLSY data. Thanks to the referees for their constructive comments leading to an improved presentation.