Summary
We investigate the extent to which it may be possible to carry out a regression analysis using graphics alone, an idea that we refer to asgraphical regression. The limitations of this idea are explored. It is shown that graphical regression is theoretically possible with essentially no constraints on the conditional distribution of the response given the predictors, but with some conditions on marginal distribution of the predictors. Dimension reduction subspaces and added variable plots play a central role in the development. The possibility of useful methodology is explored through two examples.
Similar content being viewed by others
References
Becker, R. A. and Cleveland, W. S. (1987). Brushing Scatterplots.Technometrics 29, 127–142.
Becker, R. A., Cleveland, W. S. and Wilks, A. R. (1987). Dynamic graphics for data analysis.Statist. Sci. 2, 355–395 (with discussion).
Becker, R. A., Cleveland, W. S. and Weil (1988). The use of brushing and rotation for data analysis.Dynamic Graphics for Statistics (W. S. Cleveland and M. E. McGill eds.), Belmont, CA: Wadsworth, 247–276.
Chambers, J. M., Cleveland, W. S., Kleiner, B. and Tukey, P. (1983).Graphical Methods for Data Analysis. Boston: Duxbury Press.
Chambers, J. M. and Hastie, T. J. (1992).Statistical Models in S. Pacific Grove, CA: Wadsworth & Brooks/Cole.
Cleveland, W. S. and McGill, M. E. (1984). The many faces of a scatterplot.J. Amer. Statist. Assoc. 79, 807–822.
Cleveland, W. S. (1987). Research in Statistical Graphics.J. Amer. Statist. Assoc. 82, 419–423.
Cleveland, W. S. and McGill, M. E. (1988).Dynamic Graphics for Statistics. Belmont: Wadsworth.
Cook, R. D. (1986). Assessment of local influence.J. Roy. Statist. Soc. B, 133–155, (with discussion).
Cook, R. D. (1987). Software review of MacSpin.Amer. Statist. 41, 233–236.
Cook, R. D. (1992). Graphical regression. (Dodge, Y. and Whittaker, J. W., eds.)Computational Statistics 1, Heidlberg: Physica-Verlag, 11–22.
Cook, R. D. (1994). On the interpretation of regression plots.J. Amer. Statist. Assoc. (to appear).
Cook, R. D. and Nachtsheim, C. J. (1992). Re-weighting to achieve elliptically contoured covariates in regression.Tech. Rep. University of Minnesota.
Cook, R. D. and Weisberg S. (1982).Residuals and Influence in Regression. London: Chapman and Hall.
Cook, R. D. and Weisberg, S. (1989). Regression diagnostics with dynamic graphics.Technometrics 31, 277–312 (with discussion).
Cook, R. D. and Weisberg, S. (1990). Three dimensional residual plots, (Berk, K. and Malone, L., eds.),Proceedings of the 21st Symposium on the Interface: Computing Science and Statistics. Washington: American Statistical Association, 1990, 162–166.
Cox, D. R. (1978). Some remarks on the role in statistics of graphical methods.Ann. Statist 27, 4–9.
Cox, D. R. and Snell, E. J. (1968). A general definition of residuals.J. Roy. Statist. Soc. B 30, 248–275.
Dawid, A. P. (1979). Conditional independence in statistical theory.J. Roy. Statist. Soc. B 41, 1–31, (with discussion).
Eaton, M. L. (1986). A characterization of spherical distributions.Multivariate Analysis 20, 272–276.
Fisherkeller, M. A., Friedman, J. H. and Tukey, J. W. (1974). PRIM-9; An interactive multidimensional data display and analysis system.Dynamic Graphics for Statistics, (1988), (W. S. Cleveland and M. E. McGill, eds.), Belmont, CA: Wadsworth, 91–110.
Fowlkes, E. B. (1987). Some diagnostics for binary logistic regression via smoothing.Biometrika 74, 503–515.
Hastie, T. J. and Tibshirani (1990).Generalized Additive Models. London: Chapman and Hall.
Huber, P. (1985). Projection pursuit.Ann. Statist. 13, 435–525, (with discussion).
Huber, P. (1987). Experiences with three-dimensional scatterplots.J. Amer. Statist. Assoc. 82, 448–453.
Li, K. C. (1991). Sliced inverse regression for dimension reduction.J. Amer. Statist. Assoc. 86, 316–342 (with discussion).
Li, K. C. (1992). On principal Hessian directions for data visualization and dimension reduction: another application of Stein's lemma.J. Amer. Statist. Assoc. 87, 1025–1040.
Li, K. C. and Duan, N. (1989). Regression analysis under link violation.Ann. Statist. 17, 1009–1052.
Tierney, L. (1990).LISP-STAT Chichester: Wiley.
Young, F. W., Kent, D. P. and Kuhfeld, W. F. (1988). Dynamic graphics for exploring multivariate data.Dynamic Graphics for Statistics, (W. S. Cleveland and M. E. McGill, eds.) Belmont, CA: Wadsworth, 391–424.
Additional references in the discussion
Barron, A. R. and Xiao, X. (1991). Discussion of “Multivariate adaptive regression splines, with discussions”.Ann. Statist. 19, 67–82.
Efron, B. (1982).The Jackknife, the Bootstrap, and Other Resampling Plans, CMBS 38, Philadelphia: SIAM-NSF.
Friedman, J. H. (1987). Exploratory projection pursuit.J. Amer. Statist. Assoc. 82, 249–266.
Friedman, J. H. (1991). Multivariate adaptive regression splines.Ann. Statist. 19, 1–50.
Friedman, J. H. and Stuetzle, W. (1981). Projection pursuit regression.J. Amer. Statist. Assoc. 76, 817–823.
Li, K. C. (1985). From Stein's unbiased risk estimates to the method of generalized cross validation.Ann. Statist. 131, 352–377.
Li, K. C. (1986). Asymptotic optimality ofC L and generalized cross validation in ridge regression with application to spline smoothing.Ann. Statist. 141, 101–112.
Li, K. C. (1987). Asymptotic optimality forC p, C L , cross-validation and generalized cross-validation; discrete index set.Ann. Statist. 15, 958–975.
Li, K. C. (1989). Honest confidence regions for nonparametric regression.Ann. Statist. 17, 1001–1008.
Li, K. C. (1993). Quasi-helices in high dimensional regression.Tech. Rep. UCLA.
McDonald, J. A. and Pedersen, J. (1985). Computing environments for data analysis part I: introduction.SIAM Journal of Scientific and Statistical Computing 6, 1004–1012.
Miller, A. J. (1990).Subset Selection in Regression. London: Chapman & Hall.
Morton, S. C. (1989). Interpretable projection pursuit.Tech. Rep. 106, Stanford University.
Rice, John A. (1988).Mathematical Statistics and Data Analysis. Pacific Drove, CA: Wadsworth.
Wong, W. and Shen, X. (1991). Projection pursuit without additivity constraints.Tech. Rep.312, The University of Chicago.
Author information
Authors and Affiliations
Additional information
Read before the Spanish Statistical Society at a meeting organized by the Carlos III University of Madrid on Wednesday, April 21, 1993
Rights and permissions
About this article
Cite this article
Cook, R.D., Wetzel, N., Bermúdez, J.D. et al. Exploring regression structure with graphics. Test 2, 33–100 (1993). https://doi.org/10.1007/BF02562669
Received:
Revised:
Issue Date:
DOI: https://doi.org/10.1007/BF02562669