Abstract
Regression analysis is viewed as a search through model space using data analytic functions. The desired models should satisfy several requirements, unimportant variables should be excluded, outliers identified, etc. The methods of regression data analysis such as variable selection, transformation and outlier detection, that address these concerns are characterized as functions acting on regression models and returning regression models. A model that is unchanged by the application of any of these methods is considered acceptable. A method for the generation of all acceptable models supported by all possible orderings of the choice of regression data analysis methods is described with a view to determining if two statisticians may reasonably hold differing views on the same data. The consideration of all possible orders of analysis generates a directed graph in which the vertices are regression models and the arcs are data-analytic methods. The structure of the graph is of statistical interest. The ideas are demonstrated using a LISP-based analysis package. The methods described are not intended for the entirely automatic analysis of data, rather to assist the statistician in examining regression data at a strategic level.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Adams J. (1990) American Statistical Association Proceedings of the Statistical Computing Section 55–62
Andrews D. & Herzberg A (1985) “Data: a collection of problems from many fields for the student and research worker” New York, Springer-Verlag.
Brownstone D. (1988) “Regression strategies” Proceedings on the 20th Symposium on the Interface, Ed. Wegman E. et al. 74–79
Daniel C. & Wood F. (1980) “Fitting Equations to Data, 2nd Ed.” New York, John Wiley.
Faraway J. (1992) “On the Cost of Data Analysis” Journal of Computational and Graphical Statistics 1 215–231
Friedman J. (1991) “Multivariate Adaptive Regression Splines” Annals of Statistics 1–141
Gale W. (Editor) (1986) “Artificial intelligence and statistics” Addison-Wesley; Reading Mass.
Lubinsky D. & Pregibon D. (1987) “Data Analysis as Search” in “Interactions in Artificial Intelligence and Statistical Methods” edited by Phelps B. Gower Technical Press, Aldershot, Hants
Mosteller F. & Tukey J. (1977) “Data Analysis and Regression” Addison-Wesley, Reading Mass.
Phelps B. (Editor) (1987) “Interactions in Artificial Intelligence and Statistical Methods” Gower Technical Press, Aldershot, Hants
Tierney L. (1990) “Lisp-Stat: An object-oriented environment for statistical computing and dynamic graphics.” Wiley, New York
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1994 Springer-Verlag New York, Inc.
About this paper
Cite this paper
Faraway, J.J. (1994). Choice of Order in Regression Strategy. In: Cheeseman, P., Oldford, R.W. (eds) Selecting Models from Data. Lecture Notes in Statistics, vol 89. Springer, New York, NY. https://doi.org/10.1007/978-1-4612-2660-4_41
Download citation
DOI: https://doi.org/10.1007/978-1-4612-2660-4_41
Publisher Name: Springer, New York, NY
Print ISBN: 978-0-387-94281-0
Online ISBN: 978-1-4612-2660-4
eBook Packages: Springer Book Archive