Abstract
In this article, we establish a method of selecting the best regression equation on the basis of F-test. The basic idea is to select the most significant regression equation, which corresponds to the minimum P-value of F-test. We also present upper and lower bounds for the P-value together with approximations for these bounds which are simple to compute. Using this method, not only can we find out the best regression equation, we also obtain its significance probability. When the method is applied to the well-known data from Hald (1952) and Gorman & Toman (1966), the results are satisfactory.
Similar content being viewed by others
References
Aitkin MA (1974) Simultaneous inference and the choice of variable subsets in multiple regression. Technometrics 16:221–227
Draper NR, Smith H (1981) Applied regression analysis, (2nd ed). Wiley, New York: 294–327
Gorman JW, Toman RJ (1966) Selection of variables for fitting equations to data. Technometrics 8:27–51
Hald A (1952) Statistical theory with engineering applications. Wiley, New York:647
Mallows CL (1964) Choosing variables in a linear regression: A graphical aid. Presented at the Central Regional Meeting of the Institute of Mathematical Statistics, Manhattan, Kansas
Mallows CL (1973) Some comments onC p . Technometrics 15:661–675
Spjøtvoll E (1972) Multiple comparison of regression functions. Ann Math Statist. 43:1076–1088
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Zhang, J., Wang, X. Selecting the best regression equation via the P-value of F-test. Metrika 46, 33–40 (1997). https://doi.org/10.1007/BF02717164
Received:
Revised:
Issue Date:
DOI: https://doi.org/10.1007/BF02717164