Skip to main content

Simple Linear Regression

  • Chapter
  • First Online:
Econometrics

Part of the book series: Classroom Companion: Economics ((CCE))

  • 2394 Accesses

Abstract

In this chapter, we study extensively the estimation of a linear relationship between two variables, Yi and Xi, of the form:

$$\displaystyle Y_{i}=\alpha +\beta X_{i}+u_{i}\quad i=1,2,\ldots ,n $$

where Yi denotes the i-th observation on the dependent variable Y  which could be consumption, investment, or output, and Xi denotes the i -th observation on the independent variable X which could be disposable income, the interest rate, or an input.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 54.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 69.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Change history

  • 01 January 2022

    A correction has been published.

References

  • Baltagi, B.H. (1995), “Optimal Weighting of Unbiased Estimators,” Econometric Theory, Problem 95.3.1, 11:637.

    Google Scholar 

  • Baltagi, B.H. and D. Levin (1992), “Cigarette Taxation: Raising Revenues and Reducing Consumption,” Structural Change and Economic Dynamics, 3: 321–335.

    Article  Google Scholar 

  • Belsley, D.A., E. Kuh and R.E. Welsch (1980), Regression Diagnostics (Wiley: New York).

    Book  Google Scholar 

  • Greene, W. (1993), Econometric Analysis (Macmillian: New York).

    Google Scholar 

  • Gujarati, D. (1995), Basic Econometrics (McGraw-Hill: New York).

    Google Scholar 

  • Johnston, J. (1984), Econometric Methods (McGraw-Hill: New York).

    Google Scholar 

  • Kelejian, H. and W. Oates (1989), Introduction to Econometrics (Harper and Row: New York).

    Google Scholar 

  • Kennedy, P. (1992), A Guide to Econometrics (MIT Press: Cambridge).

    Google Scholar 

  • Kmenta, J. (1986), Elements of Econometrics (Macmillan: New York).

    Google Scholar 

  • Maddala, G.S. (1992), Introduction to Econometrics (Macmillan: New York).

    Google Scholar 

  • Oksanen, E.H. (1993), “Efficiency as Correlation,” Econometric Theory, Problem 93.1.3, 9: 146.

    Google Scholar 

  • Samuel-Cahn, E. (1994), “Combining Unbiased Estimators,” The American Statistician, 48: 34–36.

    Google Scholar 

  • Theil, H. (1971), Principles of Econometrics (Wiley: New York).

    Google Scholar 

  • Wallace, D. and L. Silver (1988), Econometrics: An Introduction (Addison Wesley: New York).

    Google Scholar 

  • Zheng, J.X. (1994), “Efficiency as Correlation,” Econometric Theory, Solution 93.1.3, 10: 228.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Badi H. Baltagi .

Appendix: Centered and Uncentered R2

Appendix: Centered and Uncentered R2

From the OLS regression on (3.1) we get

$$\displaystyle \begin{aligned} Y_{i}=\widehat{Y}_{i}+e_{i}\quad i=1,2,\ldots ,n {} \end{aligned} $$
(A.1)

where \(\widehat {Y}_{i}=\widehat {\alpha } _{OLS}+X_{i}\widehat {\beta }_{OLS}\). Squaring and summing the above equation, we get

(A.2)

since . The uncentered R2 is given by

(A.3)

Note that the total sum of squares for Yi is not expressed in deviation from the sample mean \(\bar {Y}\).

In other words, the uncentered R2 is the proportion of variation of that is explained by the regression Y  on X. Regression packages usually report the centered R2 which was defined in Sect. 3.6 as where \(y_{i}=Y_{i}-\bar {Y}.\) The latter measure focuses on explaining the variation in Yi after fitting the constant.

From Sect. 3.6, we have seen that a naive model with only a constant in it gives \(\bar {Y}\) as the estimate of the constant, see also Problem 2. The variation in Yi that is not explained by this naive model is . Subtracting \(n \bar {Y}^{2}\) from both sides of (A.2) we get

and the centered R2 is

(A.4)

If there is a constant in the model \(\bar {Y}= \overline {\widehat {Y}}\), see Sect. 3.6, and . Therefore, the centered which is the R2 reported by regression packages. If there is no constant in the model, some regression packages give you the option of (no constant) and the R2 reported is usually the uncentered R2. Check your regression package documentation to verify what you are getting. We will encounter uncentered R2 again in constructing test statistics using regressions, see for example Chap. 11.

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this chapter

Cite this chapter

H. Baltagi, B. (2021). Simple Linear Regression. In: Econometrics. Classroom Companion: Economics. Springer, Cham. https://doi.org/10.1007/978-3-030-80149-6_3

Download citation

Publish with us

Policies and ethics