Pooling Time-Series of Cross-Section Data

Baltagi, Badi H.

doi:10.1007/978-3-642-20059-5_12

Badi H. Baltagi²

Part of the book series: Springer Texts in Business and Economics ((STBE))

119k Accesses

Abstract

In this chapter, we will consider pooling time-series of cross-sections. This may be a panel of households or firms or simply countries or states followed over time. Two well known examples of panel data in the U.S. are the Panel Study of Income Dynamics (PSID) and the National Longitudinal Survey (NLS). The PSID began in 1968 with 4802 families, including an over-sampling of poor households. Annual interviews were conducted and socioeconomic characteristics of each of the families and of roughly 31000 individuals who have been in these or derivative families were recorded. The list of variables collected is over 5000. The NLS, followed five distinct segments of the labor force. The original samples include 5020 older men, 5225 young men, 5083 mature women, 5159 young women and 12686 youths. There was an over-sampling of blacks, hispanics, poor whites and military in the youths survey. The list of variables collected runs into the thousands. An inventory of national studies using panel data is given at http://www.isr.umich.edu/src/psid/panelstudies.html. Pooling this data gives a richer source of variation which allows for more efficient estimation of the parameters. With additional, more informative data, one can get more reliable estimates and test more sophisticated behavioral models with less restrictive assumptions. Another advantage of panel data sets are their ability to control for individual heterogeneity. Not controlling for these unobserved individual specific effects leads to bias in the resulting estimates. Panel data sets are also better able to identify and estimate effects that are simply not detectable in pure cross-sections or pure timeseries data. In particular, panel data sets are better able to study complex issues of dynamic behavior. For example, with a cross-section data set one can estimate the rate of unemployment at a particular point in time. Repeated cross-sections can show how this proportion changes over time. Only panel data sets can estimate what proportion of those who are unemployed in one period remain unemployed in another period. Some of the benefits and limitations of using panel data sets are listed in Hsiao (2003) and Baltagi (2008). Section 12.2 studies the error components model focusing on fixed effects, random effects and maximum likelihood estimation. Section 12.3 considers the question of prediction in a random effects model, while Section 12.4 illustrates the estimation methods using an empirical example. Section 12.5 considers testing the poolability assumption, the existence of random individual effects and the consistency of the random effects estimator using a Hausman test. Section 12.6 studies the dynamic panel data model and illustrates the methods used with an empirical example. Section 12.7 concludes with a short presentation of program evaluation and the difference-in-differences estimator.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 49.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Notes

1.
This chapter is based on Baltagi (2008).

References

Footnote
This chapter is based on Baltagi (2008).

Google Scholar
Ahn, S.C. and P. Schmidt (1995), “Efficient Estimation of Models for Dynamic Panel Data,” Journal of Econometrics, 68: 5–27.
Article Google Scholar
Amemiya, T. (1971), “The Estimation of the Variances in a Variance-Components Model,” International Economic Review, 12: 1–13.
Article Google Scholar
Anderson, T.W. and C. Hsiao (1982), “Formulation and Estimation of Dynamic Models Using Panel Data, Journal of Econometrics, 18: 47–82.
Article Google Scholar
Arellano, M. (1989), “A Note on the Anderson-Hsiao Estimator for Panel Data,” Economics Letters, 31: 337–341.
Article Google Scholar
Arellano, M. (1993), “On the Testing of Correlated Effects With Panel Data,” Journal of Econometrics, 59: 87–97.
Article Google Scholar
Arellano, M. and S. Bond (1991), “Some Tests of Specification for Panel Data: Monte Carlo Evidence and An Application to Employment Equations,” Review of Economic Studies, 58: 277–297.
Article Google Scholar
Balestra, P. (1973), “Best Quadratic Unbiased Estimators of the Variance-Covariance Matrix in Normal Regression,” Journal of Econometrics, 2: 17–28.
Article Google Scholar
Baltagi, B.H. (1981), “Pooling: An Experimental Study of Alternative Testing and Estimation Procedures in a Two-Way Errors Components Model,” Journal of Econometrics, 17: 21–49.
Article Google Scholar
Baltagi, B.H. (1996), “Heteroskedastic Fixed Effects Models,” Problem 96.5.1, Econometric Theory, 12: 867.
Google Scholar
Baltagi, B.H. (1999), “The Relative Efficiency of the Between Estimator with Respect to the Within Estimator,” Problem 99.4.3, Econometric Theory, 15: 630–631.
Google Scholar
Baltagi, B.H. (2008), Econometric Analysis of Panel Data (Wiley: Chichester).
Google Scholar
Baltagi, B.H. and J.M. Griffin (1983), “Gasoline Demand in the OECD: An Application of Pooling and Testing Procedures,” European Economic Review, 22: 117–137.
Article Google Scholar
Baltagi, B.H., J.M. Griffin and W. Xiong (2000), “To Pool or Not to Pool: Homogeneous Versus Heterogeneous Estimators Applied to Cigarette Demand,” Review of Economics and Statistics, 82: 117–126.
Article Google Scholar
Baltagi, B.H. and W. Krämer (1994), “Consistency, Asymptotic Unbiasedness and Bounds on the Bias of s2 in the Linear Regression Model with Error Components Disturbances,” Statistical Papers, 35: 323–328.
Article Google Scholar
Breusch, T.S. (1987), “Maximum Likelihood Estimation of Random Effects Models,” Journal of Econometrics, 36: 383–389.
Article Google Scholar
Breusch, T.S. and A.R. Pagan (1980), “The Lagrange Multiplier Test and its Applications to Model Specification in Econometrics,” Review of Economic Studies, 47: 239–253. Card (1990), “The Impact of the Mariel Boat Lift on the Miami Labor Market,” Industrial and Labor Relations Review, 43: 245–253.
Article Google Scholar
Chow, G.C. (1960), “Tests of Equality Between Sets of Coefficients in Two Linear Regressions,” Econometrica, 28: 591–605.
Article Google Scholar
Cornwell, C. and W.N. Trumbull (1994), “Estimating the Economic Model of Crime with Panel Data,” Review of Economics and Statistics 76: 360–366.
Article Google Scholar
Evans, M.A. and M.L. King (1985), “Critical Value Approximations for Tests of Linear Regression Disturbances,” Australian Journal of Statistics, 27: 68–83.
Article Google Scholar
Fisher, F.M. (1970), “Tests of Equality Between Sets of Coefficients in Two Linear Regressions: An Expository Note,” Econometrica, 38: 361–366.
Article Google Scholar
Fuller, W.A. and G.E. Battese (1974), “Estimation of Linear Models with Cross-Error Structure,” Journal of Econometrics, 2: 67–78.
Article Google Scholar
Goldberger, A.S. (1962), “Best Linear Unbiased Prediction in the Generalized Linear Regression Model,” Journal of the American Statistical Association, 57: 369–375.
Article Google Scholar
Graybill, F.A. (1961), An Introduction to Linear Statistical Models (McGraw-Hill: New York). Hansen, L.P. (1982), “Large Sample Properties of Generalized Method of Moments Estimators,” Econometrica, 50: 1029–1054.
Article Google Scholar
Hausman, J.A. (1978), “Specification Tests in Econometrics,” Econometrica, 46: 1251–1271.
Article Google Scholar
Honda, Y. (1985), “Testing the Error Components Model with Non-Normal Disturbances,” Review of Economic Studies, 52: 681–690.
Article Google Scholar
Hsiao, C. (2003), Analysis of Panel Data (Cambridge University Press: Cambridge).
Google Scholar
Judge, G.G., W.E. Griffiths, R.C. Hill, H. Lutkepohl and T.C. Lee (1985), The Theory and Practice of Econometrics (Wiley: New York).
Google Scholar
Kiviet, J.F. and W. Krämer (1992), “Bias of s2 in the Linear Regression Model with Correlated Errors,” Empirical Economics, 16: 375–377.
Google Scholar
Maddala, G.S. (1971), “The Use of Variance Components Models in Pooling Cross Section and Time Series Data,” Econometrica, 39: 341–358.
Article Google Scholar
Maddala, G.S. and T. Mount (1973), “A Comparative Study of Alternative Estimators for Variance Components Models Used in Econometric Applications,” Journal of the American Statistical Association, 68: 324–328.
Article Google Scholar
Moulton, B.R. and W.C. Randolph (1989), “Alternative Tests of the Error Components Model,” Econometrica, 57: 685–693.
Article Google Scholar
Nerlove, M. (1971), “A Note on Error Components Models,” Econometrica, 39: 383–396.
Article Google Scholar
Nickell, S. (1981), “Biases in Dynamic Models with Fixed Effects,”Econometrica, 49: 1417–1426.
Google Scholar
Searle, S.R. (1971), Linear Models (Wiley: New York).
Google Scholar
Sargan, J. (1958), “The Estimation of Economic Relationships Using Instrumental Variables,” Econometrica, 26: 393–415.
Article Google Scholar
Swamy, P.A.V.B. and S.S. Arora (1972), “The Exact Finite Sample Properties of the Estimators of Coefficients in the Error Components Regression Models,” Econometrica, 40: 261–275.
Article Google Scholar
Taub, A.J. (1979), “Prediction in the Context of the Variance-Components Model,” Journal of Econometrics, 10: 103–108.
Article Google Scholar
Taylor, W.E. (1980), “Small Sample Considerations in Estimation from Panel Data,” Journal of Econometrics, 13: 203–223.
Article Google Scholar
Wallace, T. and A. Hussain (1969), “The Use of Error Components Models in Combining Cross-Section and Time-Series Data,” Econometrica, 37: 55–72.
Article Google Scholar
Wansbeek, T.J. and A. Kapteyn (1978), “The Separation of Individual Variation and Systematic Change in the Analysis of Panel Data,” Annales de l’INSEE, 30–31: 659–680.
Google Scholar
Wansbeek, T.J. and A. Kapteyn (1982), “A Simple Way to Obtain the Spectral Decomposition of Variance Components Models for Balanced Data,” Communications in Statistics All, 2105–2112.
Google Scholar
Wansbeek, T.J. and A. Kapteyn, (1989), “Estimation of the error components model with incomplete panels,” Journal of Econometrics 41: 341–361.
Article Google Scholar
Zellner, A. (1962), “An Efficient Method of Estimating Seemingly Unrelated Regression and Tests for Aggregation Bias,” Journal of the American Statistical Association, 57: 348–368.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Center for Policy Research Department of Economics, Syracuse University, Eggers Hall 426, 13244-1020, Syracuse, New York, USA
Prof. Badi H. Baltagi

Authors

Prof. Badi H. Baltagi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Badi H. Baltagi .

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Baltagi, B.H. (2011). Pooling Time-Series of Cross-Section Data. In: Econometrics. Springer Texts in Business and Economics. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20059-5_12

Download citation

DOI: https://doi.org/10.1007/978-3-642-20059-5_12
Published: 09 April 2011
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-20058-8
Online ISBN: 978-3-642-20059-5
eBook Packages: Business and EconomicsEconomics and Finance (R0)

Publish with us

Policies and ethics