Abstract
In this chapter, we will consider pooling time-series of cross-sections. This may be a panel of households or firms or simply countries or states followed over time. Two well known examples of panel data in the U.S. are the Panel Study of Income Dynamics (PSID) and the National Longitudinal Survey (NLS). The PSID began in 1968 with 4802 families, including an over-sampling of poor households. Annual interviews were conducted and socioeconomic characteristics of each of the families and of roughly 31000 individuals who have been in these or derivative families were recorded. The list of variables collected is over 5000. The NLS, followed five distinct segments of the labor force. The original samples include 5020 older men, 5225 young men, 5083 mature women, 5159 young women and 12686 youths. There was an over-sampling of blacks, hispanics, poor whites and military in the youths survey. The list of variables collected runs into the thousands. An inventory of national studies using panel data is given at http://www.isr.umich.edu/src/psid/panelstudies.html. Pooling this data gives a richer source of variation which allows for more efficient estimation of the parameters. With additional, more informative data, one can get more reliable estimates and test more sophisticated behavioral models with less restrictive assumptions. Another advantage of panel data sets are their ability to control for individual heterogeneity. Not controlling for these unobserved individual specific effects leads to bias in the resulting estimates. Panel data sets are also better able to identify and estimate effects that are simply not detectable in pure cross-sections or pure timeseries data. In particular, panel data sets are better able to study complex issues of dynamic behavior. For example, with a cross-section data set one can estimate the rate of unemployment at a particular point in time. Repeated cross-sections can show how this proportion changes over time. Only panel data sets can estimate what proportion of those who are unemployed in one period remain unemployed in another period. Some of the benefits and limitations of using panel data sets are listed in Hsiao (2003) and Baltagi (2008). Section 12.2 studies the error components model focusing on fixed effects, random effects and maximum likelihood estimation. Section 12.3 considers the question of prediction in a random effects model, while Section 12.4 illustrates the estimation methods using an empirical example. Section 12.5 considers testing the poolability assumption, the existence of random individual effects and the consistency of the random effects estimator using a Hausman test. Section 12.6 studies the dynamic panel data model and illustrates the methods used with an empirical example. Section 12.7 concludes with a short presentation of program evaluation and the difference-in-differences estimator.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Notes
- 1.
This chapter is based on Baltagi (2008).
References
Footnote
This chapter is based on Baltagi (2008).
Ahn, S.C. and P. Schmidt (1995), “Efficient Estimation of Models for Dynamic Panel Data,” Journal of Econometrics, 68: 5–27.
Amemiya, T. (1971), “The Estimation of the Variances in a Variance-Components Model,” International Economic Review, 12: 1–13.
Anderson, T.W. and C. Hsiao (1982), “Formulation and Estimation of Dynamic Models Using Panel Data, Journal of Econometrics, 18: 47–82.
Arellano, M. (1989), “A Note on the Anderson-Hsiao Estimator for Panel Data,” Economics Letters, 31: 337–341.
Arellano, M. (1993), “On the Testing of Correlated Effects With Panel Data,” Journal of Econometrics, 59: 87–97.
Arellano, M. and S. Bond (1991), “Some Tests of Specification for Panel Data: Monte Carlo Evidence and An Application to Employment Equations,” Review of Economic Studies, 58: 277–297.
Balestra, P. (1973), “Best Quadratic Unbiased Estimators of the Variance-Covariance Matrix in Normal Regression,” Journal of Econometrics, 2: 17–28.
Baltagi, B.H. (1981), “Pooling: An Experimental Study of Alternative Testing and Estimation Procedures in a Two-Way Errors Components Model,” Journal of Econometrics, 17: 21–49.
Baltagi, B.H. (1996), “Heteroskedastic Fixed Effects Models,” Problem 96.5.1, Econometric Theory, 12: 867.
Baltagi, B.H. (1999), “The Relative Efficiency of the Between Estimator with Respect to the Within Estimator,” Problem 99.4.3, Econometric Theory, 15: 630–631.
Baltagi, B.H. (2008), Econometric Analysis of Panel Data (Wiley: Chichester).
Baltagi, B.H. and J.M. Griffin (1983), “Gasoline Demand in the OECD: An Application of Pooling and Testing Procedures,” European Economic Review, 22: 117–137.
Baltagi, B.H., J.M. Griffin and W. Xiong (2000), “To Pool or Not to Pool: Homogeneous Versus Heterogeneous Estimators Applied to Cigarette Demand,” Review of Economics and Statistics, 82: 117–126.
Baltagi, B.H. and W. Krämer (1994), “Consistency, Asymptotic Unbiasedness and Bounds on the Bias of s2 in the Linear Regression Model with Error Components Disturbances,” Statistical Papers, 35: 323–328.
Breusch, T.S. (1987), “Maximum Likelihood Estimation of Random Effects Models,” Journal of Econometrics, 36: 383–389.
Breusch, T.S. and A.R. Pagan (1980), “The Lagrange Multiplier Test and its Applications to Model Specification in Econometrics,” Review of Economic Studies, 47: 239–253. Card (1990), “The Impact of the Mariel Boat Lift on the Miami Labor Market,” Industrial and Labor Relations Review, 43: 245–253.
Chow, G.C. (1960), “Tests of Equality Between Sets of Coefficients in Two Linear Regressions,” Econometrica, 28: 591–605.
Cornwell, C. and W.N. Trumbull (1994), “Estimating the Economic Model of Crime with Panel Data,” Review of Economics and Statistics 76: 360–366.
Evans, M.A. and M.L. King (1985), “Critical Value Approximations for Tests of Linear Regression Disturbances,” Australian Journal of Statistics, 27: 68–83.
Fisher, F.M. (1970), “Tests of Equality Between Sets of Coefficients in Two Linear Regressions: An Expository Note,” Econometrica, 38: 361–366.
Fuller, W.A. and G.E. Battese (1974), “Estimation of Linear Models with Cross-Error Structure,” Journal of Econometrics, 2: 67–78.
Goldberger, A.S. (1962), “Best Linear Unbiased Prediction in the Generalized Linear Regression Model,” Journal of the American Statistical Association, 57: 369–375.
Graybill, F.A. (1961), An Introduction to Linear Statistical Models (McGraw-Hill: New York). Hansen, L.P. (1982), “Large Sample Properties of Generalized Method of Moments Estimators,” Econometrica, 50: 1029–1054.
Hausman, J.A. (1978), “Specification Tests in Econometrics,” Econometrica, 46: 1251–1271.
Honda, Y. (1985), “Testing the Error Components Model with Non-Normal Disturbances,” Review of Economic Studies, 52: 681–690.
Hsiao, C. (2003), Analysis of Panel Data (Cambridge University Press: Cambridge).
Judge, G.G., W.E. Griffiths, R.C. Hill, H. Lutkepohl and T.C. Lee (1985), The Theory and Practice of Econometrics (Wiley: New York).
Kiviet, J.F. and W. Krämer (1992), “Bias of s2 in the Linear Regression Model with Correlated Errors,” Empirical Economics, 16: 375–377.
Maddala, G.S. (1971), “The Use of Variance Components Models in Pooling Cross Section and Time Series Data,” Econometrica, 39: 341–358.
Maddala, G.S. and T. Mount (1973), “A Comparative Study of Alternative Estimators for Variance Components Models Used in Econometric Applications,” Journal of the American Statistical Association, 68: 324–328.
Moulton, B.R. and W.C. Randolph (1989), “Alternative Tests of the Error Components Model,” Econometrica, 57: 685–693.
Nerlove, M. (1971), “A Note on Error Components Models,” Econometrica, 39: 383–396.
Nickell, S. (1981), “Biases in Dynamic Models with Fixed Effects,”Econometrica, 49: 1417–1426.
Searle, S.R. (1971), Linear Models (Wiley: New York).
Sargan, J. (1958), “The Estimation of Economic Relationships Using Instrumental Variables,” Econometrica, 26: 393–415.
Swamy, P.A.V.B. and S.S. Arora (1972), “The Exact Finite Sample Properties of the Estimators of Coefficients in the Error Components Regression Models,” Econometrica, 40: 261–275.
Taub, A.J. (1979), “Prediction in the Context of the Variance-Components Model,” Journal of Econometrics, 10: 103–108.
Taylor, W.E. (1980), “Small Sample Considerations in Estimation from Panel Data,” Journal of Econometrics, 13: 203–223.
Wallace, T. and A. Hussain (1969), “The Use of Error Components Models in Combining Cross-Section and Time-Series Data,” Econometrica, 37: 55–72.
Wansbeek, T.J. and A. Kapteyn (1978), “The Separation of Individual Variation and Systematic Change in the Analysis of Panel Data,” Annales de l’INSEE, 30–31: 659–680.
Wansbeek, T.J. and A. Kapteyn (1982), “A Simple Way to Obtain the Spectral Decomposition of Variance Components Models for Balanced Data,” Communications in Statistics All, 2105–2112.
Wansbeek, T.J. and A. Kapteyn, (1989), “Estimation of the error components model with incomplete panels,” Journal of Econometrics 41: 341–361.
Zellner, A. (1962), “An Efficient Method of Estimating Seemingly Unrelated Regression and Tests for Aggregation Bias,” Journal of the American Statistical Association, 57: 348–368.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Baltagi, B.H. (2011). Pooling Time-Series of Cross-Section Data. In: Econometrics. Springer Texts in Business and Economics. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20059-5_12
Download citation
DOI: https://doi.org/10.1007/978-3-642-20059-5_12
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-20058-8
Online ISBN: 978-3-642-20059-5
eBook Packages: Business and EconomicsEconomics and Finance (R0)