Solving and Testing for Regressor-Error (in)Dependence When no Instrumental Variables are Available: With New Evidence for the Effect of Education on Income

Ebbes, Peter; Wedel, Michel; Böckenholt, Ulf; Steerneman, Ton

doi:10.1007/s11129-005-1177-6

Solving and Testing for Regressor-Error (in)Dependence When no Instrumental Variables are Available: With New Evidence for the Effect of Education on Income

Published: December 2005

Volume 3, pages 365–392, (2005)
Cite this article

Quantitative Marketing and Economics Aims and scope Submit manuscript

Peter Ebbes¹,
Michel Wedel²,
Ulf Böckenholt³ &
…
Ton Steerneman⁴

1428 Accesses
116 Citations
3 Altmetric
Explore all metrics

Abstract

This paper has two main contributions. Firstly, we introduce a new approach, the latent instrumental variables (LIV) method, to estimate regression coefficients consistently in a simple linear regression model where regressor-error correlations (endogeneity) are likely to be present. The LIV method utilizes a discrete latent variable model that accounts for dependencies between regressors and the error term. As a result, additional ‘valid’ observed instrumental variables are not required. Furthermore, we propose a specification test based on Hausman (1978) to test for these regressor-error correlations. A simulation study demonstrates that the LIV method yields consistent estimates and the proposed test-statistic has reasonable power over a wide range of regressor-error correlations and several distributions of the instruments.

Secondly, the LIV method is used to re-visit the relationship between education and income based on previously published data. Data from three studies are re-analyzed. We examine the effect of education on income, where the variable ‘education’ is potentially endogenous due to omitted ‘ability’ or other causes. In all three applications, we find an upward bias in the OLS estimates of approximately 7%. Our conclusions agree closely with recent results obtained in studies with twins that find an upward bias in OLS of about 10% (Card, 1999). We also show that for each of the three datasets the classical IV estimates for the return to education point to biases in OLS that are not consistent in terms of size and magnitude. Our conclusion is that LIV estimates are preferable to the classical IV estimates in understanding the effects of education on income.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Partial Identification and Sensitivity Analysis

Estimating the intergenerational elasticity of expected income with short-run income measures: a generalized error-in-variables model

Article 28 June 2023

Sensitivity analysis for inference in 2SLS/GMM estimation with possibly flawed instruments

Article 12 February 2015

References

Al-Hussaini, E.K. and K.E.-D. Ahmad. (1981). “On the Identifiability of Finite Mixtures of Distributions.” IEEE Transactions on Information Theory 27, 664–668.
Article Google Scholar
Angrist, J.D. and A.B. Krueger. (1991). “Does Compulsory School Attendance Affect Schooling and Earnings?” The Quarterly Journal of Economics 56, 979–1014.
Google Scholar
Bekker, P.A. (1994). “Alternative Approximations to the Distributions of Instrumental Variable Estimators.” Econometrica 62, 657–681.
Google Scholar
Belsley, D.A., E. Kuh, and R.E. Welsch. (1980). Regression Diagnostics: Identifying Influential Data and Sources of Collinearity. New York: John Wiley & Sons, Inc.
Google Scholar
Bound, J. and D.A. Jaeger. (1996). “On the Validity of Season of Birth as an Instrument in Wage Equations A Comment on Angrist and Krueger's Does Compulsory School Attendance Affect Schooling and Earnings?” Technical Report 5835, NBER.
Bound, J., D.A. Jaeger, and R.M. Baker. (1995). “Problems with Instrumental Variables Estimation When the Correlation Between the Instruments and the Endogenous Explanatory Variable is Weak.” Journal of the American Statistical Association 90, 443–450.
Google Scholar
Bowden, R.J. and D.A. Turkington. (1984). Instrumental Variables. New York: Cambridge University Press.
Google Scholar
Bronnenberg, B.J. and V. Mahajan. (2001). “Unobserved Retailer Behavior in Multimarket Data: Joint Spatial Dependence in Market Shares and Promotion Variables.” Marketing Science 20, 284–299.
Article Google Scholar
Card, D. (1995). “Using Geographical Variation in College Proximity to Estimate the Return to Schooling.” In L.N. Christofides, E. Grant, and R. Swidinsky (eds.), Aspects of Labour Market Behaviour: Essays in Honour of John Vanderkamp, Toronto: University of Toronto Press, pp. 201–222.
Google Scholar
Card, D. (1999). “The Causal Effect of Education on Earnings.” In Ashenfelter, O.C. and D. Card (eds.), Handbook of Labor Economics volume 3A, North-Holland: Elsevier Science B.V., pp. 1801–1863
Google Scholar
11 Card, D. (2001). “Estimating the Return to Schooling: Progress on Some Persistent Econometric Problems.” Econometrica 69, 1127–1160.
Article Google Scholar
Chamberlain, G. and Z. Griliches. (1975). “Unobservables with a Variance-Components Structure: Ability, Schooling, and the Economic Success of Brothers.” International Economic Review 16, 422–449.
Google Scholar
Cook, R.D. and S. Weisberg. (1982). Residuals and Influence in Regression. New York: Chapman and Hall.
Google Scholar
Dijk, van, A., H.J. van Heerde, P.S.H. Leeflang, and D.R. Wittink. (2004). “Similarity-Based Spatial Methods for Estimating Shelf Space Elasticities From Correlational Data.” Quantitative Marketing and Economics 2, 257–277.
Google Scholar
Ebbes, P. (2004). “Latent Instrumental Variables: A New Approach to Solve for Endogeneity.” Ph D thesis, SOM Research School, University of Groningen ({http://irs.ub.rug.nl/ppn/270945334}).
Erickson, T. and T.M. Whited. (2002). “Two-Step GMM Estimation of the Errors-in-Variables Model Using High-Order Moments.” Econometric Theory 18, 776–799.
Article Google Scholar
Fahrmeir, L. and G. Tutz. (1994). Multivariate Statistical Modelling Based on Generalized Linear Models. New York: Springer-Verlag.
Google Scholar
Fox, J. (1991). Regression Diagnostics. London: Sage Publications, inc.
Google Scholar
Greene, W.H. (2000). Econometric Analysis. New Jersey: Prentice-Hall, Inc., Upper Saddle River.
Google Scholar
Griliches, Z. (1977). “Estimating the Returns to Schooling: Some Econometric Problems.” Econometrica 45, 1–22.
Google Scholar
Hahn, J. and J. Hausman. (2003). “Weak Instrumens: Diagnosis and Cures in Empirical Econometrics.” American Economic Review 93, 118–125.
Google Scholar
Harmon, C. and I. Walker. (1995). “Estimates of the Economic Return to Schooling for the United Kingdom.” American Economic Review 85, 1278–1286.
Google Scholar
Hartog, J. (1988). “An Ordered Response Model for Allocation and Earnings.” Kyklos 41, 113–141.
Google Scholar
Hausman, J.A. (1978). “Specification Tests for Econometrics.” Econometrica 46, 1251–1271.
Google Scholar
Hennig, C. (2000). “Identifiability of Models for Clusterwise Linear Regression.” Journal of Classification 17, 273–296.
Article Google Scholar
Hertz, T. (2003). “Upward Bias in the Estimated Returns to Education: Evidence from South Africa.” The American Economic Review 93, 1354–1368.
Article Google Scholar
Judge, G.G., W.E. Griffiths, R.C. Hill, H. Lütkepohl, and T.-C. Lee. (1985). The Theory and Practice of Econometrics. New York: John Wiley & Sons Inc.
Google Scholar
Madansky, A. (1959). “The Fitting of Straight Lines When Both Variables are Subject to Error.” Journal of the American Statistical Association 54, 173–205.
Google Scholar
Manchanda, P., P.E. Rossi, and P.K. Chintagunta. (2004). “Response Modeling with Non-Random Marketing Mix Variables.” Journal of Marketing Research 41, 467–478.
Article Google Scholar
McLachlan, G.J. and D. Peel. (2000). Finite Mixture Models. New York: John Wiley & Sons, Inc.
Google Scholar
Mroz, T.A. (1987). “The Sensitivity of an Empirical Model of Married Women's Hours of Work to Economic and Statistical Assumptions.” Econometrica 55, 765–799.
Google Scholar
Pagan, A. (1984). “Econometric Issues in the Analysis of Regressions with Generated Regressors.” International Economic Review 25, 221–247.
Google Scholar
33 Ploeg, van der, J. (1997). “Instrumental Variable Estimation and Group-Asymptotics.” Ph D thesis, SOM Research School, University of Groningen (http://irs.ub.rug.nl/ppn/157854507).
Redner, R.A. and H.F. Walker. (1984). “Mixture Densities, Maximum Likelihood and the EM Algorithm.” SIAM Review 26, 195–239.
Article Google Scholar
Ruud, P.A. (2000). An Introduction to Classical Econometric Theory. New York: Oxford University Press.
Google Scholar
Staiger, D. and J.H. Stock. (1997). “Instrumental Variables Regression with Weak Instruments.” Econometrica, 65, 557–586.
Google Scholar
Stock, J.H., J.H. Wright, and M. Yogo. (2002). “A Survey of Weak Instruments and Weak Identification in Generalized Method of Moments.” {Journal of Business & Economic Statistics} 20, 518–529.
Google Scholar
Teicher, H. (1963). “Identifiability of Finite Mixtures.” The Annals of Mathematical Statistics 34, 1265–1269.
Google Scholar
Titterington, D.M., A.F.M. Smith, and U.E. Makov. (1985). Statistical Analysis of Finite Mixture Distributions. Chichester: John Wiley & Sons Ltd.
Google Scholar
Uusitalo, R. (1999). “Essays in Economics of Education.” Ph D thesis, University of Helsinki.
Verbeek, M. (2000). A Guide to Modern Econometrics. Chichester: John Wiley & Sons Ltd.
Google Scholar
Wald, A. (1940). “The Fitting of Straight Lines if Both Variables are Subject to Error.” The Annals of Mathematical Statistics 11, 284–300.
Google Scholar
White, H. (1980). “A Heteroscedasticity-Consistent Covariance Matrix Estimator and a Direct Test for Heteroscedasticity.” Econometrica 48, 817–838.
Google Scholar
Wooldridge, J.M. (2002). Econometric Analysis of Cross Section and Panel Data. Cambridge: Massachusetts Institute of Technology.
Google Scholar
Yakowitz, S.J. and J.D. Spragins. (1968). “On the Identifiability of Finite Mixtures.” The Annals of Mathematical Statistics 39, 209–214.
Google Scholar

Download references

Author information

Authors and Affiliations

Smeal College of Business, The Penn State University, University Park, PA, 16802-3603, USA
Peter Ebbes
Ross School of Business, University of Michigan, Ann Arbor, MI, 48109-1234, USA
Michel Wedel
Faculty of Management, McGill University, Montreal, PQ, H3A 1G5, Canada
Ulf Böckenholt
Department of Economics, University of Groningen, P.O.Box 800, 9700AV, Groningen, NL
Ton Steerneman

Authors

Peter Ebbes
View author publications
You can also search for this author in PubMed Google Scholar
Michel Wedel
View author publications
You can also search for this author in PubMed Google Scholar
Ulf Böckenholt
View author publications
You can also search for this author in PubMed Google Scholar
Ton Steerneman
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Peter Ebbes.

Additional information

JEL Classification: C12, C13, C21, J3, M3

Rights and permissions

Reprints and permissions

About this article

Cite this article

Ebbes, P., Wedel, M., Böckenholt, U. et al. Solving and Testing for Regressor-Error (in)Dependence When no Instrumental Variables are Available: With New Evidence for the Effect of Education on Income. Quant Market Econ 3, 365–392 (2005). https://doi.org/10.1007/s11129-005-1177-6

Download citation

Issue Date: December 2005
DOI: https://doi.org/10.1007/s11129-005-1177-6

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Solving and Testing for Regressor-Error (in)Dependence When no Instrumental Variables are Available: With New Evidence for the Effect of Education on Income

Abstract

Access this article

Similar content being viewed by others

Partial Identification and Sensitivity Analysis

Estimating the intergenerational elasticity of expected income with short-run income measures: a generalized error-in-variables model

Sensitivity analysis for inference in 2SLS/GMM estimation with possibly flawed instruments

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Solving and Testing for Regressor-Error (in)Dependence When no Instrumental Variables are Available: With New Evidence for the Effect of Education on Income

Abstract

Access this article

Similar content being viewed by others

Partial Identification and Sensitivity Analysis

Estimating the intergenerational elasticity of expected income with short-run income measures: a generalized error-in-variables model

Sensitivity analysis for inference in 2SLS/GMM estimation with possibly flawed instruments

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation