Skip to main content

The Pitfalls of Ignoring Outliers in Instrumental Variables Estimations: An Application to the Deep Determinants of Development

  • Chapter
  • First Online:
  • 1917 Accesses

Part of the book series: Advanced Studies in Theoretical and Applied Econometrics ((ASTA,volume 48))

Abstract

The extreme sensitivity of instrumental variables (IV) estimators to outliers is a crucial problem too often neglected or poorly dealt with. We address this issue by making the practitioner aware of the existence, usefulness, and inferential implications of robust-to-outliers instrumental variables estimators. We describe how the standard IV estimator can be made robust to outliers, provide a brief description of alternative robust IV estimators, simulate the behaviour of both the standard IV estimator and each robust IV estimator in presence of different types of outliers, and conclude by replicating a celebrated study on the deep determinants of development in order to establish the danger of ignoring outliers in an IV model.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    For the sake of clarity, vectors and matrices are in bold font and scalars are in normal font.

  2. 2.

    Note that these authors were more interested in the advantages offered by quantile regressions to study the distribution of a given variable than in the resistance of a median regression-type estimator to certain types of outliers.

  3. 3.

    We have implemented Cohen-Freue et al. (2013)-type RIV estimators in Stata, see Desbordes and Verardi (2013) for a description of the ready-to-use—-robivreg—package.

  4. 4.

    Ready-to-use Stata code for the Stahel and Donoho estimator is available upon request to the authors.

  5. 5.

    We have implemented this estimator using the -cqiv- Stata code written by Chernozhukov et al. (2010), given that other interested practitioners are likely to turn to the same source. However, note that we obtain similar results when we adopt a more flexible function of the first-stage residuals in the second stage. Results are available upon request.

  6. 6.

    For the simulation, we define \(\omega _{i} = I\left (\delta _{i}(\mathbf{X}_{C}.\mathbf{Z_{C}^{E}})\vdots_{(\mathbf{X}_{ D},\mathbf{Z}_{D}^{E})} < \sqrt{\chi _{p_{1 } +m_{3 },0.95 }^{2}}\right )\), \(\phi _{i} =\sigma \frac{\psi _{i}(\frac{r_{i}} {\sigma } )} {r_{i}}\), \(\psi _{i}(u) = u\left [1 -\left ( \frac{u} {1.546}\right )^{2}\right ]^{2}I\left (\left \vert u\right \vert \leq 1.546\right )\) where I is the indicator function.

  7. 7.

    The loss function is defined as \(\rho (u) = \frac{u^{2}} {2} I\left (\left \vert u\right \vert \leq 4.685\right ) + \left (4.685\left \vert u\right \vert -\frac{u^{2}} {2} \right )I\left (\left \vert u\right \vert > 4.685\right )\).

  8. 8.

    Obviously, in the absence of contamination, the standard IV estimator should be adopted.

  9. 9.

    http://www.hks.harvard.edu/fs/drodrik/research.html.

  10. 10.

    As in Rodrik et al. (2004), regressors have been scaled by expressing them as deviations from their mean divided by their standard deviation.

  11. 11.

    The first-stage F-statistics measure the correlation of the excluded instruments with the endogenous regressors, adjusted for the presence of two endogenous regressors. With weak identification, the classical IV estimator can be severely biased.

  12. 12.

    The DFBETA statistics measure how each regression coefficient changes when each observation is deleted in turn.

  13. 13.

    The value for the degrees of freedom of the Chi-Square distribution corresponds to the presence in the IV model of one dependent variable, three explanatory variables and two excluded instruments with 1 + 3 + 2 = 6.

  14. 14.

    Note that these dummies take the value of one for all countries in a specific group. They are not restricted to the identified outliers. These dummies capture therefore common group (regional) effects.

References

  • Acemoglu, D., Johnson,S., & Robinson, J. A. (2001). The colonial origins of comparative development: An empirical investigation. American Economic Review, 91(5), 1369–1401.

    Article  Google Scholar 

  • Albouy, D. Y. (2012). The Colonial origins of comparative development: An investigation of the settler mortality data. American Economic Review, 102(6), 3059–3076.

    Article  Google Scholar 

  • Amemiya, T. (1982). Two stage least absolute deviations estimators. Econometrica, 50(3), 689–711.

    Article  Google Scholar 

  • Belsley, D. A., Kuh, E., & Welsch, R. E. (1980). Regression diagnostics: Identifying influential data and sources of collinearity. Hoboken, NJ: Wiley-Interscience.

    Book  Google Scholar 

  • Chernozhukov, V., Fernandez-Val, I., & Kowalski, A. E. (2010). Quantile regression with censoring and endogeneity. Working Paper 2009–012, Department of Economics, Boston University.

    Google Scholar 

  • Cohen-Freue, G. V., Ortiz-Molina, H., & Zamar, R. H. (2013). A natural robustification of the ordinary instrumental variables estimator. Biometrics. doi:10.1111/biom.12043.

    Google Scholar 

  • Desbordes, R., & Verardi, V. (2013). A robust instrumental variables estimator. Stata Journal, 12(2), 169–181.

    Google Scholar 

  • Dollar, D., & Kraay, A. (2003). Institutions, trade, and growth. Journal of Monetary Economics, 50(1), 133–162.

    Article  Google Scholar 

  • Donoho, D. (1982). Breakdown properties of multivariate location estimators. Qualifying Paper, Harvard University, Boston.

    Google Scholar 

  • Frankel, J. A., & Romer, D. (1999). Does trade cause growth? American Economic Review, 89(3), 379–399.

    Article  Google Scholar 

  • Honore, B. E., & Hu, L. (2004). On the performance of some robust instrumental variables estimators. Journal of Business & Economic Statistics, 22(1), 30–39.

    Article  Google Scholar 

  • Huber, P. J. (1964). Robust estimation of a location parameter. Annals of Mathematical Statistics, 35, 73–101.

    Article  Google Scholar 

  • Lee, S. (2007). Endogeneity in quantile regression models: A control function approach. Journal of Econometrics, 141(2), 1131–1158.

    Article  Google Scholar 

  • Lucas, A., Van Dijk, R., & Kloek, T. (1997). Outlier robust GMM estimation of leverage determinants in linear dynamic panel data models. Working Paper, Department of Financial Sector Management and Tinbergen Institute, ECO/BFS, Vrije Universiteit.

    Google Scholar 

  • Maronna, R., Martin, D., & Yohai, V. (2006). Robust statistics. New York, NY: Wiley.

    Book  Google Scholar 

  • Maronna, R. A., & Yohai, V. J. (2000). Robust regression with both continuous and categorical predictors. Journal of Statistical Planning and Inference, 89(1–2), 197–214.

    Article  Google Scholar 

  • Rodrik, D., Subramanian, A., & Trebbi, A. (2004). Institutions rule: The primacy of institutions over geography and integration in economic development. Journal of Economic Growth, 9(2), 441–443.

    Article  Google Scholar 

  • Ronchetti, E., & Trojani, F. (2001). Robust inference with GMM estimators. Journal of Econometrics, 101(1), 37–69.

    Article  Google Scholar 

  • Stahel, W. (1981). Robuste Schätzungen: Infinitesimale Optimalität und Schätzungen von Kovarianzmatrizen (Ph.D. thesis). ETH Zürich, Zürich.

    Google Scholar 

  • Staiger, D., & Stock, J. H. (1997). Instrumental variables regression with weak instruments. Econometrica, 65(3), 557–586.

    Article  Google Scholar 

  • Wagenvoort, R., & Waldmann, R. (2002). On B-robust instrumental variable estimation of the linear model with panel data. Journal of Econometrics, 106(2), 297–324.

    Article  Google Scholar 

  • Welsh, A. H., & Ronchetti, E. (2002). A journey in single steps: robust one-step M-estimation in linear regression. Journal of Statistical Planning and Inference, 103(1–2), 287-310. http://archive-ouverte.unige.ch/unige:23228.

Download references

Acknowledgements

Catherine Dehon gratefully acknowledges research support from the IAP research network grant nr. P7/06 of the Belgian government (Belgian Science Policy) and from the ARC contract of the Communauté Française de Belgique. Catherine Dehon is also member of ECORE, the association between CORE and ECARES.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Catherine Dehon .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this chapter

Cite this chapter

Dehon, C., Desbordes, R., Verardi, V. (2015). The Pitfalls of Ignoring Outliers in Instrumental Variables Estimations: An Application to the Deep Determinants of Development. In: Beran, J., Feng, Y., Hebbel, H. (eds) Empirical Economic and Financial Research. Advanced Studies in Theoretical and Applied Econometrics, vol 48. Springer, Cham. https://doi.org/10.1007/978-3-319-03122-4_12

Download citation

Publish with us

Policies and ethics