Abstract
This paper incorporates technological interdependence into a neoclassical regional growth framework with imperfect factor mobility, leading to a convergence equation with spatial effects. The empirical analysis is based on the estimation of a spatial Durbin panel data model and the implementation of multiple imputation techniques. Our results show that taking into account both unobserved heterogeneity and spatial dependence increases the estimated regional convergence rate. This provides an explanation for puzzling findings in the related literature. We also obtain evidence of heterogeneity across country groups regarding the regional speed of convergence and the degree of diffusion of technology.
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00168-020-00978-4/MediaObjects/168_2020_978_Fig1_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs00168-020-00978-4/MediaObjects/168_2020_978_Fig2_HTML.png)
Similar content being viewed by others
Notes
Leaving the disturbance term aside, expression (15) is equivalent to Eq. (7) in GLLS (page 264) in the absence of technological interdependence, i.e., when \(\rho = 0\). In matrix form, the convergence equation without productive externalities is given by \(G_{t+1}=E_{t+1}+F+(\tau \alpha -1) Y_{t}+ \alpha (1-\tau ) \bar{Y}_{t}\); with \(E_{t+1}=\alpha (v_{t+1} + s)\) and \(F=A+\alpha (1-\tau )\hat{A}\), where \(\hat{A}\) denotes a vector of dimension N including the regional relative TFPs (in natural logarithms).
See Lee and Carlin (2010) for a comparison of MICE with the iterative multivariate normal (MVN) method. The latter ensures that the imputed values are drawn from a specific distribution. Although MVN may be more attractive from a theoretical point of view, it may not be suitable for formulating a joint model for general data structures. That is to say, this technique is appropriate if the multivariate normal distribution is a reasonable description of the data. Panzera et al. (2016) have developed a procedure to deal with missing information in spatial data that combines Bayesian interpolation and multiple imputation. However, this method also relies on the assumption that the distribution of the underlying spatial process is normal.
Equation (15) is reformulated as a dynamic panel data model, see the seminal contribution of Islam (1995). Nevertheless, the difference in frequency of the information contained in the data set constructed by GLLS, determined at the country level, requires the estimation of a static panel where the dependent variable is the annualized compounded growth rate of real GDP per capita.
Depending on the distributional assumption for the disturbance term, the SDM can be estimated using maximum likelihood (Elhorst 2003, 2010) or the quasi-maximum likelihood approach (Belotti et al. 2017). In the present paper, we have followed this second alternative (i.e., assuming non-normal errors).
References
Alfaro L, Chanda A, Kalemli-Ozcan S, Sayek S (2004) FDI and economic growth: the role of local financial markets. J Int Econ 64(1):89–112. https://doi.org/10.1016/S0022-1996(03)00081-3
Badinger H, Müller W, Tondl G (2004) Regional convergence in the European Union, 1985–1999: a spatial dynamic panel analysis. Reg Stud 38(3):241–253. https://doi.org/10.1080/003434042000211105
Barro RJ (2015) Convergence and modernisation. Econ J 125(585):911–942. https://doi.org/10.1111/ecoj.12247
Baumont C, Ertur C, Le Gallo J (2003) Spatial convergence clubs and the European regional growth process, 1980–1995. In: Fingleton B (ed) European regional growth. Springer, Berlin, pp 131–158. https://doi.org/10.1007/978-3-662-07136-6_5
Belotti F, Hughes G, Mortari AP (2017) Spatial panel-data models using Stata. Stata J 17(1):139–180
Benos N, Karagiannis S, Karkalakos S (2015) Proximity and growth spillovers in European regions: the role of geographical, economic and technological linkages. J Macroecon 43:124–139. https://doi.org/10.1016/j.jmacro.2014.10.003
Breinlich H, Ottaviano GIP, Temple JRW (2014) Regional growth and regional decline. In: Philippe A, Durlauf SN (eds) Handbook of economic growth, vol 2. Elsevier, Amsterdam, pp 683–779. https://doi.org/10.1016/B978-0-444-53540-5.00004-5
Buuren S, Groothuis-Oudshoorn K (2011) MICE: multivariate imputation by chained equations in R. J Stat Softw 45(1):1–67. https://doi.org/10.18637/jss.v045.i03
Carrington A (2003) A divided Europe? Regional convergence and neighbourhood spillover effects. Kyklos 56(3):381–393. https://doi.org/10.1046/j.0023-5962.2003.00226.x
Comin D, Nanda R (2019) Financial development and technology diffusion. IMF Econ Rev 67(2):395–419. https://doi.org/10.1057/s41308-019-00078-0
Cravo TA, Resende GM (2013) Economic growth in Brazil: a spatial filtering approach. Ann Reg Sci 50(2):555–575. https://doi.org/10.1007/s00168-012-0504-6
Dapena AD, Rubiera-Morollon F, Paredes D (2019) New approach to economic convergence in the EU: a multilevel analysis from the spatial effects perspective. Int Reg Sci Rev 42(3–4):335–367. https://doi.org/10.1177/0160017618804010
Debarsy N (2012) The Mundlak approach in the spatial Durbin panel data model. Spat Econ Anal 7(1):109–131. https://doi.org/10.1080/17421772.2011.647059
Elhorst JP (2003) Specification and estimation of spatial panel data models. Int Reg Sci Rev 26(3):244–268. https://doi.org/10.1177/0160017603253791
Elhorst JP (2010) Spatial panel data models. In: Fischer MM, Getis A (eds) Handbook of applied spatial analysis: software tools, methods and applications. Springer, Berlin, pp 377–407. https://doi.org/10.1007/978-3-642-03647-7-19
Elhorst JP (2014) Spatial econometrics: from cross-sectional data to spatial panels. Springer, Berlin. https://doi.org/10.1007/978-3-642-40340-8
Ertur C, Koch W (2007) Growth, technological interdependence and spatial externalities: theory and evidence. J Appl Econom 22(6):1033–1062. https://doi.org/10.1002/jae.963
Fingleton B, López-Bazo E (2006) Empirical growth models with spatial effects. Pap Reg Sci 85(2):177–198. https://doi.org/10.1111/j.1435-5957.2006.00074.x
Fischer MM (2011) A spatial Mankiw–Romer–Weil model: theory and evidence. Ann Reg Sci 47(2):419–436. https://doi.org/10.1007/s00168-010-0384-6
Fischer MM, Bartkowska M, Riedl A, Sardadvar S, Kunnert A (2009) The impact of human capital on regional labor productivity in Europe. Lett Spat Resour Sci 2(2):97–108. https://doi.org/10.1007/s12076-009-0027-7
Geary RC (1954) The contiguity ratio and statistical mapping. Incorp Stat 5(3):115–146. https://doi.org/10.2307/2986645
Gennaioli N, La Porta R, Lopez-de-Silanes F, Shleifer A (2013) Human capital and regional development. Q J Econ 128(1):105–164. https://doi.org/10.1093/qje/qjs050
Gennaioli N, La Porta R, Silanes FLD, Shleifer A (2014) Growth in regions. J Econ Growth 19(3):259–309. https://doi.org/10.1007/s10887-014-9105-9
Giles DEA, Stroomer CN (2006) Does trade openness affect the speed of output convergence? Some empirical evidence. Empir Econ 31(4):883–903. https://doi.org/10.1007/s00181-006-0060-y
Hofmann P (2013) The impact of international trade and FDI on economic growth and technological change. Contributions to economics. Springer, Berlin. https://doi.org/10.1007/978-3-642-34581-4
Islam N (1995) Growth empirics: a panel data approach. Q J Econ 110(4):1127–1170. https://doi.org/10.2307/2946651
Jinji N, Zhang X, Haruna S (2019) Do deeper regional trade agreements enhance international technology spillovers? World Econ. https://doi.org/10.1111/twec.12797
Kiszewski A et al (2004) A global index representing the stability of malaria transmission. Am J Trop Med Hyg 70(5):486–498
Koch W (2010) Growth and spatial dependence in Europe. In: Páez A, Gallo J, Buliung RN, Dall’erba S (eds) Progress in spatial analysis: methods and applications. Springer, Berlin, pp 465–482. https://doi.org/10.1007/978-3-642-03326-1-21
Le Gallo J, Fingleton B (2014) Regional growth and convergence empirics. In: Fischer MM, Nijkamp P (eds) Handbook of regional science. Springer, Berlin, pp 291–315. https://doi.org/10.1007/978-3-642-23430-9-17
Lee KJ, Carlin JB (2010) Multiple imputation for missing data: fully conditional specification versus multivariate normal imputation. Am J Epidemiol 171(5):624–632. https://doi.org/10.1093/aje/kwp425
Lim U (2016) Regional income club convergence in US BEA economic areas: a spatial switching regression approach. Ann Reg Sci 56(1):273–294. https://doi.org/10.1007/s00168-015-0739-0
Mitton T (2016) The wealth of subnations: geography, institutions, and within country development. J Dev Econ 118:88–111. https://doi.org/10.1016/j.jdeveco.2015.09.002
Moran PAP (1950) Notes on continuous stochastic phenomena. Biometrika 37(1/2):17–23. https://doi.org/10.2307/2332142
Mundlak Y (1978) On the pooling of time series and cross section data. Econometrica 46(1):69–85. https://doi.org/10.2307/1913646
Panzera D, Benedetti R, Postiglione P (2016) A Bayesian approach to parameter estimation in the presence of spatial missing data. Spat Econ Anal 11(2):201–218. https://doi.org/10.1080/17421772.2016.1102962
Pfaffermayr M (2009) Maximum likelihood estimation of a general unbalanced spatial random effects model: a Monte Carlo study. Spat Econ Anal 4(4):467–483. https://doi.org/10.1080/17421770903317645
Pfaffermayr M (2012) Spatial convergence of regions revisited: a spatial maximum likelihood panel approach. J Reg Sci 52(5):857–873. https://doi.org/10.1111/j.1467-9787.2012.00773.x
Ramajo J, Márquez MA, Hewings GJD, Salinas MM (2008) Spatial heterogeneity and interregional spillovers in the European Union: do cohesion policies encourage convergence across regions? Eur Econ Rev 52(3):551–567. https://doi.org/10.1016/j.euroecorev.2007.05.006
Rey SJ, Le Gallo J (2009) Spatial analysis of economic convergence. In: Mills TC, Patterson K (eds) Palgrave handbook of econometrics: volume 2: applied econometrics. Palgrave Macmillan UK, London, pp 1251–1290. https://doi.org/10.1007/978-0-230-24440-5-27
Rey SJ, Montouri BD (1999) US regional income convergence: a spatial econometric perspective. Reg Stud 33(2):143–156. https://doi.org/10.1080/00343409950122945
Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592. https://doi.org/10.1093/biomet/63.3.581
Rubin DB (1987) Multiple imputation for nonresponse in surveys. Wiley, New York. https://doi.org/10.1002/9780470316696
Sanso-Navarro M, Vera-Cabello M, Ximénez-De-Embún DP (2017) Human capital spillovers and regional development. J Appl Econom 32(4):923–930. https://doi.org/10.1002/jae.2541
Svirydzenka K (2016) Introducing a new broad-based index of financial development. IMF working papers 16/5. International Monetary Fund
Wang W, Lee L-F (2013) Estimation of spatial panel data models with randomly missing data in the dependent variable. Reg Sci Urban Econ 43(3):521–538. https://doi.org/10.1016/j.regsciurbeco.2013.02.001
Acknowledgements
The authors have benefited from the valuable comments of two anonymous reviewers, Jesús Crespo Cuaresma and participants at the XXth Applied Economics Meeting (Valencia, Spain), the 57th ERSA Congress (Groningen, The Netherlands), the 43rd Simposio de la Asociación Española de Economía (Madrid, Spain) and the 23rd Conference on Macroeconomic Analysis and International Finance (Rethymno, Greece). This paper was originally written while the first author was visiting the Department of Economics (Institute of Macroeconomics) at the Vienna University of Economics and Business (WU), and he would like to acknowledge their hospitality and friendliness. Financial support from Gobierno de Aragón (S39-17R ADETRE Research Group), Ministerio de Economía y Competitividad (Grant ECO2017-82246-P), Ministerio de Educación, Cultura y Deporte (José Castillejo Grant CAS16/00010) and Universidad de Zaragoza (Grant JIUZ-2016-SOC-14) is also acknowledged.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendices
Appendix 1: steady-state uniqueness
The existence of a unique steady state is demonstrated by finding the values \(h_{i}^{*}\) and \(h^{*}=s\sum ^{N}_{i=1} (h_i^{*})^{\alpha }\) such that the capital \(h_{l}^{*}\) in any region l entails that there is no migration:
where \(v^{*}\) is the steady-state value of the normalization factor in expression (10).
The previous equation can be rewritten as
which is equivalent to saying that there is always an equilibrium with \(h_{i}^{*}=0, \forall i\). Once this possibility is discarded, there is an equilibrium given by \(h_{l}^{*} = b \left[ A_{l}^{\tau } \tilde{A}_{l}^{\frac{1-\tau }{1-\alpha }} \right] ^{\frac{1}{1-\alpha \tau }}\), where b is a positive constant with the same value for every region. Even though this constant depends on the entire profile of capital employment levels, each \(h_{i}^{*}\) has a negligible impact on it. This implies that the steady-state equilibrium will be interior and unique.
Appendix 2: multiple-imputation techniques
Multiple imputation is a simulation-based statistical technique that consists of replacing missing values by multiple sets of plausible values. The imputed values are not intended to represent the ‘real’ values, but rather to reproduce the variance–covariance structure that would have been observed if the missing information had not been missing. In doing so, multiple imputation handles missing data in a way that results in valid statistical inference. Although the theoretical foundations of this method were derived under the Bayesian paradigm, they are valid from a frequentist point of view. The Bayesian approach is used to create the imputations and underlies the combination of the estimated parameters. Once an imputation model is selected, M complete data sets are generated on which the analysis of interest is performed. Finally, the results from these M analyses are combined into a single multiple-imputation result. At this stage of the estimation, the coefficients and standard errors are adjusted for the variability between imputations, following the combination rules proposed by Rubin (1987).
Multiple imputation is preferable to alternative methods like listwise deletion, pairwise deletion, mean imputation and single imputations, because it only requires the missing data mechanism to be ignorable. In other words, it only requires that it is possible to disregard the process that caused the information to be missing, assuming that the data is missing at random (MAR). Another possibility is to assume that the data are missing completely at random (MCAR). In this case, the probability that information is missing does not depend on observed or unobserved variables, implying that the missing values are a random sample of all data values. If the missing data were not at random (MNAR), the reasons for being missing should be taken into account in the imputation model to obtain valid results.
Under the MAR assumption, iterative Markov chain Monte Carlo (MCMC) methods are used to simulate the imputed values from the posterior predictive distribution of the missing data given the observed data. The simulation error will decrease when the number of imputations increases, especially with high fractions of missing information. In the present paper, the simulations have been carried out running multiple independent chained equations (MICE), which obtain univariate conditional distributions for each variable from a fully conditional specification of the prediction equations. Even though this technique lacks a rigorous theoretical justification, its flexibility has made it a frequent choice in practice. More specifically, MICE is similar to the Gibbs sampler, a popular MCMC method for simulating data from complicated multivariate distributions.
Considering that the complete data X is a partially observed random sample from a multivariate distribution \(P\left( X|\theta \right) \), it has been assumed that the latter is completely specified by the unknown vector of parameters \(\theta \) (Buuren and Groothuis-Oudshoorn 2011). In order to obtain its multivariate distribution, the MICE algorithm samples iteratively, on a variable-by-variable basis, from the p univariate conditional distributions \(P\left( X_{1}|X_{-1},\theta _{-1}\right) \), ..., \(P\left( X_{p}|X_{-p},\theta _{-p}\right) \). It is worth noting that \(\theta _{1}\), ..., \(\theta _{p}\) are specific to their corresponding conditional densities and not necessarily the product of a factorization of \(P\left( X|\theta \right) \). The variables will be imputed from the most observed to the least observed. Taking a sample drawn from the observed marginal distributions as the starting point, the \(\kappa \)th iteration is a Gibbs sampler that draws
where \(X_{m}^{(\kappa )} = \left( X_{m}^\mathrm{obs},X_{m}^{*(\kappa )}\right) \) denotes the mth imputed variable.
Appendix 3: preliminary analysis
This appendix studies the extent to which changes in data handling and the sample period analysed and the use of the multiple-imputation technique, briefly described in “Appendix 2”, alter standard regression results. With this aim, Table 7 presents pooled OLS estimates of the growth equation (15) without spatial effects (i.e., imposing \(\rho =0\)) in four alternative samples. The first column of results reproduce those reported by GLLS—see specification (4) in Table 5 (page 282)—in order to compare them to the estimated parameters under our proposed treatment of the data, displayed in the second column. Although the explanatory power is now slightly higher, the number of observations is smaller and different coefficients are obtained.
The most important disparities are found for national GDP per capita and the indicator reflecting whether the region includes the capital city, whose estimated parameters are negative and statistically significant. Although coefficients of a similar magnitude are obtained for latitude, average distance to the coast, the ‘malaria index’ and years of schooling, this is not the case for oil and gas production and population density. These variables also change their statistical significance, which is now favourable for population density. Last, but not least, the parameter for the initial level of regional GDP per capita implies an even lower convergence speed than that reported by GLLS. The explanatory power and estimated coefficients are stable across samples. The greatest difference is in the variable that reflects the regional disease environment, as its corresponding parameter is not only higher but also statistically significant. In absolute values, the coefficients for the capital city indicator and the proxy for human capital are higher in the most recent period. These findings allow us to conclude that restricting the analysis to \(1980{-}2010\) does not alter the underlying relation between regional growth and its determinants in this empirical framework.
For each sample period analysed, the last two columns show the estimated parameters when the multiple-imputation technique is implemented, fixing \(M=20\). The difference between these pairs of sets of results depends on whether or not the panel structure underlying the data has been explicitly taken into account. Together with the growth determinants not affected by the missing information problem, the longitude of the regional centroid and country and time dummies have also been considered as explanatory variables in the imputation step. It can be stated that, in general, the use of this method does not modify either the sign or the statistical significance of the estimated parameters. This is especially the case when the panel nature of the data is controlled for, but at the cost of a higher sampling variance due to the missing information, as measured by the average relative variance increase (RVI) and the largest fraction of missing information (FMI).
In order to give a visual impression of how the multiple-imputation method works in practice, and for the sample covering the years from 1980 to 2010, Fig. 3 plots kernel estimates of the density function for the regional growth determinants with missing data. While the black lines correspond to the distributions of the observed data, the grey lines also include the imputed values. The density functions of both the observed and imputed data sets are similar for population density, which might explain why the estimated coefficient for this variable does not change when it is taken into account that some information is missing. More interestingly, both national and regional GDP per capita and years of education display higher frequencies in their lower values when imputed values are considered. This reflects the fact that the unbalanced character of the panel is mainly driven by the missing information problems suffered by the regions in countries with lower levels of development.
Appendix 4: panel-correlated random effects estimations—coefficients of region-specific averages
Mundlak (1978) proposed an empirical framework that permits the estimation of both time-varying and time-invariant variables with panel data. This method, known as correlated random effects (CRE) estimation, consists of including individual-specific means of time-varying covariates to account for their correlation with the individual effects. The CRE approach was extended to the panel version of the spatial Durbin model by Debarsy (2012). Table 8 reports the estimated coefficients in the empirical analysis carried out in the present paper for the region-specific averages of lagged regional and national income per capita levels, population density and years of schooling, as well as their spatial lags.
Rights and permissions
About this article
Cite this article
Sanso-Navarro, M., Vera-Cabello, M. & Puente-Ajovín, M. Regional convergence and spatial dependence: a worldwide perspective. Ann Reg Sci 65, 147–177 (2020). https://doi.org/10.1007/s00168-020-00978-4
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00168-020-00978-4