1 Introduction

In their influential essay, Acemoglu et al. (2005) provide convincing arguments in favor of the idea that institutions cause economic prosperity by providing “ right” incentives and constraints to the economic agents. Along the path of economic development, Acemoglu and coauthors claim, institutions emerge as outcomes of social decisions. Particularly, economic institutions encouraging economic growth may arise “ when political institutions allocate power to groups with interests in broad-based property rights enforcement, when they create effective constraints on power-holders, and when there are relatively few rents to be captured by power-holders”. This view traces back to North (1990), who defines institutions as “the rules of the game in a society or, more formally, [...] the humanly devised constraints that shape human interaction”. Consistently with this definition, the fundamental explanation of comparative growth should be sought in institutional differences. This is the perspective we adopt in this paper.

The attempt to understand cross-country differences in GDP dynamics through this lens is certainly not new.Footnote 1 We contribute in two ways. First, we provide new measures to describe a country’s institutional environment. Since the institutional setting is a multidimensional phenomenon and the array of connections between institutions and economic development is potentially extremely large, the first contribution of the paper is to propose a brand-new set of indices aimed at summarizing such multidimensionality. In our view, the term “institution” must be intended in a broad sense. Institutions affect the interactions among agents on many grounds. They operate formally, through the design of the rules of the game, but also informally, by shaping customs and social norms. The possibility for individuals and organizations—like corporations, public entities, financial institutions, etc.—to lead the society as a whole towards productive economic activities crucially depends on the incentives for these activities. Incentives that typically institutions provide. Building on the data provided by the Fraser Institute (2018), we focus on the following five measures: i) the size of the public sector, ii) the reliability and fairness of the legal system, iii) the degree of liquidity in the financial markets, iv) the degree of openness to international trade and v) the strength of regulation.Footnote 2 Our indices are obtained by employing a dimension reduction approach designed for panel data (Farcomeni et al. 2021) and rated on a 0–10 scale. Such a rating reflects the general idea that an institution is better the more it increases market freedom, protects private property rights, provides liquidity to the economy (with a beneficial effect on interest rates and capital accumulation), and promotes trade. These are considered the most important preconditions for a sustained economic development.

We are aware that some of our institutional indices are constructed starting from variables which, at best, can only be taken as proxies for institutions. While the legal system and the regulatory environment are intuitively identifiable as institutions per se, it might not be the same for other indices like, for instance, the size of the public sector. This measure, however, can be intended as a proxy for the welfare state, which is an institution itself or an aggregation of institutions, implicitly assuming that the larger is the public sector, the more developed is the welfare state. An assumption that seems to be supported by the data.Footnote 3 Similarly, well-functioning financial market and trade openness are informative about not only the appropriateness of the policies undertaken to pursue these goals but also about the soundness of the institutions which conceive and implement those policies.

As a second contribution, we use these indicators to assess their joint and separate role on GDP. We do this by explicitly taking into account the unobserved heterogeneity among countries. Using an optimal clustering method, we split our sample of 80 countries over the period 1980–2015, into two groups, “high-income” and “low- and middle-income” countries. By doing this we allow for heterogeneous effects of institutions among clusters. Then, applying the restrictions provided by an augmented version of the Solow model, including a role for institutions, we first estimate a Gaussian mixed-effects model to empirically prove the positive association between GDP (levels and growth rate) and our institutional indices. Finally, we employ the Generalized Propensity Score method proposed by Hirano and Imbens (2004) to properly address the issues of endogeneity and omitted variable bias and to identify the causal effect of institutions on GDP.

Our estimates show that the obtained institutional indices vary across the two groups of countries. We show that improvements in some institutions (i.e., larger values of our institutional indices) may cause both higher levels and growth rates of the long-run per capita GDP. Such effects appear stronger for those countries which have been classified as “low- and middle-income”, where, comparatively, markets are more dysfunctional and bureaucracies typically less efficient. Specifically, we document the important role played by the legal system in determining the long-run level of GDP in “low- and middle-income” countries. This has an important policy implication. In countries where basic institutions are often lacking, market-friendly policies may not yield desired results or may even be counterproductive. In such a context, reforms should aim at establishing a reliable legal system and protecting property rights.

Differently from the large body of the literature on the topic,Footnote 4 which focuses on the linear association between some institutional indices and GDP (levels and growth rate), a final important feature of our analysis is that it looks at and finds non-linear causal effects. Institutions affect GDP directly and indirectly, trough their interaction with cluster membership. Improvements in institutions always determine a positive level effect on per capita GDP. Our estimates document an interesting non-linear causal effect of (our proxy for) welfare state on GDP: the positive impact of this indicator tends to increase up to some limit (being smaller in the group of “low- and middle-income” countries) and then starts to decline (more sharply in the group of “low- and middle-income” countries). Also, all our institutional indicators display non-linear effects (despite not always statistically significant) on GDP growth. As a further check for non-linearity, we carry on a threshold analysis based on the augmented Solow model. This exercise confirms the presence of thresholds effects in the two groups of countries.

Our work relates closely to the empirical literature on the link between institutions and GDP dynamics, which has significantly increased over the last three decades as we have listed above. In general, a positive and direct relationship between institutions and GDP levels/growth rates is found. Estimates, however, substantially vary in terms of magnitude across different samples and/or specifications. Moreover, most of the papers rely only on few variables to capture institutional quality and/or do not provide any causal evidence on the relationship between institutions and GDP dynamics. Since the literature is vast, here we focus our attention to those studies which, like ours, build upon Mankiw et al. (1992) (MRW, hereafter).

Using a large sample of countries over the period 1975–1990, Dawson (1998) find that one standard deviation increase of an initial value of the “economic freedom” index above the mean provides a 3.78 percentage point higher growth rate in the subsequent 15-year sample period, holding the level of freedom fixed over the period. Taking data from 97 countries over the period 1974–89, Knach and Keefer (1995) introduce two institutional variables into an MRW regression, meant to capture the security of property rights and the enforcement of contracts, and find that an increase of one standard deviation in their “rule of law” index leads to an increase in the GDP growth rate by 0.504 of its standard deviation. In a subsequent paper, Keefer and Knack (1997) also show that whenever good institutions are absent convergence tends to be slower.

Analyzing a sample of 127 countries over the period 1950–1994, Hall and Jones (1999) show that differences in capital accumulation, productivity, and therefore output per worker are fundamentally related to differences in “social infrastructure” across countries. The positive impact of the “rule of law” on GDP growth has been found by Barro (1997), for a panel of 100 countries over the period 1960–1990, while Rodrik et al. (2004), using the data set of Acemoglu et al. (2001), find institutions to be crucial in determining the long-run level of a country’s income. Their estimates indicate that a one standard deviation increase in institutional quality produces a two log-points rise in per capita incomes. For a panel of 56 countries over the period 1981–2010, Nawaz (2015) find that the impact on GDP growth of various institutional variables is relatively larger in “high-income” countries as compared to the “low- and middle-income” ones.

Using a large sample of countries over the period 1960–2000, Minier (2007) focuses on the indirect effect of (political) institutions on growth, by introducing parameter heterogeneity into a growth regression. In such a frame, there are typically multiple growth regimes and threshold effects, which are ultimately affected by institutional quality. Minier’s estimates shed light on the interesting link between institutions and trade. Specifically, the weaker are the institutions of a country (proxied by several policy-related variables), the more it suffers from trade openness.Footnote 5

While most studies present a linear linkage between institutions and growth, there is also an empirical growth literature that deals with the non-linearities in the canonical cross-country growth regression.Footnote 6 For instance, using data on 100 countries over the years 1995–2018, Li and Kumbhakar (2022) propose a quantile regression model in which countries are grouped according to their growth rates, finding a positive effect of economic freedom on per capita GDP growth. In particular, they show that countries that fall into the 20th-50th percentiles of per capita GDP have a positive and significant effect of economic freedom on growth, whereas the effect is not significant below or above these percentiles. Our work belongs to this strand of the literature, examining whether the (causal) relationship between institutions and growth is subject to non-linearities after constructing optimal institutional indices.

The rest of the paper is organized as follows. Section 2 outlines and discusses the methodology proposed to derive the set of institutional indices and the empirical model to assess the role of institutions in explaining GDP dynamics. Section 3 describes the data set. Section 4 presents the estimates, with some comments. Section 5 is a conclusion.

2 Model and methodology

2.1 Institutional indices

Our first goal is to compute time-dependent summaries of indicators of interest. The main purpose of creating these institutional and policy indices is to identify unidimensional latent variables to summarize multidimensional indicators that, to some extent, are measuring similar characteristics from a different perspective. These latent variables can then be used for ranking and identifying different levels (doses) of the characteristics of interest (e.g., the reliability and fairness of the legal system). Notice that the resulting summaries are optimal from a specific mathematical perspective. However, they can only give a partial point of view on the information contained in the data.

There are different methods available for dimension reduction. The most widely used (e.g., principal component analysis) is anyway restricted to cross-sectional data and would not be appropriate for multidimensional measurements (in our case: a collection of indices that are deemed to measure different aspects of the same unidimensional latent trait) that are repeatedly measured over time (Hall et al. 2006). Among the different possible approaches proposed by the literature (e.g., Chen and Buja 2009; Maruotti et al. 2017), we opt for a methodology based on the specification of a latent Markov model (Bartolucci et al. 2013, 2014) for the latent trait, as in e.g. Xia et al. (2016) or Vogelsmeier et al. (2019). Specifically, we employ the methodology proposed by Farcomeni et al. (2021), whose main advantage is that it allows us to explicitly consider dependence arising from measurements on the same agent that is repeated over time.

Formally, let \(X_{itm}\) denote the m-th indicator for country i at time t. Let also \(U_{it}\) denote an unobserved discrete latent variable and \(w_m\) be the weight of latent class separation for \(m=1,\ldots ,M\). We assume \(Z_{it} = \sum _{m=1}^M w_m X_{itm}\) follows a latent Markov model according to which \(Z_{it}\) is independent of \(Z_{is}\) conditionally on \(U_{it}\), which follows a homogeneous first-order Markov chain. Additionally, conditional on \(U_{it}=j\) we assume \(Z_{it}\) is Gaussian with mean \(\xi _j(w)\). The optimal weights \({\hat{w}}_m\) for \(m=1,\ldots ,M\) optimize latent class separation, that is, maximize

$$\begin{aligned} \sum _{j=1}^k \sum _{t} {\hat{p}}_{tj}(w) ({\hat{\xi }}_{j}(w) - {\bar{\xi }}_t(w))^2, \end{aligned}$$
(1)

under the constraint \(\sum _m w_m^2=1\), where \(p_{tj}(w) = \Pr (U_{it} = j)\) and \({\bar{\xi }}_t(w)=\sum _j p_{tj}(w){{\hat{\xi }}}_j(w)\). In words, we set weights so that the latent means (the means of each subgroup as identified by \(U_{it}\)) are as far from each other as possible.

The resulting summary is a linear combination of the initial dimensions which optimizes the separation of clusters of agents (e.g., countries that have a more or a less reliable legal system). Weights can be used for the interpretation and assessment of the importance of the original variables. A limitation is a Gaussian assumption for \(Z_{it}\), which might not hold in practice if any \(X_{ith}\) is severely skewed, or if H is small.

Our methodology identifies five groups of indicators, which we summarize separately, creating treatment variables \(z_1\) to \(z_5\) (see Tables 11 and 12 in the Appendix for detailed descriptions) and jointly (treatment variable z). Finally, we normalize and scale the resulting indicators on a score of 0 (e.g., no reliability and fairness of the legal system) to 10 (e.g., highest reliability and fairness of the legal system).Footnote 7

2.2 The augmented Solow model

The rest of the paper is aimed at quantifying the causal effect of the institutional indices derived above on GDP levels and growth rates. To do this, we extend the canonical MRW’s setting to account for a direct impact of institutions on the Total Factor Productivity (TFP) [see, e.g. Nawaz and Khawaja (2019)].Footnote 8

For a country i at time t, we assume that the aggregate output is obtained through the following linearly homogeneous production function:

$$\begin{aligned} Y_{it}=K_{it}^{\alpha }H_{it}^{\beta }\left( A_{it}L_{it}\right) ^{1-\alpha -\beta }\qquad \text {with}\quad \alpha +\beta <1 \end{aligned}$$
(2)

where Y is the level of real GDP, K is the stock of physical capital, H is the stock of human capital, A is the Harrod-neutral technological progress and L is the labor force. We assume that the labor force and technology grow at the exogenously given rates n and g, respectively. For the sake of simplicity, we also assume that both forms of capital depreciate at the same constant rate \(\delta \). Let now \(\ln \left( \frac{Y_{it}}{L_{it}}\right) ^*\) denote the (natural logarithm of the) level of per capita GDP in the long-run, such that

$$\begin{aligned} \ln \left( \frac{Y_{it}}{L_{it}}\right) ^*= & {} \ln A_{i0}+ g_{it} +\left( \frac{\alpha }{1-\alpha -\beta }\right) \ln (s_{k})_{it}\\{} & {} +\left( \frac{\beta }{1-\alpha -\beta }\right) \ln (s_{h})_{it}-\left( \frac{\alpha +\beta }{1-\alpha -\beta }\right) \ln (n+g+\delta )_{it}, \end{aligned}$$

where \(s_k\) and \(s_h\) indicate the exogenous fractions of total income invested in physical capital and human capital, respectively. Notice that the term A is a reduced form to capture the large set of factors, other than inputs, that affect the steady-state level of GDP, such as resource endowments, climate, and institutions. Specifically, as in Dawson (1998), the notion that institutions affect productivity can be easily incorporated in the model by assuming A to be a function of institutions (z). Therefore, differently from MRW, in which \(\ln (A)_{it}=\alpha +\epsilon _{it}\), with \(\epsilon _i \sim N(0,1)\) representing a country-specific shock, in our set-up, we assume: \(\ln (A)_{it}=f(z_{it})+\epsilon _{it}\).Footnote 9 Using this, we obtain the following empirical equation:

$$\begin{aligned} \ln \left( \frac{Y_{it}}{L_{it}}\right) ^*= & {} \psi _0 +\psi _1f(z_{it})+ \psi _2\ln (s_{k})_{it}\nonumber \\{} & {} +\psi _3 \ln (s_{h})_{it}+\psi _4\ln (n+g+\delta )_{it} +\epsilon _{it}, \end{aligned}$$
(3)

where \(\psi _0+\psi _1f(z_{it})\) is the TFP, \(\psi _1\) captures the effect of institutions on per capita GDP, \(\psi _2\equiv \left( \frac{\alpha }{1-\alpha -\beta }\right) \), \(\psi _3\equiv \left( \frac{\beta }{1-\alpha -\beta }\right) \) and \(\psi _4\equiv -\left( \frac{\alpha +\beta }{1-\alpha -\beta }\right) \). This specification implies that differences in institutions have a homogeneous effect on the level of productivity across countries (\(\psi _1\)). The growth of per capita income can be then expressed as a function of the determinants of the steady-state and the initial level of income, i.e

$$\begin{aligned} \ln \left( \frac{{Y_t}/{L_t}}{{Y_0}/{L_0}}\right) =\left( 1-e^{-\lambda t}\right) \ln \left( \frac{Y_t}{L_t}\right) ^*- \left( 1-e^{-\lambda t}\right) \ln \left( \frac{Y_0}{L_0}\right) \qquad (\lambda >0), \end{aligned}$$
(4)

where \({Y_0}/{L_0}\) is the per capita income at some initial time and \(\lambda \) indicates the speed of conditional convergence toward the steady-state. Plugging (3) into (4) we finally get the following empirical equation:

$$\begin{aligned} \ln \left( \frac{Y}{L}\right) _{it}-\ln \left( \frac{Y}{L}\right) _{i0}= & {} \zeta _0+\zeta _1z_{it}+\zeta _2\ln (s_{k})_{it}\nonumber \\{} & {} +\,\zeta _3\ln (s_{h})_{it}+\zeta _4\ln (n+g+\delta )_{it}+\zeta _5\ln \left( \frac{Y}{L}\right) _{i0}+\epsilon _{it},\nonumber \\ \end{aligned}$$
(5)

where \(\zeta _0\equiv (1-e^{\lambda t})\psi _0\), \(\zeta _1\equiv (1-e^{-\lambda t})\psi _1f(z_{it})\), \(\zeta _2\equiv (1-e^{-\lambda t})\frac{\alpha }{1-\alpha -\beta }\), \(\zeta _3\equiv (1-e^{-\lambda t})\frac{\beta }{1-\alpha -\beta }\), \(\zeta _4\equiv -(1-e^{-\lambda t})\frac{\alpha +\beta }{1-\alpha -\beta }\) and \(\zeta _4\equiv -(1-e^{-\lambda t})\).

2.3 Estimation method

We first divide countries into groups according to a model-based clustering method. To do so, we restrict to the (log of) GDP in 1980 and compare twenty possible Gaussian mixture models, combining \(k=1,\ldots ,9\) groups with homogeneous or heterogeneous cluster-specific variance. The resulting optimal clustering is then used as a control, being a possible proxy for residual unobserved heterogeneity.

We then estimate Gaussian mixed-effects models in which we include fixed effects for treatment (z, \(z_1, \ldots , z_5\)), its square, interactions with cluster indicators, and control variables. For each endpoint \(x_{it}\) this leads to the equation

$$\begin{aligned} x_{it}= & {} \eta _{i0} +\eta _1 z_{it} + \eta _2 z_{it}^2 + \eta _3 c_i + \eta _4 z_{it}c_i + \eta _5 z_{it}^2c_i + \eta _6 t \nonumber \\{} & {} +\, \eta _7\ln (s_{k})_{it}+\eta _8 \ln (s_{h})_{it}+\eta _9\ln (n+g+\delta )_{it} +\epsilon _{it}, \end{aligned}$$
(6)

The model above reduces to (3) where the augmented Solow model is year and cluster (\(c_i\)) specific, with \(f(z_{it}) = (\eta _1 z_{it} + \eta _2 z_{it}^2 + \eta _3 c_i + \eta _4 z_{it}c_i + \eta _5 z_{it}^2c_i)/\psi _1\).Footnote 10

Subsequently, we put forward a causal analysis using a Generalized Propensity Score (GPS) method (Hirano and Imbens 2004). This is a generalization of the propensity score method for continuous treatments. Accordingly, we estimate a fixed-effect model to predict each treatment using controls and a country-specific intercept, as

$$\begin{aligned} E[z_{it}]= & {} {{\hat{z}}}_{it} = \eta _{0,i} + t + \eta _1\ln \left( y\right) _{i, t-1}\nonumber \\{} & {} +\, \eta _2\ln (s_{k})_{it}+\eta _1\ln (s_{h})_{it}+\eta _3\ln (n+g+\delta )_{it}, \end{aligned}$$
(7)

where y denotes the log of real per capita GDP, \(\ln \left( {Y}/{L}\right) \). The resulting predicted treatment \({{\hat{z}}}_{it}\) and its square is then included in a regression model to predict the outcome \(x_{it}\), which is either the log-GDP or its growth rate, as in

$$\begin{aligned} E\left[ x_{it}\right]= & {} \omega _0 + \omega _1 z_{it} + \omega _2z_{it}^2 + \omega _3{{\hat{z}}}_{it} + \omega _4{{\hat{z}}}_{it}^2 \nonumber \\{} & {} +\, \omega _5{{\hat{z}}}_{it}\times z_{it} + \omega _6 cluster_i\times z_{it} +\omega _7cluster_i\times z_{it}^2, \end{aligned}$$
(8)

together with the treatment, its square, and interactions of treatment and GPS with cluster indicators. The resulting predicted dose-response surface can be used to assess causal relationships between the treatment and endpoint, as discussed in Hirano and Imbens (2004) and references therein.

We note that a limitation of the GPS method is that it requires a selection-on-observables assumption, unlike Instrumental Variables (IV), Difference-in-Differences (DiD), and similar methods. The latter is not simply applicable in our context anyway as reliable IV are not available for our setting; and complex dose-response relationships are not amenable to assumptions underlying the DiD method. Similar reasoning about these assumptions applies for instance to panel cointegration methods and Generalized Method of Moments (GMM) estimation.

3 Data

To construct our sample, we merge information from three different sources. Our final sample contains country-level data for 80 countries from 1980 to 2015 taken over every fifth year.Footnote 11 Our main dependent variable is the real per capita GDP (y) taken from The World Bank (2018). We used this variable to construct our second dependent variable, which is the 5 years average growth rate of the real per capita GDP (Growth). This leaves us with seven data points for each country while at the same time controlling for initial income (\(y_{t-1}\)) which starts from 1980. Data on the total population used in constructing effective labor (\(n+g+\delta \)) and the investment share (I/GDP) that are seen to affect GDP dynamics were also taken from The World Bank (2018). The rate of human capital accumulation has been proxied by the Human Capital Index (HC) taken from the PWT (2018).

Finally, the variables used in the construction of our optimal institutional indices were taken from the Fraser Institute (2018) database.Footnote 12 The optimal summary index (z) and the optimal sub-indices (\(z_i\), \(i=1,\dots ,5\)) have been obtained by applying the methodology proposed in Sect. 2.1. Specifically, the summary index, z is constructed from the sub-indices Public sector size \((z_1)\), Reliability and fairness of the legal system \((z_2)\), Liquidity market openness \((z_3)\), Degree of (trade) protectionism \((z_4)\), and Regulation \((z_5)\). As part of our investigation, we will conduct several robustness analyses with the five optimal sub-indices of institutions (\(z_1\) to \(z_5\)) as alternative treatments to the overall institutional variable z. A detailed description of the Fraser Institute (2018) variables used to construct our treatment indices and the variables employed in our regressions can be found in Tables 11 and 12 in the Appendix.

Table 1 presents the summary statistics of key variables used in the analysis. Overall, there are 560 observations across 80 countries for 7-year periods taken every fifth year. On average, the natural logarithm of real per capita GDP is about 8.56 (equivalent to 5218 (in millions of US Dollars)), and countries’ GDP growth rates are approximately 0.08. The average institutional index is approximately 7.1 (score out of 10). The analysis also includes the binary variable ‘cluster’, which is 1 for “high-income” countries and zero otherwise.Footnote 13 Table 2 reports the correlation matrix among key variables.

Table 1 Summary statistics
Table 2 Correlation matrix for key variables

4 Results

4.1 Regime Membership

To partially remove effects of initial conditions, we classify countries with respect to their initial per capita GDP in 1980 (\(y_{0}\)). Clearly some countries will move to other clusters and others will persist in their initial cluster. By adjusting we remove confounding due to the initial status of each country. Using the Bayesian Information Criterion (BIC) and as suggested by the Classification Trimmed Likelihood (CTL) curves (Garcia-Escudero et al. 2011; Farcomeni and Greco 2015) presented in Fig. 1, we identify two clusters. The figure shows the objective function at convergence for the different number of clusters and increasing trimming levels \(\alpha \). The curves for \(k=2,3,4\) clusters almost overlap, while there is a gap for \(k=1\) versus \(k=2\), indicating that the optimal number of groups is \(k=2\).

We are then left with a predictable grouping reported in Table 3. This leads to the variable ‘cluster’, the indicator of being a “high-income” country (Cluster 2). Overall, there are 23 “high-income” countries out of the 80 countries in our sample.Footnote 14

Fig. 1
figure 1

CTL curves

Table 3 Classification of countries based on initial income (1980)

4.2 Estimates

4.2.1 Institutions and GDP level

Table 4 reports the results of the model for GDP level using the Main institutional index (Model 1) and the five sub-indices (Models 2–6).

Table 4 Mixed-effect estimates: institutions and GDP level

In the analysis conducted on the whole sample, we find that the effect on the long-run level of income of our aggregate institutional index (z) is essentially null in “low- and middle- income” countries (0.001) while it is positive (despite not statistically significant) in “high-income” countries (0.046). Parameter estimates for physical capital (0.102) and human capital (0.587), which are both statistically significant, are in line with the recent empirical literature based on MRW.Footnote 15

The results presented in the remaining five alternative specifications (models 2–6) employ a set of covariates including one sub-index in each estimation. For “low- and middle-income” countries, the sub-index Reliability and fairness of the legal system (model 3) positively (0.058) and significantly (\(p\) value \(<0.001\)) affects the level of income in the long-run while we find a negative impact of the Liquidity market openness (model 4) sub-index (\(-\)0.035, with a \(p\) value \(<0.005\)).Footnote 16

4.2.2 Institutions and GDP growth

The analysis conducted on the whole sample shows that improvements in the main institutional index (z) foster economic development in “low- and middle-income” countries.

Table 5 Mixed-effect estimates: institutions and GDP growth

Table 5 reports the estimates of the growth regression model. The index z is found to have a positive impact (0.030 with a \(p\) value \(<0.01\)) on the 5-year average real per capita GDP growth rate (model 1). The effect is not conclusive for “high-income” countries since the parameter for the interaction \(z \times cluster\) is not statistically significant. The coefficients for physical capital (0.159) and human capital (0.182) are in line with the literature based on MRW while the coefficient for the lagged value of GDP (\(-\)0.061) indicates that there is a slight tendency toward convergence in our sample.

The results for the baseline growth regression when using the five alternative synthetic sub-indices taken in isolation are reported in models 2–6 of the table. There is evidence of Public sector size (\(z_1\)) being harmful to growth for “low- and middle-income” countries (\(-\)0.025, \(p\) value \(<0.05\)) while the GDP growth effect of the Degree of (trade) protectionism (\(z_4\)) is negative in “high-income” countries (\(-\)0.070, \(p\) value \(<0.01\)).

4.2.3 Estimates with all five sub-indices of institutions

Table 6 Mixed-effect model, institutions and GDP -level/-growth

Results in Table 6 present estimates for using all the five sub-indices of institutions (\(z_i: i=1,\ldots , 5\)) as regressors together with the other covariates. From model 1 of the table, we find a positive and statistically significant impact on GDP (0.056, \(p\) value \(<0.01\)) of the sub-index Reliability and fairness of the legal system in “low- and middle-income” countries.

In the growth specification (model 2), the sub-indices that have statistically significant effects are Public sector size for “low- and middle-income” countries (\(-\)0.027, \(p\) value \(<0.05\)) and the Degree of (trade) protectionism (\(z_4\)) for “high-income” countries (\(-\)0.054, \(p\) value \(<0.05\)).

4.3 Generalized Propensity Score Analysis

We use the Generalized Propensity Score (GPS) estimator to evaluate the causal effect of each treatment on GDP dynamics. Tables 7 and 8 report the estimates while Figs. 2 and 3 present dose-response curves for “high-income” (solid line) and “low- and middle-income” (dotted line) countries in models 1–6.

Table 7 GPS estimates: institutions and GDP level

From Table 7 and Fig. 2, we see that with the partial exception of Public sector size (\(z_1\)) (see the second plot of Fig. 2 in which the dotted lines do not always lie above the solid ones), an improvement in institutions causes a more pronounced level effect on GDP in “high-income” countries.

Fig. 2
figure 2

Dose-response: causal effect of institutions on GDP level. Note: The treatments used in the various panels are (1)—Main institutional index (z), (2)—Public sector size (\(z_1\)), (3)—Reliability and fairness of the legal system (\(z_2\)), (4)—Liquidity market openness (\(z_3\)), (5)—Degree of (trade) protectionism (\(z_4\)), (6)—Regulation (\(z_5\))

Estimates in Table 8 and dose-response curves in Fig. 3 exhibit some form of non-linearity in the causal effect of institutions on growth.Footnote 17 The overall index (z) and sub-indices—with the exception of \(z_4\) for “low- and middle-income” countries—display a concave pattern.

The non-linear relationship in the causal effect of Public sector size (\(z_1\)) on GDP growth rate is reminiscent of Barro (1990). Public provision of infrastructure, rule of law, and protection of property rights is particularly important for growth in the early phases of the economic development. In Panel (2) of Fig. 3, the dotted curve lays above the solid one for \(z_1\ge 5\), suggesting that, to exert a positive effect on growth in “low- and middle-income” countries, the size of the public sector cannot be too low. However, as it gets too large, distortionary effects due to high taxes and public borrowing, as well as diminishing returns to public capital may emerge.Footnote 18

The non-monotonic effect of the strength of regulation (\(z_5\)) on GDP growth in the cluster of “high-income” countries seems to capture the stylized fact that a heavier regulatory burden tends to reduce productivity growth in OECD countries.Footnote 19

Trade protectionism (\(z_4\)) appears to be an important source of growth in “low- and middle-income” countries. Despite far from been conclusive, this result is consistent with the correlation between protectionist or inward-oriented trade strategies and growth in the so-called “first era of globalization”.Footnote 20

Table 8 GPS estimates: institutions and GDP growth
Fig. 3
figure 3

Dose-response: causal effect of institutions on GDP growth. Note: See notes under Fig. 2

4.4 Sub-sample Analysis

With the copious number of studies revealing institutional lapses in developing countries, Tables 1516, and 17 as well as Tables 18 and 19 (all of them in the Appendix) report results of the analysis conducted on a restricted sub-sample of “low- and middle-income” countries when using the mixed effect and GPS approaches, respectively.Footnote 21 Notice that in this sub-sample analysis, we do not include the interaction \(z -cluster\), since it is not identifiable in the sub-sample. The reason is that we stratified by cluster and this variable is a constant in each sub-sample.

From the results presented in Tables 15 and 16, we find no significant effect of institutions on GDP level but a positive linear effect (0.027, \(p\) value \(<0.01\)) on its growth rate. In terms of the sub-indices, we observe a non-linear relationship between GDP dynamics and Public sector size (\(z_1\)) as well as Degree of (trade) protectionism (\(z_6\)), such that increases in the sub-indices causes higher income and faster growth only if they do not exceed values around 4. There is also a significant non-linear relationship between GDP growth and Liquidity market openness (\(z_4\)) but the effect is weak (\(-\)0.001, \(p\) value \(< 0.10\)) and decreases at higher values of the index.

Such non-linearities appear even clearer from the dose-response curves shown in Figs. 4 and 5. The beneficial effect on GDP due to improvements in institutions (z) emerges only for higher values of the index (\(z>5\)), as shown in Panel (1) of Fig. 4. Almost the opposite instead occurs when we assess the causal impact of z on GDP growth, with a dose-response plot showing a concave pattern, as illustrated in Panel (1) of Fig. 5.

4.5 Threshold Effects

We have documented that improvements in a country’s institutional indices produce different effects on GDP (levels and growth rates), depending on whether the country belongs to the “high-income” or the “low- and middle-income” cluster. The analysis presented in Sect. 4.3 provides evidence about the possible non-linear causal effects of institutions on GDP dynamics. Those estimates, however, pertain to the reduced form regressions (7) and (8) which go beyond the standard (log-)linear growth model. To reconcile the issue of non-linear effects with the canonical growth model, we carry on a threshold analysis which incorporates all the restrictions provided by the augmented Solow model presented in Sect. 2.2.

To test for the presence of potential threshold effects within the various classifications provided in Table 3, we employ the dynamic panel threshold strategy proposed by Seo and Shin (2016), which allows for non-linear asymmetric dynamics, unobserved heterogeneity, and treats economic institutions as an endogenous variable.Footnote 22 For the sake of space, we restrict our attention to the relationship between the main institutional index, (z) and the GDP growth rate.

The model considered is of the form:

$$\begin{aligned} \triangle y_{it} = \lambda \triangle y_{it-1} + \triangle x'_{it} \beta + (1, x_{it}')\delta 1 \{{\hat{z}}_{it}> \gamma \} - (1,x_{it-1}')\delta 1 \{{\hat{z}}_{it-1} > \gamma \} + \triangle \epsilon _{it}, \nonumber \\ \end{aligned}$$
(9)

where \(y_{it}\) is the natural logarithm of real per capita GDP, \(z_{it}\) is our optimal measure of institutions (transition variable) and \(x_{it}\) is a set of covariates including natural logarithms of total population, human and physical capital. Also, \(\gamma \) is the threshold parameter and the error term, \(\epsilon _{it}\). We used lagged values of political institutions as one of the instruments that lead to the selection of economic institutions together with the other exogenous covariates in an attempt to address the issue of endogeneity. The use of this instrument is motivated by the hierarchy of institutions hypothesis introduced by Acemoglu et al. (2005) where political institutions have been documented to set the stage for their economic institutional counterparts which affect economic outcomes of a country.Footnote 23 From equation (9), the hypothesis of interest is the null, \(H_0: \delta = 0\) as against the alternative \(H_1: \delta \ne 0\).

Using the first difference generalized method of moments estimator (FD-GMM), Models 1, 2, and 3 of Table 9 presents the results with the full sample of 80 countries, “high-income”, and “low- and middle-income” countries, respectively. To have comparable results, we report the estimated coefficients for countries below (\(\phi \)) and above (\(\tau \)) the estimated threshold effects in each cluster, respectively.

Table 9 Institutional threshold effects

Following the old “rule of thumb” (see Steiger and Stock 1997; Stock and Yogo 2002) which says for the weak identification surrounding the instrumental variable not to be considered a problem, the \(F\)-statistics should be at least 10, we found the \(F\)-statistic to be above 10 for “high-income” countries and close to the 10 for “low- and middle-income” countries and the overall sample. In general, the estimated threshold effects (\({\hat{\gamma }}\)) are statistically significantly different from zero and similar to those reported in Acquah (2021) who used the original institutional indices from the Fraser Institute (2018) in a similar estimation approach. Particularly, for economic institutions to influence GDP growth, it must on average develop to a point of 6, 8, and 7 (out of a score of 10) for the full sample of 80, “high-income” and “low- and middle-income” countries, respectively. Since the threshold variable is unit-free, we interpret the estimated long-run effect of institutions towards GDP growth in reference to the estimated threshold parameter (\({\hat{\gamma }}\)) as a way of providing some understanding into the gains or losses of institutions for countries whose institutional developments are below (\({\hat{\phi }}_{\triangle q}\)) and above (\({\hat{\tau }}_{\triangle q}\)) the estimated threshold effect in what follows. From Table 9, we observe a significant difference in the parameter estimates of countries above and below the estimated threshold effect when using our institutional index. Above the estimated threshold effect of 8 (out of 10), changes (if the change persists for 5 years) in our institutional index leads to an increase in the growth rate of “high-income” countries by 0.4 percentage points (Model 2). The corresponding effect is positive for “low- and middle-income” countries but statistically not significant and negative in the overall sample. Interestingly, below the threshold of 7 (out of 10), improvements in the institutional measure are associated with an increase in the GDP growth rate by 0.026 percentage points for the “low- and middle-income” countries (Model 3). The coefficient estimates of the other variables are equally different in magnitude and/or signs for the sample above and below the threshold effect in Models 1–3.

4.6 Instrumental Variables

In this section, we assess how our institutional measures perform in comparison to the most frequently used proxy for institution, namely the Rule of law index, within a framework that puts the joint role of institutions and human capital center stage.Footnote 24 To do this we estimate a more parsimonious model in which both our institutional measures and human capital are simultaneously treated as endogenous and instrumented using historical variables. Specifically, we use i) the mortality rate of European settlers in former colonies to instrument country’s institutional quality, as in Acemoglu et al. (2001), and ii) the presence of Protestant missionary activity to instrument human capital in the former colonies, as in Acemoglu et al. (2014).Footnote 25

Because the empirical model is identical to that in Acemoglu et al. (2014), we shall be brief. The dependent variable is the (log of the) current level of GDP. Table 10 presents a comparison between the estimates of the main model in Acemoglu et al. (2014), in which the Rule of law index is used as a proxy for institutions and the (average) Years of Schooling as a proxy for human capital, and two models using our institutional measures, i.e., the Public sector size \(z_1\) (Model 1) and the Reliability and fairness of the legal system \(z_2\) (Model 2), respectively.Footnote 26 The bottom half of the table provides the first stages for the two endogenous variables. The differences in the variables used and the estimation technique justify the differences in the magnitude of the parameters.

As in Acemoglu et al. (2014), the coefficient on human capital is positive and significant (\(p\) value \(<0.001\)) while the coefficient on our institutional measure is positive and barely significant (\(p\) value \(<0.10\)) in both Models 1 and 2. First stage estimates are in line with those in Acemoglu et al. (2014) and document a negative association between settlers mortality and institutions, which is statistically significant only in Model 2, and between settlers mortality and human capital, which is instead always statistically significant. These results survive to several robustness checks.Footnote 27

Overall, the IV regressions, in which both institutions and human capital are instrumented using historical sources of variation, show a positive effect of the two variables on the current level of GDP. The effect of Public sector size (Model 1) and the Reliability and fairness of the legal system (Model 2), however, tends to be lower in magnitude and less precisely estimated than the one of the (log of) Human Capital Index. This result is in line with Glaeser et al. (2004) and the literature that suggests that human capital is a more basic source of economic prosperity than political institutions.

Table 10 2SLS, a comparison with Acemoglu et al. (2014)

5 Concluding remarks

This paper contributes to the debate on the nexus among institutions and economic development in two ways. It provides a new set of indices to capture a country’s institutional environment and it empirically investigates whether there is any causality running from institutions to economic development.

Building on Fraser Institute (2018), we propose a dimension reduction approach to obtain a new set of indices to summarize the multidimensionality of a country’s institutional setting. To identify the causal effect of these brand-new institutional indices on GDP (levels and growth rate) we employ the Generalized Propensity Score estimation approach. Using a large sample of countries over the period 1980–2015, our analysis documents the positive and statistically significant impact that improvements in institutions have on the growth rate of per capita GDP, in the economies that, according to our classification, belong to the cluster of “low- and middle-income”. Moreover, we find a sizable effect of human capital on GDP dynamics.

Our causal analysis also shows non-linearities in the effects that different institutions have on income and growth. The empirical model used to test causality takes into account the role of physical and human capital and lets institutions interact with cluster membership. The sub-index that captures the extent of welfare state, which we term Public sector size (\(z_{1}\)), displays a concave pattern in both regression models. Improvements of this index produce gains in terms of higher income and faster growth especially in less advanced economies, provided that the value of the sub-index is not too high. Despite not always statistically significant, improvements in all the other considered institutions cause a positive level effect that is larger for “low- and middle-income” countries.

The Mixed-Effect Model also stresses reliability and fairness of the legal system as a crucial driver for economic development. This result is reminiscent of La Porta et al. (2008) and has several policy implications. Specifically, our analysis reveals that the design and the implementation of legal reforms appear to be particularly important in “low- and middle-income” countries. Policy interventions aimed at improving this institution are complex. Such interventions pertain to i) drafting and enacting of laws and regulations, ii) enforcing laws and regulations, and iii) resolving and settling disputes. Like many economists, political scientists, and legal scholars have pointed out, however, legal reforms in a society emerge as an equilibrium outcome, thus reflecting the balance between different interests of different social groups.Footnote 28 Moreover, the so-called “legal transplant” has rarely turned out to be successful.Footnote 29

Finally, we document interesting threshold effects which support the existence of non-linearities. Again, higher values in our institutional indices, which typically translate into advances in institutional quality, are particularly important for those countries which are below the estimated threshold and belong to the cluster of “low- and middle-income” countries.