What Really Drives Economic Growth in Sub-Saharan Africa? Evidence from the Lasso Regularization and Inferential Techniques

Ofori, Isaac K.; Obeng, Camara K.; Asongu, Simplice A.

doi:10.1007/s13132-022-01055-1

What Really Drives Economic Growth in Sub-Saharan Africa? Evidence from the Lasso Regularization and Inferential Techniques

Open access
Published: 21 October 2022

Volume 15, pages 144–179, (2024)
Cite this article

Download PDF

You have full access to this open access article

Journal of the Knowledge Economy Aims and scope Submit manuscript

What Really Drives Economic Growth in Sub-Saharan Africa? Evidence from the Lasso Regularization and Inferential Techniques

Download PDF

2457 Accesses
4 Citations
Explore all metrics

Abstract

The question of what really drives economic growth in sub-Saharan Africa (SSA) has been debated for many decades now. However, there is still a lack of clarity on the variables crucial for driving growth as prior contributions have been executed at the backdrop of preferential selection of covariates in the midst of several potential drivers of economic growth. The main challenge with such contributions is that even tenuous variables may be deemed influential under some model specifications and assumptions. To address this and inform policy appropriately, we train algorithms for four machine learning regularization techniques— the Standard lasso, the Adaptive lasso, the minimum Schwarz Bayesian information criterion lasso, and the ElasticNet— to study patterns in a dataset containing 113 covariates and identify the key variables affecting growth in SSA. We find that only 7 covariates are key for driving growth in SSA. The estimates of these variables are provided by running the lasso inferential techniques of double-selection linear regression, partialing-out lasso linear regression, and partialing-out lasso instrumental variable regression. Policy recommendations are also provided in line with the AfCFTA and the green growth agenda of the region.

What Really Drives Financial Sector Development in Ghana?

Variable Selection and Estimation in Kink Regression Model

Of needles and haystacks: revisiting growth determinants by robust Bayesian variable selection

Article 30 June 2017

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

The debate on the sources of growth continues to generate attention in the political and academic landscapes due to its relevance for policy formulations on welfare, international competition, and economic management. From the saving-oriented (Domar, 1947; Harrod, 1939) and technical progress neoclassical theories of economic growth (Solow, 1956) to the imperfect market-augmented endogenous growth theories of Romer (1990), Aghion and Howitt (1990), and Grossman and Helpman (1991), economists are still exploring which variables matter for growth. The need to identify the key drivers of economic growth has even become crucial than ever following the emergence of the coronavirus pandemic (IMF, 2020; World Bank, 2020). For developing economies such as those in sub-Saharan Africa (SSA), knowledge on the key drivers of economic growth is a great step in formulating and implementing policies to foster, sustain, and share growth. Additionally, identifying the key drivers of economic growth would be a giant breakthrough on the parts of policymakers and developing partners in mapping out growth strategies in line with the green growth^{Footnote 1} agenda of the region.

A plethora of prior contributions identifies covariates such as trade openness, foreign direct investment, and innovation (Agbloyor et al., 2014; Sakyi et al., 2015), financial development (Opoku et al., 2019; Peprah et al., 2019), macroeconomic management (Alagidede & Ibrahim, 2017), institutional quality (Berhane, 2018; Chakamera, 2018), human capital (Anyanwu, 2014; Gyimah-Brempong et al., 2006), and ICT (Adeleye & Eboagu, 2019; Asongu & Odhiambo, 2019) as drivers of economic growth in SSA. A conspicuous lacuna in the extant scholarship, however, is that, all these variables deemed crucial for economic growth are selected based on the researcher’s discretion even in large dataset regression problems. The concern with preferential selection of covariates is that, even tenuous drivers of growth may be deemed highly influential under certain assumptions, model specification, or estimation techniques. Another challenge is that, the preferential selection of covariates in the midst of several potential determinants of growth partly contributes to the inconclusive results in big data regression problems. Addressing this challenge and thus informing policy appropriately can be through the application of machine learning^{Footnote 2} (artificial intelligence) techniques for regularization, and inference (see Tibshirani, 1996; Zou & Hastie, 2005; Zou, 2006). Indeed, machine learning techniques have been applied in various fields, for example, in health (see Mateen et al., 2020; Doupe et al., 2019; Beam & Kohane, 2018), transportation (Bhavsar et al., 2017; Tizghadam et al., 2019), games and psychology (Sandeep et al., 2020; De Almeida-Rocha & Duarte, 2019; Luxton, 2016), and finance (Bredt, 2019; Bazarbash, 2019; Akbari et al., 2021).

Despite the rise in the application of machine learning techniques in several fields, rigorous empirical works exploring its applicability and power in selecting variables crucial for economic growth in SSA are hard to find. This fundamentally forms the contribution of this paper. The first objective, therefore, is to train several machine learning algorithms to identify the main drivers of economic growth in SSA. The second objective is to provide reliable estimates and confidence intervals for these main determinants of economic growth, taking into consideration possible endogeneity, multicollinearity, and modeling complexities. To the best of our knowledge, this study is the first of its kind in SSA to apply machine learning techniques in selecting the main drivers of economic growth. Particularly, following renewed efforts to achieve, sustain, and share growth gains in line with implementation of the African Continental Free Trade Area (AfCFTA) and the institution of the African Agenda 2063, our results could prove crucial to the course by aiding the planning, modeling, and the targeting of growth.

Our choice of the study area is informed by a number of factors. First, as Kaufman et al. (2010) note, SSA countries are fundamentally common in terms of institutions. Despite lags in several facets of governance such as the rule of law, regulatory quality, and corruption-control, the quality of these indicators, as the World Governance Indicators suggest, is rising steadily across the region. However, macroeconomic challenges relating to inflation, exchange rate fluctuations, macroeconomic bailouts, and geopolitical fragilities are common among countries in the region. Second, SSA countries are remarkably similar in terms of structural or real sector setting (OECD/ILO, 2019; UNCTAD, 2021; World Bank, 2021a). For instance, most of the region’s active workforce is employed in the agricultural sector and are more susceptible to political, financial, and trade shocks. Also worth mentioning is the common goal of SSA countries in using economic integration^{Footnote 3} as a vehicle to spur industrialization, growth, poverty alleviation, and equitable income distribution. Another peculiarity is the low industrial output but fast rising service sector, providing policymakers with opportunities to leapfrog classical development processes (IMF 2020). Third, as noted by the African Development Bank (2018), countries in the region are markedly common in infrastructural development. Particularly, SSA countries report sharp deficits in digital and physical infrastructure such as ICT, electricity, transportation, as well as water and sanitation compared to their North African counterparts. Finally, countries in SSA are substantially similar in terms of growth trajectories, level of development, and lingering concerns of inability to build sustained growth momentum.

The rest of the paper is organized as follows: the next section presents a brief survey of economics-related studies applying machine learning. The data and empirical models are also presented in Sect. 3. The results and discussions are presented in Sect. 4 while Sect. 5 concludes with some policy recommendations.

Literature Survey on Empirical Works Using Machine Learning

The literature on economic growth is vast and an attempt to present all of them will be a daunting one. Therefore, attention is paid to the recent advances and applications of machine learning regularization techniques in the area of growth and development. For instance, this study is similar to Schneider and Wagner (2012) who focus solely on the lasso (least absolute shrinkage and selection operator) in determining the key drivers of growth in the NUTS2 region^{Footnote 4} of the European Union over the period 1995–2005. The results indicate that covariates such as initial GDP per capita, human capital, and initial unemployment rate matter for economic growth.

Similarly, in identifying which income distribution measure matter for development outcomes, Dutt and Tsetlin (2016) applied the Elasticnet and the lasso techniques to select from 37 potential covariates of development. The authors find that the poverty headcount indicator matters most in predicting three development outcomes (i.e., per capita income, schooling, and institutional quality). A similar work is Tkacz (2001), which, in forecasting Canadian GDP growth, applied the neural network algorithms. The study finds that, relative to traditional methods such as the linear and univariate forecasting methods, neural network techniques yield lower forecast errors on annual growth rate. The author goes further to indicate that neural techniques perform better in forecasting long-term growth than short-term growth. Further, Richardson et al. (2021) explore the power of several machine learning techniques^{Footnote 5} relative to classical methods in forecasting real GDP growth in New Zealand. The authors find that machine learning algorithms outperform classical statistical methods in prediction. Jung et al. (2018) also employ machine learning algorithms of lasso, ridge, Elasticnet, neural networks, and super learner to examine the GDP growth of the G7 countries. The authors provide strong evidence to conclude that machine learning algorithms outperformed standard prediction techniques.

In the case of SSA, however, the literature shows that researchers have not explored how relevant these techniques can be in aiding policymakers plan and target growth. The results we provide could prove invaluable in helping policymakers turn around the slow growth (real GDP per capita) trajectories of the SSA as presented in Figs. 1 and 8 in the Appendix.

Literature Survey on Drivers of Economic Growth Based on Traditional Techniques

In this section, we present a survey of the literature on the effects of several covariates included in this study on economic growth. Using a dataset on 21 SSA countries for the period 2000–2014, Ngongang (2015) employed the dynamic GMM technique to examine the relationship between financial development and economic growth. The author finds a significant positive relationship between the variables. In the same way, Ibrahim and Alagidede (2018) use a panel dataset spanning 1980–2014 for 29 SSA countries to examine the conditional and unconditional effects of financial development in economic growth. The results suggest that while financial development has a positive impact on economic growth, the joint effect of financial development and investment is rather remarkable. Kodongo and Ojah (2016) also explore the link between infrastructure and economic development in SSA countries. The results, which are based on system GMM estimator and a dataset on 45 SSA countries for the period 2000–2011, show that relative to middle-income countries, infrastructure plays a salient role in the economic development of least developed countries.

Omoteso and Mobolaji (2014) also apply the panel fixed effect, random effect, and the maximum likelihood estimation techniques to test the linear relationship between governance and economic growth in some selected SSA countries for the period 2002 to 2009. The authors find strong evidence to conclude that while political stability and regulatory quality enhance growth, government effectiveness adversely affect economic growth. Using a panel of 27 countries in SSA, Kebede and Takyi (2017) also employed the panel causality and system GMM estimation techniques to examine the relationship between institutional quality and economic growth. While the authors find a unidirectional causality from economic growth to institutional quality, the reverse does not hold. The results further show that institutional quality, trade openness, financial development, and debt positively affect economic growth.

In exploring the link between government expenditure and economic growth, Olaoye et al. (2020) apply the system GMM and the Driscoll and Kraay estimator to examine the asymmetrical phenomenon in government spending and growth relationship in 15 ECOWAS countries. Aside from confirming the asymmetric link between government spending and economic growth, the authors find evidence of an inverted U-shaped connection between government spending and economic growth. Also, Adams and Opoku (2015) investigate the effect of FDI on economic growth using a panel of 22 SSA for the period 1980–2011. The authors find evidence from the GMM estimator to show that although unconditionally FDI does not drive economic growth, the joint effect of FDI and regulations is positive and statistically significantly. Adams et al. (2016) also examine the link between energy consumption and economic growth, and the modulating role of democracy using a panel data of 16 SSA countries from 1971 to 2013. The study provides evidence from the panel vector autoregressive model to show that energy consumption enhances economic growth in the region. The study further finds that the joint effect of democracy and energy consumption on economic growth is positive and significant.

In addition, Adams and Klobodu (2016) assess the effect of remittances and regime change on economic growth for 33 SSA counties over the period 1970–2012. Their results from the system GMM estimation technique show that while remittances do not significantly affect growth, regime change suppresses growth. The study concludes that the growth-enhancing effect of remittances is amplified in the presence of a democratic and stable government. Appiah-Otoo and Song (2021) also use a panel of 123 countries composed of 45 high-income countries, 58 middle-income countries, and 20 low-income countries for the period 2002–2017 to examine the impact of ICT on economic growth. The authors provide strong evidence that the effect of ICT diffusion on growth across rich and poor countries is significantly different and that poor countries tend to gain more from the ICT innovations. Employing a panel dataset on 20 African countries, Akadiri and Akadiri (2018) applied the fixed effect estimator to test the relationship between growth and income inequality, on the one hand, and the pathway through which growth determinants influence income inequality for the period 1991 to 2015. The study finds evidence of positive long-run relationship between income inequality and growth. The study further reveals that population growth, mortality rate, government consumption expenditure, and foreign direct investment are principal determinants of the long-run growth and income inequality in the sampled countries.

In the same vein, Mavikela et al. (2019) examined the effect of inflation on economic growth for South Africa and Ghana with data over the period 2001 to 2016. Evidence from the quantile regression shows that while high inflation is positively related with growth in Ghana, it is the opposite in the case of South Africa. The study further shows an adverse effect of inflation at all threshold levels on growth in the post 2008/2009 global financial crisis.

Data and Methodology

Data

The study employed a large balanced panel spanning 1980–2019 for the analysis. The study sampled 42 SSA countries^{Footnote 6} on grounds of data availability. The outcome variable, economic growth, is the annual real GDP growth rate and is drawn from the World Development Indicators (World Bank, 2021b). Data on 113 potential drivers of growth are considered based on the extant scholarship on economic growth. Taking into consideration the real sector of the economies under consideration, variables such as vulnerable employment, inflation, and self-employment are considered (Bittencourt et al., 2015; Barro, 2013). Likewise, we include variables such as trade openness, and tariff considering the rise in economic globalization of SSA following the implementation of the AfCFTA and the projected rise in FDI inflow to the region in 2022 (UNCTAD, 2021; OECD/ACET, 2020). The essence of economic integration for growth in marginalized settings like SSA rests in the classical trade argument that it can foster social progress and the contemporary view that trade is essential for innovation diffusion, technological transfer, global value chain participation and export diversification (Asongu & Odhiambo, 2019; Asongu & Nwchukwu, 2016; Sakyi et al., 2015; Adams & Opoku., 2015).

Variables such as financial development and infrastructure are also considered due to their contribution to growth through resource allocation and the facilitation of economic activities (Koomson et al., 2020; Opoku et al., 2019; Peprah et al., 2019, African Development Bank, 2018). We source data on financial development from the World Bank’s Global Financial Development Database (Čihák et al., 2013) and the International Monetary Fund’s financial development index (Svirydzenka, 2016). Also, the study includes welfare variables of poverty and inequality due to their prevalence in the selected countries despite gains chalked in recent years and the fact that such developments waste human capital, consequently dragging down growth.

Data on poverty and inequality are sourced from the World Bank’s Poverty and Equity Database, and the Global Consumption and Poverty Project (Lahoti et al., 2016), while that of globalisation^{Footnote 7} is drawn from the Konjunkturforschungsstelle (KOF) index (Gygli et al., 2019). Per empirical evidence on the contribution of institutions and policy to growth, we consider country policy and institutional scores on macroeconomic management, trade policy, social protection, social inclusion, and financial sector management (Akobeng, 2016; Anyanwu, 2003; Asongu & Gupta, 2015; Asongu & Nwachukwu, 2017; Fosu, 2012). Also, we consider ICT skills, access, and usage given the momentous rise in the digital infrastructure of the region (Appiah-Otoo & Song, 2021; Tchamyou et al., 2019; Adeleye et al., 2019). The definitions and sources of all the variables are reported in Table 1.

Table 1 Variable definition and data sources

Full size table

Estimation Strategy

The empirical focus of this paper is in two parts. The first part is dedicated to the specification of the variable selection techniques while the inferential models are presented in the second part. In line with the objectives of the study, we do not employ traditional panel data estimation techniques for the analysis. For instance, the panel least squares estimator is inappropriate as it cannot explicitly perform variable selection from the 113 potential drivers of growth. Second, traditional methods such as the panel corrected standard errors and generalized method of moments cannot be relied upon as the presence of more predictors can cause the required matrix (${X}^{^{\prime}}X$) to be invertible. Even if it is possible, the presence of too many covariates may cause overfitting. In the presence of overfitting, although the attendant estimates are not biased, they are less efficient^{Footnote 8} (James et al., 2013). This is due to the fact that as the covariates become large, least squares assumptions of no multicollinearity, homoscedasticity, and exogeneity typically break down, therefore overfitting the model. This causes the out of sample error to increase, making inference and predictions flawed (James et al., 2013).

Addressing this econometric concern can be through the application of machine learning regularization techniques, which are effective for variable selection regardless of the number of covariates, model specification, nonlinearity, and time (Tibshirani, 1996). In this study, therefore, we train recent machine learning regularization algorithms to learn patterns in the underlying dataset to identify the main drivers of economic growth. Regularization is done by utilizing the bias-variance trade-off, where a tuning parameter (i.e., the bias) is introduced to reduce the variance associated with large datasets and consequently yield sparse estimates. In specifics, we train algorithms for four alternative shrinkage models—the first three from the lasso family (i.e., the Standard lasso, the minimum Schwarz Bayesian information criterion lasso, and the Adaptive lasso) and the Elasticnet to achieve the first objective.^{Footnote 9} Next, we perform causal inference on the selected covariates in objective 1 by running the lasso inferential models of double-selection linear lasso, partialing-out lasso linear regression, and partialing-out lasso instrumental variable regression to address objective 2. To this end, the STATA (version 16) and R (version 3.6) software are employed. The latter is employed primarily for data engineering and descriptive purposes while the data partitioning, regularization, and inferential estimates are carried out using the former.

Specification of Regularization Models

Specification of Standard Lasso and Minimum BIC Lasso Models

To address the ineffectiveness of traditional regression techniques in variable selection, Tibshirani (1996) introduced the standard lasso. Like other shrinkage techniques, the main advantages of the Standard lasso are that it (1) enhances the model interpretability by eliminating irrelevant variables that are not associated with the response variable; (2) enhances prediction accuracy, because shrinking and removing irrelevant predictors can reduce variance without a substantial increase in the bias; and (3) is limitless to data dimensionality.

In line with objective 1 of this study, the Standard lasso is applied to select the key drivers of economic growth by penalizing the model coefficients through a tuning parameter (λ) (Tibshirani, 1996; Belloni & Chernozhukov, 2014). Following Tibshiran (1996), we specify the objective function for the Standard lasso as shown in Eq. (1). For the Standard lasso algorithms to detect the key predicators of economic growth from a pool of several possible predictors, the penalty ($\lambda \sum_{j=1}^{\rho }|{\beta }_{j}|$), also referred to ${\mathcal{l}}_{1}$-norm, is introduced to obtain ${\widehat{\beta }}_{lasso}$ defined in Eq. (2):

$${Q}_{L}=\frac{1}{N}\sum\textstyle_{i=1}^{N}{\omega }_{i}f\left({y}_{it}, {\beta }_{0}+{X}_{it}{\beta }^{^{\prime}}\right)+\lambda \sum\textstyle_{j=1}^{p}{k}_{j}|{\beta }_{j}|$$

(1)

$${\widehat{\beta }}_{lasso}=min\left\{SSE+\lambda \sum\textstyle_{j=1}^{\rho }|{\beta }_{j}|\right\}$$

(2)

where ${y}_{it}$ is economic growth in country i in year t and ${X}_{it}$ is a vector of all the possible predictors of economic growth. The objective, therefore, is the minimization of the model sum of square errors with a given ${\mathcal{l}}_{1}$-norm. It is imperative to point out that if the tuning parameter, $\uplambda =0$, then we have a full model as in the least square estimator, while $\uplambda \to \infty$ is an intercept-only model. For brevity, we indicate that the specification of the minimum BIC lasso follows that of the Standard lasso with the same penalty and objective function but variable selection is based on the model with the least BIC (Schwarz, 1978). Some known drawbacks of these techniques are that, they (1) may become inconsistent as features grow rapidly and (2) are unable to perform hypothesis tests.

Specification of Adaptive Lasso Model

To enhance the consistency of regularization, Zou (2006) introduced the adaptive lasso technique, which in addition to the ${\mathcal{l}}_{1}$-norm penalty, adds the oracle property (${z}_{j}$). Relative to the Standard lasso, the oracle property enhances shrinkage or subset selection even when data attributes grow faster than the number of observations. In this study, we employ the Adaptive lasso technique as an alternative to the Standard lasso and minimum BIC lasso in addressing objective 1. Following Zou (2006), we minimize the objective function in (3) by applying the Adaptive lasso estimator (${\widehat{\beta }}_{AdaptiveLasso}$) specified in Eq. (4):

$${Q}_{L}=\frac{1}{N}\sum\textstyle_{i=1}^{N}{\omega }_{i}f\left({y}_{it}, {\beta }_{0}+{X}_{it}{\beta }^{^{\prime}}\right)+\lambda \sum\textstyle_{j=1}^{p}{k}_{j}|{\beta }_{j}|$$

(3)

$${\widehat{\beta }}_{AdaptiveLasso}=min\left\{SSE+\lambda \sum\textstyle_{j=1}^{\rho }{z}_{j}|{\beta }_{j}|\right\}$$

(4)

where ${y}_{it}$ is the outcome variable (economic growth) in country i in year t, ${X}_{it}$ is a vector of all 113 covariates of economic growth, and ${\beta }^{^{\prime}}$ are the attendant parameters.

Specification of Elasticnet Model

The Elasticnet method draws on the strengths of the Standard lasso and ridge regression by applying the ${\mathcal{l}}_{1}$ and ${\mathcal{l}}_{2}$ penalization norms. The strength of the Elasticnet is that in highly correlated covariates, it can produce sparse and consistent regularization than the lasso family algorithms (Zou & Hastie, 2005). Also, with the application of the ${\mathcal{l}}_{1}$ and ${\mathcal{l}}_{2}$ penalization norms, the Elasticnet becomes flexible in subset selection. To perform variable selection, the Elasticnet estimator minimizes the objective function:

$${Q}_{en}=\frac{1}{N}\sum\textstyle_{i=1}^{N}{\omega }_{i}f\left({y}_{it}, {\beta }_{0}+{X}_{i}{\beta }^{^{\prime}}\right)+\lambda \sum_{j=1}^{p}{k}_{j}\left\{\frac{1-\alpha }{2}{\beta }_{j}^{2}+|{\beta }_{j}|\right\}$$

(5)

where ${y}_{it}$, ${X}_{i}$, and ${\beta }^{^{\prime}}$ are as defined in previous sections and α is an additional Elasticnet penalty parameter,^{Footnote 10} which takes on values only in [0,1]. That is, sparsity occurs when 0 < α < 1 and λ > 0. This implies that in special cases, the Elasticnet plunges into either the ridge estimator (i.e., when λ = 0) or the Standard lasso estimator (i.e., when λ = 1).

Choice of Tuning Parameter

A fundamental concern regarding variable selection is the choice of the tunning parameter (λ). A good value of λ is essential for the overall performance of regularization models as it controls the strength of shrinkage and the concomitant prediction and inference (Schneider & Wagner, 2012). Among the widely used methods for choosing an efficient λ are cross validation (CV), Bayesian information criterion (BIC), and Akaike information criterion (AIC) (Tibshirani & Taylor, 2012). But it needs to be pointed out that if regularization becomes too strong, relevant variables may be omitted and coefficients may be shrunk excessively. Therefore, information criteria such as the BIC and AIC might be preferable to CV, since they are faster to compute and are less volatile in small samples (Zou et al., 2007). However, to the extent that setting λ under a researcher’s discretion can yield “target sparsity” and harm both predictive capacity and inferences (Hastie et al., 2019), we rely on BIC and CV^{Footnote 11}in determining λ.

Specification of Lasso Inferential Models

Since the aforementioned variable selection techniques do not provide estimates and confidence intervals essential for inference,^{Footnote 12} we apply the lasso inferential techniques to provide robust estimates on the selected predictors of economic growth. In specifics, we run the double-selection lasso linear model (DSL), the partialing-out lasso linear regression (POLR), and the partialing-out lasso instrumental variable regression (POIVLR) using the selected covariates in objective 1 as the variables of interest and the unselected (redundant) variables as controls.

It is imperative to note that due to the sparsity of the regularization techniques, the control variables are usually many. In view of this, the lasso inferential models consider these controls as irrelevant, and therefore, their inferential statistics are not reported. However, the number of relevant controls and instruments are indicated as part of the general regression statistics. Further, unlike the variables of interest, which the researcher has no flexibility of adding or excluding from model, the researcher can indicate the number of controls in the model.^{Footnote 13} The strength of these models is that they are built to produce unbiased and efficient estimates irrespective of data dimensionality, model specification, and multicollinearity.

Double-Selection Lasso Linear Model

In line with the second objective, we follow Belloni et al. (2014) by specifying the double-selection lasso (DSL) linear model as:

$$E\left[Y|d,x\right]=\psi {\alpha }^{^{\prime}}+{\phi \beta }^{^{\prime}},$$

(6)

where y is economic growth and is modeled to depend on $\psi$, containing J covariates of interest (i.e., the Elasticnet or lasso selected key drivers of economic growth) and $\phi$, which contains $p$ controls (i.e., the weak drivers of economic growth). The DSL estimator produces estimates on $J$ while relaxing the estimates for $p$.

Partialing-Out Lasso Linear Regression

In reference to the DSL, an added advantage of the partialing-out lasso linear regression (POLR) is that it enhances the efficacy of inference as the model becomes too complex. Following Belloni et al. (2012) and Chernozhukov et al. (2015), we specify the POLR estimator as:

$$E\left[Y|d,x\right]=d{\alpha }^{^{\prime}}+{X\beta }^{^{\prime}},$$

(7)

where $y$ is outcome variable (economic growth), $d$ is a vector containing the $J$ predictors of interest (i.e., the nonzero selected covariates of economic growth), and $X$ contains the $p$ controls (i.e., the unselected predictors of economic growth). Like the DSL, the POLR yields estimates, standard errors, and confidence intervals on the $J$ covariates while relaxing that of the $p$ controls.

Partialing-Out Lasso Instrumental Variable Regression

In large data regression problems like this study, sources of endogeneity abound largely due to bi-causality. For example, endogeneity can arise from the argument articulated in the supply-leading and demand-following hypotheses concerning financial development and economic growth (King & Levine, 1993). To address this, we follow Chernozhukov et al. (2015) by performing a partialing-out lasso instrumental variable regression (POIVLR). The POIVLR is specified as:

$$y=\Psi {\alpha }_{d}^{^{\prime}}+\Phi {\alpha }_{f}^{^{\prime}}+{X\beta }^{^{\prime}}+ \varepsilon ,$$

(8)

where $y$ is economic growth;$\Psi$ comprises ${J}_{d}$ endogenous covariates of interest; $f$ contains the ${J}_{f}$ exogenous covariates of interest; and $X$ contains ${\mathcal{p}}_{x}$ controls. Allowing for potential endogeneity primarily due to simultaneity, ${\mathcal{p}}_{z}$ outside instrumental variables denoted by $z$ that are correlated with $d$ but not with $\varepsilon$ are introduced. As aforesaid, the simultaneity between financial development and economic growth presents endogeneity concerns which are addressed using the $z$ instruments.^{Footnote 14} Theoretically, the controls and instrument can grow with the sample size; however, $\beta$ and nonzero coefficients in $z$ must be sparse.

Data Engineering and Partitioning

One of the key requirements of effective regularization is that the underlying dataset is strongly balanced. To this end, we employ the K-nearest neighbor (KNN) data imputation technique to address missing observations, particularly for variables such as the policy and institutional indicators (see Fig. 10). The KNN follows the principle that developments regarding variables drawn from a similar population exhibit similar properties (Van Hulse & Khoshgoftaar, 2014). In principle, the KNN selects the nearby neighbors based on a distance metric and estimates the missing observation with the attendant mean or mode. It is worth noting that while the mean rule is used to address missing observations in numerical variables, the latter is employed to address missing observations in categorical variables (Pan et al., 2015). Per this principle, this study relies on the mean rule, which uses the Minkowski distance as specified in Eq. (9) in addressing the missing observations.

$$d\left(i,j\right)={({\left|{x}_{i1}-{x}_{j1}\right|}^{q}+{\left|{x}_{i2}-{x}_{j2}\right|}^{q}+\dots +{\left|{x}_{i\rho }-{x}_{j\rho }\right|)}^{q}}^{1/q},$$

(9)

where $q$ is the Minkowski coefficient, $d\left(i,j\right)$ is the Minkowski distance for observations i and j, and $x$ are the variables. That said, we follow Ofori et al. (2022) by partitioning the dataset into two parts—the training set (70%) and testing test (30%) samples. We do this by applying the simple random and stratified data splitting techniques. In line with Ofori et al. (2022), we take cues from James et al. (2013) that among all other possible sets, the 70–30 and 80–20 splits are the data partitioning sets allowing reasonable representation of all variables in both the training and testing samples.

Presentation and Discussion of Results

Exploratory Data Analysis

For brevity, the exploratory data analysis is limited to the data partitioning results,^{Footnote 15} the distribution of economic growth, and the summary statistics. Information gleaned from the summary statistics in Table 2^{Footnote 16} shows an average economic growth (i.e., real GDP growth rate) value of 3.58 percent in the training set as compared to 3.95 percent in the testing set. Also, the average trade openness value as a percentage of GDP is 67.48 in the training set compared to 66.85 percent in the testing set. Additionally, we observe a mean unemployment rate of 7.58 percent in the training set compared to 7.68 percent in the testing set. It is also evident from Table 2 that the average transparency, accountability, and corruption score of 2.81 and 2.8 in the training and testing sets, respectively. Finally, Fig. 9 in the Appendix shows that 99.9 percent observations were present in the dataset before the data imputation (see the data engineering results in Fig. 10 in the Appendix).

Table 2 Summary statistics for training and testing sets

Full size table

Data Partitioning and Distribution of Economic Growth

A major decision regarding regularization is the form the outcome variable takes—either level or log transformed. On the latter, the distribution of economic growth as we show in Fig. 2 (right) is right-skewed. However, at level, as shown in Fig. 2 (left), economic growth is more symmetric and less heavy-tailed. At the backdrop that skewed distribution can have dire implications for regularization and the attendant inferential statistics, we run our shrinkage models using economic growth at level. Further, though non-standardization of covariates of economic growth does not constrain regularization, it is essential for ensuring the internal consistency of the data and comparability of the covariates. In view of this, the standardize option is invoked.

On data partitioning, we perform a 70–30 split of the dataset using the stratified method (see Fig. 3 (left)). Additionally, in checking the reliability or consistency of the stratified split, we run the simple random data splitting technique, which yields similar results (Fig. 3 (right)).

Regularization Results on the Main Drivers of Economic Growth in SSA

In this section, the results for the first objective are presented. As we show from Figs. 4, 5, 6 and 7 the lassos and Elasticnet algorithms select different non-zero coefficients (i.e., predictors) of economic growth. We find that the Standard lasso selects 12 covariates as key drivers of economic growth under a ten-fold cross-validation tuning parameter ($\uplambda$) value of 0.33 (see Fig. 4). Further, the Adaptive lasso selects only 10 covariates from the total 113 as chief drivers of economic growth in SSA with a tuning parameter ($\uplambda$) value of 0.24. Similarly, we find a special case for the Elasticnet regularization as it selects covariates based on a minimum cross-validation lambda of 0.33 and a minimum cross-validation alpha of 1. While the Elasticnet plunges into the Standard lasso (i.e., selects 12 non-zero predictors), we find a sparser regularization in the minimum BIC lasso as it selects only 7 covariates of the total 113.

In Table 3, a detailed output of how covariates enter and leave the respective shrinkage models is presented. The results from the minimum BIC lasso, which yields the best regularization indicates that the key drivers of economic growth in SSA are manufacturing (value addition), population, financial development, government spending, macroeconomic management, globalization, and social inclusion. The appropriateness of the results is evident in the post-estimation tests of cross-validation and coefficient path plots associated with each model (Figs. 4, 5, 6 and 7).

Table 3 Variable selection in regularization models

Full size table

Inferential Results for the Main Drivers of Economic Growth in SSA

In this section, the estimates on the 7 covariates of growth identified in objective 1 are provided. The results, which are reported in Table 4 are based on the DSL, POLR, and POIVLR estimation techniques, meaning that they are robust to heteroskedasticity, endogeneity, and model misspecification. To inform policy appropriately, we run three separate results for the (i) full sample, (ii) low-income countries, and (iii) middle- and high-income countries.

Table 4 Lasso estimates on the main drivers of economic growth in sub-Saharan Africa

Full size table

To begin with, we find that manufacturing value addition matters for economic growth in SSA. The results show that a 1 percent increase in manufacturing value addition boosts economic growth by 0.06 percent. Across the low-income and middle-income divide, however, we find that manufacturing value addition is significant only in the case of the former. The evidence suggests that with appropriate economic governance, it is low-income countries that can make remarkable strides in economic growth through enhanced manufacturing value addition considering the implementation of the AfCFTA. This is more so as improvement in manufacturing can spur forward and backward linkages as well as global value chain participation.

Further, the results show that although financial development is directly related to economic growth in both low-income and middle-income countries, it is statistically significant only in the former. In terms of magnitudes, the results suggest that for every 1-point increase in financial development, economic growth rises by 0.14 percent in low-income countries. The effect of financial development is remarkable, suggesting that access to financial products and services can propel the huge informal sector of low-income countries to realize their innovative and entrepreneurial objectives. This is more so considering the fact that lags in financial access are glaring in the low-income countries compared to middle-income countries.

Additionally, we find that economic globalization drives economic growth in SSA. In the remit of low-income and middle-income countries, however, we find that economic integration matters for growth only in the case of the latter. The plausible explanation for this is that, relative to low-income countries, middle-income countries have made remarkable strides in developing their manufacturing base, coupled with a good absorptive capacity that can enable them to gain significantly from economic globalization. Albeit statistically insignificant, the positive relationship between growth and economic globalization for low-income countries provides sheer optimism considering the implementation of the AfCFTA and the expected rebound of FDI to Africa from 2022.

The result on economic globalization is linked to the remarkable finding on macroeconomic management. There is strong empirical evidence to show that every 1-point increase in the score of macroeconomic management boosts economic growth by 0.73 percent (column 3). This result is even strong (i.e., 0.82%) in the case of low-income countries (column 6). Indeed, one of the major problems of the region has been poor macroeconomic management often resulting in bailouts by foreign institutions.^{Footnote 17} Although these bailouts have proved effective in propelling beneficiary countries toward prudent macroeconomic management paths, gains are mostly disrupted following exist, signifying the need for sustained commitment to fiscal and monetary discipline in SSA.

Also, we find that government expenditure is instrumental for economic growth in SSA. The result shows that a 1 percent increase in government expenditure boosts economic growth by 0.06 percent. However, this evidence is only significant in low-income SSA countries. A possible explanation for this is that, in middle-income countries, a high percentage of government expenditure goes into the recurrent expenditure compared to capital expenditure.

Moreover, we find that urban population matters for economic growth in the SSA. Additionally, the result reveal that urbanization is effective in fostering growth in the low-income countries compared to middle-income countries. This evidence appeals to logic in that economic activities driving growth in low-income SSA countries are mostly concentrated in urban centers. The result is in line with a World Bank (2009) report which argues that urban concentration is crucial in fostering growth in economies at the early stages of development. There is also the supporting evidence of gains from urbanization in that it reduces poverty and inequalities in opportunities, services, assets (Sekkat, 2017), and income inequality (see Oyvat, 2016).

Also, we provide strong empirical evidence to show that improving the coverage of social inclusion polices promotes economic growth in SSA by 3 percent (column 3). The result suggests that rolling out social intervention programs can propel SSA countries towards sustained growth trajectories. This is more so as social inclusion policies can build private sector capacity to withstand socioeconomic shocks. This is however not effective for growth in the low-income countries. This is also possible since institutions for developing human capital in these settings are weak, thereby providing little or no growth gains for such expenditure.

Conclusion and Policy Recommendations

The study contributes to the economic growth literature on SSA by employing recent advances in machine learning to identify the key drivers of growth. In doing so, we train algorithms for four machine learning regularization models—the Standard lasso, the minimum BIC lasso, the Adaptive lasso, and the Elasticnet based on a dataset spanning 1980–2019 for 42 African countries. Our results show that machine learning techniques are powerful and effective in reducing model complexities associated with large-data regression problems. In this study, while both the Standard lasso and Elasticnet techniques select 12 covariates as the main determinants of economic growth, the minimum BIC lasso selects only 7 out of the total 113 possible predictors. The uniqueness of the study is that it presents policymakers interested in the SSA growth agenda, variables to target to foster and sustain growth. These variables are manufacturing (value addition), urban population, financial development, government spending, macroeconomic management, economic globalization, and social inclusion.

For middle-income SSA countries, we suggest the following recommendations. First, in line with the implementation of the AfCFTA and the green growth agenda of the SSA, it is recommended that policymakers invest strategically in the manufacturing sectors of their economies. This can prove crucial in turning around the slow growth trajectories of the region as economic globalization can spur the industrialization through forward and backward linkages, innovation diffusion, and global value chain participation. Policymakers are therefore advised to build the technical workforce of their economies to make sense of the knowledge and innovation transfers associated with economic integration. Second, to improve the ability, opportunities, and dignity of the marginalized to contribute meaningfully to national development, policymakers are to invest strategically in areas such as health, education, and vocational training. This is more so as ICT diffusion can reduce inequalities in accessing information and high cost of accessing opportunities due to polarization of administrative procedures in the SSA.

For low-income countries, efforts should be made to develop the financial sector. This could prove crucial for efficient resource allocation, which can be a gamechanger in spurring the industrialization agenda of the region thorough competition, innovation, dynamism, and enhanced global value chain participation. Resources should thus be channeled towards the development of payment system platforms and services, financial innovation, and information flow on consumers. In this regard, institutions interested in SSA’s development agenda such as the African Development Bank, the IMF, and the World Bank should provide technical and logistical support to aid the transformation of the region’s predominantly low productive informal sector to a more dynamic, highly competitive and export-oriented one.

Additionally, we recommend that policymakers commit to prudent macroeconomic management. We reckon that in a setting like SSA where vulnerabilities are widespread, sound macroeconomic management will prove momentous in mitigating the welfare setbacks imposed by socioeconomic shocks (e.g., Covid-19) while lessening the impact of future ones. This also calls for the need to channel resources into productive expenditure like infrastructure and energy supply, which could contribute to ensuring that economic globalization propel these countries sustained growth trajectories.

The study leaves room for future works. First, considering the contributions this study makes through machine learning techniques, the academic community can also draw on similar techniques, for instance, to identify factors key for analyzing poverty and inequality. Second, these techniques can be employed to examine whether the growth-globalization relationship we find differs between landlocked and non-landlocked countries. Finally, considering the green growth agenda of the continent, regularization techniques can be employed to determine whether durable shared growth is driven largely by environmental factors or income growth and distributions.

Notes

Green growth refers to achieving sustainable growth trajectories that is environmentally friendly (OECD, 2017).
Machine learning has gained attention in recent times due to its ability to detect relevant patterns in big data for prediction and analysis.
Countries in SSA have collectively signed onto the African Continental Free Trade Area.
Nomenclature of Territorial Units for Statistics.
Support-vector machine, neural network, lasso, boosted tree, regularized generalized linear model, and ridge.
Angola, Benin, Botswana, Burkina Faso, Burundi, Cabo Verde, Cameroon, Central African Republic, Chad, Comoros, Congo, Dem. Rep., Congo, Rep., Cote d'Ivoire, Ethiopia, Gabon, Gambia (The), Ghana, Guinea, Guinea-Bissau, Kenya, Lesotho, Liberia, Madagascar, Malawi, Mali, Mauritania, Mauritius, Mozambique, Namibia, Niger, Nigeria, Rwanda, Sao Tome and Principe, Senegal, Seychelles, Sierra Leone, South Africa, Sudan, Tanzania, Togo, Uganda, Zambia.
The KOF globalization index is a cross-country composite index comprising key dimensions of global interrelationships (i.e., economic, social, trade, financial, culture and political).
Inefficiency due to model complexity, specification problems, and/ or overfitting. Further, the traditional least squares estimator is not only less sparse but also more susceptible and sensitive to problems like multicollinearity and outliers.
Since the ordinary least squares technique and ridge regression cannot yield variable selection, their estimations are relaxed.
This adds to the regular $\lambda$ penalty.
In this study, we invoke the tenfold cross-validation.
Traditional estimation techniques such as the OLS cannot be employed either as the new variability introduced in the dataset by the regularization techniques are not captured by such techniques.
We include 56 out of the remaining 106 covariates as control against the backdrop that several alternative measures of globalization, institutional quality, and welfare are used.
List of instruments in POIVLR: transparency score, trade score, public management score, macroeconomic management score, gender equality score, financial sector management score, internet access (per 1 million of the population), mobile cellular subscription (per 100 of the population), fixed telephone subscription (per 100 of the population), fixed broadband subscription (per 100 of the population).
That is the distribution of economic growth in the training and testing sets.
See the Appendix.
For example, the case of the IMF and Ghana in 2015/2016 and the IMF and South Africa in 2021.

References

Adams, S., & Klobodu, E. K. M. (2016). Remittances, regime durability and economic growth in Sub-Saharan Africa (SSA). Economic Analysis and Policy, 50, 1–8.
Google Scholar
Adams, S., Klobodu, E. K. M., & Opoku, E. E. O. (2016). Energy consumption, political regime and economic growth in sub-Saharan Africa. Energy Policy, 96, 36–44.
Google Scholar
Adams, S., & Opoku, E. E. O. (2015). Foreign direct investment, regulations and growth in sub-Saharan Africa. Economic Analysis and Policy, 47, 48–56.
Google Scholar
Adeleye, B. N., & Eboagu, C. (2019). Evaluation of ICT development and economic growth in Africa. Netnomics, 1–25.
African Development Bank. (2018). Africa’s infrastructure: Great potential but little impact on inclusive growth. Africa Development Outlook.
Agbloyor, E. K., Abor, J. Y., Adjasi, C. K. D., & Yawson, A. (2014). Private capital flows and economic growth in Africa: The role of domestic financial markets. Journal of International Financial Markets, Institutions and Money, 30, 137–152.
Google Scholar
Aghion, P., & Howitt, P. (1990). A model of growth through creative destruction (No. w3223). National Bureau of Economic Research.
Akadiri, S., & Akadiri, A. C. (2018). Growth and inequality in Africa: Reconsideration. Academic Journal of Economic Studies, 4(3), 76–86.
Google Scholar
Akbari, A., Ng, L., & Solnik, B. (2021). Drivers of economic and financial integration: A machine learning approach. Journal of Empirical Finance, 61, 82–102.
Google Scholar
Akobeng, E. (2016). Growth and institutions: A potential medicine for the poor in Sub-Saharan Africa. African Development Review, 28(1), 1–17.
Google Scholar
Alagidede, P., & Ibrahim, M. (2017). On the causes and effects of exchange rate volatility on economic growth: Evidence from Ghana. Journal of African Business, 18(2), 169–193.
Google Scholar
Anyanwu, J. C. (2003). Estimating the macroeconomic effects of monetary unions: The case of trade and output. African Development Review, 15(2–3), 126–145.
Google Scholar
Anyanwu, J. C. (2014). Factors affecting economic growth in Africa: Are there any lessons from China? African Development Review, 26(3), 468–493.
Google Scholar
Appiah-Otoo, I., & Song, N. (2021). The impact of ICT on economic growth-Comparing rich and poor countries. Telecommunications Policy, 45(2), 102082.
Google Scholar
Asongu, S., & Gupta, R. (2015). Trust and quality of growth: A note. Economics Bulletin, 36(3), A181.
Google Scholar
Asongu, S. A., & Nwachukwu, J. C. (2016). Welfare spending and quality of growth in developing countries: A note on evidence from hopefuls, contenders and best performers. The Social Science Journal, 53(4), 495–500.
Asongu, S. A., & Nwachukwu, J. C. (2017). Quality of growth empirics: Comparative gaps, benchmarking and policy syndromes. Journal of Policy Modeling, 39(5), 861–882.
Google Scholar
Asongu, S. A., & Odhiambo, N. M. (2019). Mobile banking usage, quality of growth, inequality and poverty in developing countries. Information Development, 35(2), 303–318.
Google Scholar
Barro, R. J. (2013). Inflation and economic growth. Annals of Economics & Finance, 14(1).
Bazarbash, M. (2019). Fintech in financial inclusion: Machine learning applications in assessing credit risk (May 2019). IMF Working Paper No. 19/109.
Beam, A. L., & Kohane, I. S. (2018). Big data and machine learning in health care. JAMA, 319(13), 1317–1318.
Google Scholar
Belloni, A., Chen, D., Chernozhukov, V., & Hansen, C. (2012). Sparse models and methods for optimal instruments with an application to eminent domain. Econometrica (80), 2369–2429, Arxiv, 2010.
Belloni, A., Chernozhukov, V., & Hansen, C. (2014). High-dimensional methods and inference on structural and treatment effects. Journal of Economic Perspectives, 28(2), 29–50.
Google Scholar
Berhane, K. (2018). The role of financial development and institutional quality in economic growth in Africa in the era of globalization. In Determinants of economic growth in Africa (pp. 149–196). Palgrave Macmillan, Cham.
Bhavsar, P., Safro, I., Bouaynaya, N., Polikar, R., & Dera, D. (2017). Machine learning in transportation data analytics. In Data analytics for intelligent transportation systems, 283–307, Elsevier.
Bittencourt, M., Van Eyden, R., & Seleteng, M. (2015). Inflation and economic growth: Evidence from the Southern African Development Community. South African Journal of Economics, 83(3), 411–424.
Google Scholar
Bredt, S. (2019). Artificial Intelligence (AI) in the financial sector—Potential and public strategies. Frontiers in Artificial Intelligence, 2, 16.
Chakamera, C., & Alagidede, P. (2018). The nexus between infrastructure (quantity and quality) and economic growth in Sub Saharan Africa. International Review of Applied Economics, 32(5), 641–672.
Google Scholar
Čihák, M., Demirgüč-Kunt, A., Feyen, E., & Levine, R. (2013). Financial development in 205 economies, 1960 to 2010 (No. w18946). National Bureau of Economic Research.
Chernozhukov, V., Hansen, C., & Spindler, M. (2015). Valid post-selection and post-regularization inference: An elementary. General Approach. Annual REview of Economics, 7(1), 649–688.
Google Scholar
De Almeida Rocha, D., & Duarte, J. C. (2019). Simulating human behaviour in games using machine learning. In 2019 18th Brazilian Symposium on Computer Games and Digital Entertainment, 163–172. IEEE.
Domar, E. (1947). Expansion and employment. American Economic Review, 14(1), 34–55.
Google Scholar
Doupe, P., Faghmous, J., & Basu, S. (2019). Machine learning for health services researchers. Value in Health, 22(7), 808–815.
Google Scholar
Dutt, P., & Tsetlin, I. (2016). Income distribution and economic development: Insights from machine learning. Economics & Politics.
Fosu, A. K. (2012). The African economic growth record, and the roles of policy syndromes and governance. Good growth and governance in Africa: Rethinking development strategies, 175–218.
Grossman, G. M., & Helpman, E. (1991). Innovation and growth in the global economy. MIT press.
Gygli, S., Haelg, F., Potrafke, N., & Sturm, J. E. (2019). The KOF globalisation index revisited. Review of International Organizations, 14(3), 543–574.
Google Scholar
Gyimah-Brempong, K., Paddison, O., & Mitiku, W. (2006). Higher education and economic growth in Africa. The Journal of Development Studies, 42(3), 509–529.
Google Scholar
Harrod, R. (1939). An essay in dynamic theory. The Economic Journal, 44(1), 14–31.
Google Scholar
Hastie, T., Tibshirani, R., & Wainwright, M. (2019). Statistical learning with sparsity: the lasso and generalizations. Chapman and Hall/CRC.
Ibrahim, M., & Alagidede, P. (2018). Effect of financial development on economic growth in sub-Saharan Africa. Journal of Policy Modeling, 40(6), 1104–1125.
Google Scholar
IMF. (2020). World economic outlook: A long and difficult ascent. October. Washington, DC.
James, G., Witten, D., Hastie, T., & Tibshirani, R. (2013). An introduction to statistical learning (Vol. 112). Springer.
Google Scholar
Jung, J. -K., Patnam, M., & Ter-Martirosyan, A. (2018). An algorithmic crystal ball: Forecasts-based on machine learning. IMF Working Papers. Washington, D.C.: International Monetary Fund
Kaufman, D., Kraay, A., & Mastruzzi, M. (2010). The worldwide governance indicators: Methodology and analysis. World Bank Policy Research Paper, (5430).
Kebede, J. G., & Takyi, P. O. (2017). Causality between institutional quality and economic growth: Evidence from sub-Saharan Africa. European Journal of Economic and Financial Research.
King, R. G., & Levine, R. (1993). Finance and growth: Schumpeter might be right. The Quarterly Journal of Economics, 108(3), 717–737.
Google Scholar
Kodongo, O., & Ojah, K. (2016). Does infrastructure really explain economic growth in Sub-Saharan Africa? Review of Development Finance, 6(2), 105–125.
Google Scholar
Koomson, I., Villano, R. A., & Hadley, D. (2020). Effect of financial inclusion on poverty and vulnerability to poverty: Evidence using a multidimensional measure of financial inclusion. Social Indicators Research, 149(2), 613–639.
Google Scholar
Lahoti, R., Jayadev, A., & Reddy, S. (2016). The global consumption and income project (GCIP): An overview. Journal of Globalization and Development, 7(1), 61–108.
Google Scholar
Luxton, D. D. (2016). An introduction to artificial intelligence in behavioral and mental health care. In Artificial intelligence in behavioral and mental health care. Academic Press, 1–26
Mateen, B. A., Liley, J., Denniston, A. K., Holmes, C. C., & Vollmer, S. J. (2020). Improving the quality of machine learning in health applications and clinical research. Nature Machine Intelligence, 2(10), 554–556.
Google Scholar
Mavikela, N., Mhaka, S., & Phiri, A. (2019). The inflation-growth relationship in SSA inflation-targeting countries. Studia Universitatis Babes-Bolyai, Oeconomica, 64(2), 84–102.
Google Scholar
Ngongang, E. (2015). Financial development and economic growth in Sub-Saharan Africa: A dynamic panel data analysis. European Journal of Sustainable Development, 4(2), 369.
OECD. (2017). Green growth indicators 2017. OECD Green Growth Studies, OECD Publishing, Paris. https://doi.org/10.1787/9789264268586-en
OECD/ACET. (2020). Quality infrastructure in 21st century Africa: Prioritising, accelerating and scaling up in the context of Pida (2021–30).
OECD, ILO. (2019). Tackling vulnerability in the informal economy, development centre studies. OECD Publishing.
Google Scholar
Ofori, I. K., Quaidoo, C., & Ofori, P. E. (2022). What drives financial sector development in Africa? Insights from machine learning. Applied Artificial Intelligence, 1–33.
Olaoye, O. O., Eluwole, O. O., Ayesha, A., & Afolabi, O. O. (2020). Government spending and economic growth in ECOWAS: An asymmetric analysis. The Journal of Economic Asymmetries, 22, e00180.
Google Scholar
Omoteso, K., & Mobolaji, H. I. (2014). Corruption, governance and economic growth in Sub-Saharan Africa: a need for the prioritisation of reform policies. Social Responsibility Journal, 0(2), 316–330.
Opoku, E. E. O., Ibrahim, M., & Sare, Y. A. (2019). The causal relationship between financial development and economic growth in Africa. International Review of Applied Economics, 33(6), 789–812.
Google Scholar
Oyvat, C. (2016). Agrarian structures, urbanization, and inequality. World Development, 83, 207–230.
Google Scholar
Pan, R., Yang, T., Cao, J., Lu, K., & Zhang, Z. (2015). Missing data imputation by K nearest neighbours based on grey relational structure and mutual information. Applied Intelligence, 43(3), 614–632.
Google Scholar
Peprah, J. A., Kwesi Ofori, I., & Asomani, A. N. (2019). Financial development, remittances and economic growth: A threshold analysis. Cogent Economics & Finance, 7(1), 1625107.
Google Scholar
Romer, P. M. (1990). Endogenous technological change. Journal of political Economy, 98(5, Part 2), S71–S102.
Richardson, A., van Florenstein Mulder, T., & Vehbi, T. (2021). Nowcasting GDP using machine-learning algorithms: A real-time assessment. International Journal of Forecasting, 37(2), 941–948.
Google Scholar
Sakyi, D., Commodore, R., & Opoku, E. E. O. (2015). Foreign direct investment, trade openness and economic growth in Ghana: An empirical investigation. Journal of African Business, 16(1–2), 1–15.
Google Scholar
Sandeep, S., Shelton, C. R., Pahor, A., Jaeggi, S. M., & Seitz, A. R. (2020). Application of machine learning models for tracking participant skills in cognitive training. Frontiers in Psychology, 11, 15–32.
Google Scholar
Schneider, U., & Wagner, M. (2012). Catching growth determinants with the adaptive lasso. German Economic Review, 13(1), 71–85.
Google Scholar
Schwarz, G. (1978). Estimating the dimension of a model. Annals of Statistics, 6(2), 461–464.
Google Scholar
Sekkat, K. (2017). Urban concentration and poverty in developing countries. Growth and Change, 48(3), 435–458.
Google Scholar
Svirydzenka, K. (2016). Introducing a new broad-based index of financial development. IMF Working Paper No. 16/5.
Solow, R. M. (1956). A contribution to the theory of economic growth. The Quarterly Journal of Economics, 70(1), 65–94.
Google Scholar
Tchamyou, V. S., Asongu, S. A., & Odhiambo, N. M. (2019). The role of ICT in modulating the effect of education and lifelong learning on income inequality and economic growth in Africa. African Development Review, 31(3), 261–274.
Google Scholar
Tkacz, G. (2001). Neural network forecasting of Canadian GDP growth. International Journal of Forecasting, 17(1), 57–69.
Google Scholar
Tibshirani, R. (1996). Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society: Series B (Methodological), 58(1), 267–288.
Tibshirani, R. J., & Taylor, J. (2012). Degrees of freedom in lasso problems. The Annals of Statistics, 40(2), 1198–1232.
Google Scholar
Tizghadam, A., Khazaei, H., Moghaddam, M. H., & Hassan, Y. (2019). Machine learning in transportation. Journal of Advanced Transportation, Special Issue, 2019, 1–3.
Google Scholar
UNCTAD (United Nations Conference on Trade and Development). (2021). Investment Trends Monitor. Issue 38. Geneva: UNCTAD.
Van Hulse, J., & Khoshgoftaar, T. M. (2014). Incomplete-case nearest neighbor imputation in software measurement data. Information Sciences, 259, 596–610.
Google Scholar
World Bank. (2009). Reshaping economic geography; World Development Report 2009. World Bank.
Google Scholar
World Bank. (2020). Global economic prospects, June 2020. World Bank.
World Bank. (2021a). COVID-19 and food security. Brief, World Bank, Washington, DC. https://www.worldbank.org/en/topic/agriculture/brief/foodsecurity-and-covid-19
World Bank. (2021b). World development indicators. World Bank.
Google Scholar
Zou, H. (2006). The adaptive lasso and its oracle properties. Journal of the American Statistical Association, 101(476), 1418–1429.
Google Scholar
Zou, H., & Hastie, T. (2005). Regularization and variable selection via the elastic net. Journal of the Royal Statistical Society: Series B (statistical Methodology), 67(2), 301–320.
Google Scholar
Zou, H., Hastie, T., & Tibshirani, R. (2007). On the “degrees of freedom” of the lasso. The Annals of Statistics, 35(5), 2173–2192.
Google Scholar

Download references

Funding

Open access funding provided by Università degli Studi dell'Insubria within the CRUI-CARE Agreement.

Author information

Authors and Affiliations

Department of Economics, University of Insubria, Via Monte Generoso, 71, 21100, Varese, Italy
Isaac K. Ofori
The Harriet Tubman Institute, York University, 4700 Keele Street, Toronto, On, M3J 1P3, Canada
Isaac K. Ofori
Economic Policy Division, Faculty of Economics and Business Administration, Friedrich Schiller University of Jena, Jena, Germany
Isaac K. Ofori
Department of Economic Studies, School of Economics, University of Cape Coast, Cape Coast, Ghana
Camara K. Obeng
School of Economics, University of Johannesburg, Johannesburg, South Africa
Simplice A. Asongu

Authors

Isaac K. Ofori
View author publications
You can also search for this author in PubMed Google Scholar
Camara K. Obeng
View author publications
You can also search for this author in PubMed Google Scholar
Simplice A. Asongu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Isaac K. Ofori.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Ofori, I.K., Obeng, C.K. & Asongu, S.A. What Really Drives Economic Growth in Sub-Saharan Africa? Evidence from the Lasso Regularization and Inferential Techniques. J Knowl Econ 15, 144–179 (2024). https://doi.org/10.1007/s13132-022-01055-1

Download citation

Received: 05 November 2021
Accepted: 14 September 2022
Published: 21 October 2022
Issue Date: March 2024
DOI: https://doi.org/10.1007/s13132-022-01055-1

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

What Really Drives Economic Growth in Sub-Saharan Africa? Evidence from the Lasso Regularization and Inferential Techniques

Abstract

Similar content being viewed by others

What Really Drives Financial Sector Development in Ghana?

Variable Selection and Estimation in Kink Regression Model

Of needles and haystacks: revisiting growth determinants by robust Bayesian variable selection

Introduction

Literature Survey on Empirical Works Using Machine Learning

Literature Survey on Drivers of Economic Growth Based on Traditional Techniques

Data and Methodology

Data

Estimation Strategy

Specification of Regularization Models

Specification of Standard Lasso and Minimum BIC Lasso Models

Specification of Adaptive Lasso Model

Specification of Elasticnet Model

Choice of Tuning Parameter

Specification of Lasso Inferential Models

Double-Selection Lasso Linear Model

Partialing-Out Lasso Linear Regression

Partialing-Out Lasso Instrumental Variable Regression

Data Engineering and Partitioning

Presentation and Discussion of Results

Exploratory Data Analysis

Data Partitioning and Distribution of Economic Growth

Regularization Results on the Main Drivers of Economic Growth in SSA

Inferential Results for the Main Drivers of Economic Growth in SSA

Conclusion and Policy Recommendations

Notes

References

Funding

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Appendix

Appendix

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation