Background

Food security is essential to people’s health and well-being [1]. Further, the World Health Organization (WHO) argues that health is wealth and poor health is an integral part of poverty; governments should actively seek to preserve their people’s lives and reduce the incidence of unnecessary mortality and avoidable illnesses [2]. However, lack of food is one of the factors which affect health outcomes. Concerning this, the Food Research and Action Center noted that the social determinants of health, such as poverty and food insecurity, are associated with some of the most severe and costly health problems in a nation [3].

According to the FAO, the International Fund for Agricultural Development (IFAD), and the World Food Programme (WFP), food insecurity is defined as "A situation that exists when people lack secure access to sufficient amounts of safe and nutritious food for normal growth and development and an active and healthy life" ([4]; p50). It is generally believed that food security and nutrition are crucial to improving human health and development. Studies show that millions of people live in food insecurity, which is one of the main risks to human health. Around one in four people globally (1.9 billion people) were moderately or severely food insecure in 2017 and the greatest numbers were in SSA and South Asia. Around 9.2% of the world's population was severely food insecure in 2018. Food insecurity is highest in SSA countries, where nearly one-third are defined as severely insecure [5]. Similarly, 11% (820 million) of the world's population was undernourished in 2018, and SSA countries still share a substantial amount [5]. Even though globally the number of people affected by hunger has been decreasing since 1990, in recent years (especially since 2015) the number of people living in food insecurity has increased. It will be a huge challenge to achieve the SDGs of zero hunger by 2030 [6]. FAO et al. [7] projected that one in four individuals in SSA were undernourished in 2017. Moreover, FAO et al. [8] found that, between 2014 and 2018, the prevalence of undernourishment worsened. Twenty percent of the continent's population, or 256 million people, are undernourished today, of which 239 million are in SSA. Hidden hunger is also one of the most severe types of malnutrition (micronutrient deficiencies). One in three persons suffers from inadequacies related to hidden hunger, which impacts two billion people worldwide [9]. Similarly, SSA has a high prevalence of hidden hunger [10, 11].

An important consequence of food insecurity is that around 9 million people die yearly worldwide due to hunger and hunger-related diseases. This is more than from Acquired Immunodeficiency Syndrome (AIDS), malaria, and tuberculosis combined [6]. Even though the hunger crisis affects many people of all genders and ages, children are particularly affected in Africa. There are too many malnourished children in Africa, and malnutrition is a major factor in the high infant mortality rates and causes physical and mental development delays and disorders in SSA [12]. According to UN statistics, chronic malnutrition globally accounts for 165 million stunted or underweight children. Around 75% of these kids are from SSA and South Asia. Forty percent of children in SSA are impacted. In SSA, about 3.2 million children under the age of five dies yearly, which is about half of all deaths in this age group worldwide. Malnutrition is responsible for almost one child under the age of five dying every two minutes worldwide. The child mortality rate in the SSA is among the highest in the world, about one in nine children pass away before the age of five [12].

In addition to the direct impact of food insecurity on health outcomes, it also indirectly contributes to disordered eating patterns, higher or lower blood cholesterol levels, lower serum albumin, lower hemoglobin, vitamin A levels, and poor physical and mental health [13,14,15]. Iodine, iron, and zinc deficiency are the most often identified micronutrient deficiencies across all age groups. A deficiency in vitamin A affects an estimated 190 million pre-schoolers and 19 million pregnant women [16]. Even though it is frequently noted that hidden hunger mostly affects pregnant women, children, and teenagers, it further affects people’s health at all stages of life [17].

With the above information, researchers and policymakers should focus on the issue of food insecurity and health status. The SDGs that were developed in 2015 intend to end hunger in 2030 as one of its primary targets. However, a growing number of people live with hunger and food insecurity, leading to millions of deaths. Hence, this study questioned what is the impact of food insecurity on people's health outcomes in SSA countries. In addition, despite the evidence implicating food insecurity and poor health status, there is a lack of macro-level empirical studies concerning the impact of food insecurity on people’s health status in SSA countries, which leads to a knowledge (literature) gap. Therefore, this study aims to examine the impact of food insecurity on life expectancy and infant mortality in SSA countries for the period ranging from 2001–2018 using panel mean regression approaches.

Theoretical and conceptual framework

Structural factors, such as climate, socio-economic, social, and local food availability, affect people’s food security. People’s health condition is impacted by food insecurity through nutritional, mental health, and behavioral channels [18]. Under the nutritional channel, food insecurity has an impact on total caloric intake, diet quality, and nutritional status [19,20,21]. Hunger and undernutrition may develop when food supplies are scarce, and these conditions may potentially lead to wasting, stunting, and immunological deficiencies [22]. However, food insecurity also negatively influences health due to its effects on obesity, women's disordered eating patterns [23], and poor diet quality [24].

Under the mental health channel, Whitaker et al. [25] noted that food insecurity is related to poor mental health conditions (stress, sadness, and anxiety), which have also been linked to obesity and cardiovascular risk [26]. The effects of food insecurity on mental health can worsen the health of people who are already sick as well as lead to disease acquisition [18]. Similarly, the behavioral channel argues that there is a connection between food insecurity and health practices that impact disease management, prevention, and treatment. For example, lack of access to household food might force people to make bad decisions that may raise their risk of sickness, such as relying too heavily on cheap, calorically dense, nutrient-poor meals or participating in risky sexual conduct. In addition, food insecurity and other competing demands for survival are linked to poorer access and adherence to general medical treatment in low-income individuals once they become sick [27,28,29,30]

Food insecurity increases the likelihood of exposure to HIV and worsens the health of HIV-positive individuals [18]. Weiser et al. [31] found that food insecurity increases the likelihood of unsafe sexual activities, aggravating the spread of HIV. It can also raise the possibility of transmission through unsafe newborn feeding practices and worsening maternal health [32]. In addition, food insecurity has been linked to decreased antiretroviral adherence, declines in physical health status, worse immunologic status [33], decreased viral suppression [34, 35], increased incidence of serious illness [36], and increased mortality [37] among people living with HIV.

With the above theoretical relationship between target variables and since this study focuses on the impact of food insecurity on health outcomes, and not on the causes, it adopted the conceptual framework of Weiser et al. [18] and constructed Fig. 1.

Fig. 1
figure 1

A conceptual framework of food insecurity and health. Source: Modified and constructed by the author using Weiser et al. [18] conceptual framework. Permission was granted by Taylor & Francis to use their original Figs. (2.2, 2.3, and 2.4); to develop the above figure. Permission number: 1072954

Several findings associate food insecurity with poorer health, worse disease management, and a higher risk of premature mortality even though they used microdata. For instance, Stuff et al. [38] found that food insecurity is related to poor self-reported health status, obesity [39], abnormal blood lipids [40], a rise in diabetes [24, 40], increased gestational diabetes[41], increased perceived stress, depression and anxiety among women [25, 42], Human Immunodeficiency Virus (HIV) acquisition risk [43,44,45], childhood stunting [46], poor health [47], mental health and behavioral problem [25, 48, 49].

The above highlight micro-level empirical studies, and since the scope of this study is macro-level, Table 1 provides only the existing macro-level empirical findings related to the current study.

Table 1 Empirical review

Empirical findings in Table 1 are a few, implying a limited number of macro-level level empirical findings. Even the existing macro-level studies have several limitations. For instance, most studies either employed conventional estimation techniques or overlooked basic econometric tests; thus, their results and policy implications may mislead policy implementers. Except for Hameed et al. [53], most studies’ data are either outdated or unbalanced; hence, their results and policy implications may not be valuable in the dynamic world and may not be accurate like balanced data. Besides, some studies used limited (one) sampled countries; however, few sampled countries and observations do not get the asymptotic properties of an estimator [56]. Therefore, this study tries to fill the existing gaps by employing robust estimation techniques with initial diagnostic and post-estimation tests, basic panel econometric tests and robustness checks, updated data, a large number of samples.

Methods

Study setting and participants

According to Smith and Meade [57], the highest rates of both food insecurity and severe food insecurity were found in Sub-Saharan Africa in 2017 (55 and 28%, respectively), followed by Latin America and the Caribbean (32 and 12%, respectively) and South Asia (30 and 13%). Similarly, SSA countries have worst health outcomes compared to other regions. For instance, in 2020, the region had the lowest life expectancy [58] and highest infant mortality [59]. Having the above information, this study's target population are SSA countries chosen purposively. However, even though SSA comprises 49 of Africa's 55 countries that are entirely or partially south of the Sahara Desert. This study is conducted for a sample of 31 SSA countries (Angola, Benin, Botswana, Burkina Faso, Cameroon, Cabo Verde, Chad, Congo Rep., Côte d'Ivoire, Ethiopia, Gabon, The Gambia, Ghana, Kenya, Lesotho, Liberia, Madagascar, Malawi, Mali, Mauritania, Mauritius, Mozambique, Namibia, Nigeria, Rwanda, Senegal, Sierra Leone, South Africa, Sudan, Tanzania, and Togo). The sampled countries are selected based on data accessibility for each variable included in the empirical models from 2001 to 2018. Since SSA countries suffer from food insecurity and related health problems, this study believes the sampled countries are appropriate and represent the region. Moreover, since this study included a large sample size, it improves the estimator’s precision.

Data type, sources, and scope

This study uses secondary data collected in December 2020 online from the databases of the Food and Agricultural Organization (FAO), the United Nations Development Programme (UNDP), and the World Bank (WB) (see Table 2). In addition, the study uses yearly balanced data from 2001 to 2018, which is appropriate because it captures the Millennium Development Goals, SDGs, and other economic conditions, such as the rise of SSA countries’ economies and the global financial crisis of the 2000s. Therefore, this study considers various global development programs and events. Generally, the scope of this study (sampled countries and time) is sufficient to represent SSA countries. In other words, the study has n*T = 558 observations, which fulfills the large sample size criteria recommended by Kennedy [56].

Table 2 Data type, sources, and measurements

The empirical model

Model specification is vital to conduct basic panel data econometric tests and estimate the relationship of target variables. Besides social factors, the study includes economic factors determining people's health status. Moreover, it uses two proxies indicators to measure both food insecurity and health status; hence, it specifies the general model as follows:

$$\overbrace{LNLEXP\left(LNINFMOR\right)}^{Target\;variables}=\overset{Target\;variables}{f\overbrace{(PRUND(AVRDES)},\;}\overbrace{GDPPC,\;GOVEXP,\;MNSCHOOL,\;URBAN)}^{Control\;variables}$$
(1)

The study uses four models to analyze the impact of food insecurity on health outcomes.

$${LNLEXP}_{it}={\alpha }_{0}+{\alpha }_{1}{PRUND}_{it}+{\alpha }_{2}{GDPPC}_{it}+{\alpha }_{3}{GOVEXP}_{it}+{\alpha }_{4}{MNSCHOOL}_{it}+{\alpha }_{5}{URBAN}_{it}+{\upeta }_{it}$$
(1A)
$${LNLEXP}_{it}={\beta }_{0}+{\beta }_{1}{AVRDES}_{it}+{\beta }_{2}{GDPPC}_{it}+{\beta }_{3}{GOVEXP}_{it}+{\beta }_{4}{MNSCHOOL}_{it}+{\beta }_{5}{URBAN}_{it}+{v}_{it}$$
(1B)
$${LNINFMOR}_{it}={\theta }_{0}+{\theta }_{1}{PRUND}_{it}+{\theta }_{2}{GDPPC}_{it}+{\theta }_{3}{GOVEXP}_{it}+{\theta }_{4}{MNSCHOOL}_{it}+{\theta }_{5}{URBAN}_{it}+{\varepsilon }_{it}$$
(1C)
$${LNINFMOR}_{it}={\delta }_{0}+{\delta }_{1}{AVRDES}_{it}+{\delta }_{2}{GDPPC}_{it}+{\delta }_{3}{GOVEXP}_{it}+{\delta }_{4}{MNSCHOOL}_{it}+{\delta }_{5}{URBAN}_{it}+{\mu }_{it}$$
(1D)

where LNLEXP and LNINFMOR (dependent variables) refer to the natural logarithm of life expectancy at birth and infant mortality used as proxy variables for health outcomes. Similarly, PRUND and AVRDES are the prevalence of undernourishment and average dietary energy supply adequacy – proxy and predictor variables for food insecurity.

Moreover, to regulate countries’ socio-economic conditions and to account for time-varying bias that can contribute to changes in the dependent variable, the study included control variables, such as GDPPC, GOVEXP, MNSCHOOL, and URBAN. GDPPC is GDP per capita, GOVEXP refers to domestic general government health expenditure, MNSCHOOL is mean years of schooling and URBAN refers to urbanization. Further, nit, vit, εit, and μit are the stochastic error terms at period t. The parameters \({\alpha }_{0}, { \beta }_{0}, { \theta }_{0},{ \delta }_{0}\) refer to intercept terms and \({\alpha }_{1}-{\alpha }_{5}, {\beta }_{1}-{\beta }_{5}, { \theta }_{1}-{\theta }_{5}, and {\delta }_{1}-{\delta }_{5}\) are the long-run estimation coefficients. Since health outcomes and food insecurity have two indicators used as proxy variables, this study estimates different alternative models and robustness checks of the main results. Furthermore, the above models did not address heterogeneity problems; hence, this study considers unobserved heterogeneity by introducing cross-section and time heterogeneity in the models. This is accomplished by assuming a two-way error component for the disturbances with:

$$\left\{\begin{array}{c}{\upeta }_{it}\\ {\nu }_{it}\\ {\varepsilon }_{it}\\ {\mu }_{it}\end{array}=\varpi =\right.{\delta }_{i}+{\tau }_{t}+{\gamma }_{it}$$
(2)

From Eq. 2, the unobservable individual (cross-section) and unobservable time heterogeneities are described by \({\delta }_{i} and {\tau }_{t}\) (within components), respectively. Nonetheless, the remaining random error term is \({\gamma }_{it}\) (panel or between components). Therefore, the error terms in model 1A-1D will be substituted by the right-hand side elements of Eq. 2.

Depending on the presumptions of whether the error elements are fixed or random, the FE and RE models are the two kinds of models that will be evaluated. Equation (2) yields a two-way FE error component model, or just a FE model if the assumptions are that \({\delta }_{i} and {\tau }_{t}\) are fixed parameters to be estimated and that the random error component, \({\gamma }_{it}\), is uniformly and independently distributed with zero mean and constant variance (homoscedasticity).

Equation (2), on the other hand, provides a two-way RE error component model or a RE model if we suppose \({\delta }_{i} and {\tau }_{t}\) are random, just like the random error term, or \({\delta }_{i},{\tau }_{t}, and {\gamma }_{it}\) are all uniformly and independently distributed with zero mean and constant variance, or they are all independent of each other and independent variables [60].

Rather than considering both error components, \({\delta }_{i}, and {\tau }_{t}\), we can examine only one of them at a time (fixed or random), yielding a one-way error component model, FE or RE. The stochastic error term \({\varpi }_{it}\) in Eq. 2 will then be:

$${\varpi }_{it}={\delta }_{i}+{\gamma }_{it}, or$$
(3)
$${\varpi }_{it}={\tau }_{t}+{\gamma }_{it}$$
(4)

Statistical analysis

This study conducted descriptive statistics, correlation analysis, and initial diagnosis tests (cross-sectional and time-specific fixed effect, outliers and influential observations, multicollinearity, normality, heteroscedasticity, and serial correlation test). Moreover, it provides basic panel econometric tests and panel data estimation techniques. For consistency, statistical software (STATA) version 15 was used for all analyses.

Descriptive statistics and correlation analysis

Descriptive statistics is essential to know the behavior of the variables in the model. Therefore, it captures information, such as the mean, standard deviation, minimum, maximum, skewness, and kurtosis. Similarly, the study conducted Pearson correlation analysis to assess the degree of relationship between the variables.

Initial diagnosis

Cross-sectional and time-specific fixed effect

One can anticipate differences arising over time or within the cross-sectional units, given that the panel data set comprises repeated observations over the same units gathered over many periods. Therefore, before estimation, this study considered unexplained heterogeneity in the models. One fundamental limitation of cross-section, panel, and time series data regression is that they do not account for country and time heterogeneity [60]. These unobserved differences across nations and over time are crucial in how the error term is represented and the model is evaluated. These unobserved heterogeneities, however, may be represented by including both country and time dummies in the regression. However, if the parameters exceed the number of observations, the estimate will fail [60]. However, in this study, the models can be estimated. If we include both country and time dummies, we may assume that the slope coefficients are constant, but the intercept varies across countries and time, yielding the two-way error components model. As a result, this study examines the null hypothesis that intercepts differ across nations and time in general.

Detecting outliers and influential observations

In regression analysis, outliers and influential observations may provide biased findings. Therefore, the Cooks D outlier and influential observation test was used in the study to handle outliers and influencing observations. To evaluate whether these outliers have a stronger impact on the model to be estimated, each observation in this test was reviewed and compared with Cook’s D statistic [61]. Cook distance evaluates the extent to which observation impacts the entire model or the projected values. Hence, this study tested the existence of outliers.

Normality, heteroscedasticity, multicollinearity, and serial correlation test

Before the final regression result, the data used for the variables were tested for normality, heteroscedasticity, multicollinearity, and serial correlation to examine the characteristics of the sample.

Regression models should be checked for nonnormal error terms because a lack of Gaussianity (normal distribution) can occasionally compromise the accuracy of estimation and testing techniques. Additionally, the validity of inference techniques, specification tests, and forecasting critically depends on the normalcy assumption [62]. Similarly, multicollinearity in error terms leads to a dataset being highly sensitive to a minor change, instability in the regression model, and skewed and unreliable results. Therefore, this study conducted the normality using Alejo et al. [62] proposed command and multicollinearity (using VIF) tests.

Most conventional panel data estimation methods rely on homoscedastic individual error variance and constant serial correlation. Since the error component is typically connected to the variance that is not constant during the observation and is serially linked across periods, these theoretical presumptions have lately reduced the applicability of various panel data models. Serial correlation and heteroskedasticity are two estimate issues frequently connected to cross-sectional and time series data, respectively. Similarly, panel data is not free from these issues because it includes cross-sections and time series, making the estimated parameters ineffective, and rendering conclusions drawn from the estimation incorrect [63]. Therefore, this study used the Wooldridge [63] test for serial correlation in linear panel models as well as the modified Wald test for heteroskedasticity.

Basic panel econometric tests

The basic panel data econometric tests are prerequisites for estimating the panel data. The three main basic panel data tests are cross-sectional dependence, unit root, and cointegration.

Cross-sectional dependence (CD)

A growing body of the panel data literature concludes that panel data models are likely to exhibit substantial CD in the errors resulting from frequent shocks, unobserved components, spatial dependence, and idiosyncratic pairwise dependence. Even though the impact of CD in estimation depends on several factors, relative to the static model, the effect of CD in dynamic panel estimators is more severe [64]. Moreover, Pesaran [65] notes that recessions and economic or financial crises potentially affect all countries, even though they might start from just one or two countries. These occurrences inevitably introduce cross-sectional interdependencies across the cross-sectional unit, their regressors, and the error terms. Hence, overlooking the CD in panel data leads to biased estimates and spurious results [64, 66]. Further, the CD test determines the type of panel unit root and cointegration tests we should apply. Therefore, examining the CD is vital in panel data econometrics.

In the literature, there are several tests for CD, such as the Breusch and Pagan [67] Lagrange multiplier (LM) test, Pesaran [68] scaled LM test, Pesaran [68] CD test, and Baltagi et al. [69] bias-corrected scaled LM test (for more detail, see Tugcu and Tiwari [70]). Besides, Friedman [71] and Frees [72, 73] also have other types of CD tests (for more detail, see De Hoyos and Sarafidis [64]). This study employs Frees [72] and Pesaran [68] among the existing CD tests. This is because, unlike the Breusch and Pagan [67] test, these tests do not require infinite T and fixed N, and are rather applicable for both a large N and T. Additionally, Free’s CD test can overcome the irregular signs associated with correlation. However, it also employs Friedman [71] CD for mixed results of the above tests.

Unit root test

The panel unit root and cointegration tests are common steps following the CD test. Generally, there are two types of panel unit root tests: (1) the first-generation panel unit root tests, such as Im et al. [74], Maddala and Wu [75], Choi [76], Levin et al. [77], Breitung [78] and Hadri [79], and (2) the second-generation panel unit root tests, such as [66, 80,81,82,83,84,85,86,87,88,89].

The first-generation panel unit root tests have been criticized because they assume cross-sectional independence [90,91,92,93]. This hypothesis is somewhat restrictive and unrealistic, as macroeconomic time series exhibit significant cross-sectional correlation among countries in a panel [92], and co-movements of economies are often observed in the majority of macroeconomic applications of unit root tests [91]. The cross-sectional correlation of errors in panel data applications in economics is likely to be the rule rather than the exception [93]. Moreover, applying first-generation unit root tests under CD models can generate substantial size distortions [90], resulting in the null hypothesis of nonstationary being quickly rejected [66, 94]. As a result, second-generation panel unit root tests have been proposed to take CD into account. Therefore, among the existing second-generation tests, this study employs Pesaran’s [66] cross-sectionally augmented panel unit root test (CIPS) for models 1A–1C. The rationale for this is that, unlike other unit root tests that allow CD, such as Bai and Ng [80], Moon and Perron [87], and Phillips and Sul [84], Pesaran’s [66] test is simple and clear. Besides, Pesaran [66] is robust when time-series’ heteroscedasticity is observed in the unobserved common factor [95]. Even though theoretically, Moon and Perron [87], Choi [96] and Pesaran [66] require large N and T, Pesaran [66] is uniquely robust in small sample sizes [97]. Therefore, this study employs the CIPS test to take into account CD, and heteroskedasticity in the unobserved common factor and both large and small sample countries. However, since there is no CD in model 1D, this study employs the first-generation unit root tests called Levin, Lin, and Chu (LLC), Im, Pesaran, Shin (IPS) and Fisher augmented Dickey–Fuller (ADF) for model 1D.

Cointegration test

The most common panel cointegration tests when there is CD are Westerlund [98], Westerlund and Edgerton [99], Westerlund and Edgerton [100], Groen and Kleibergen [101], Westerlund’s [102] Durbin-Hausman test, Gengenbach et al. [103] and Banerjee and Carrion-i-Silvestre [104]. However, except for a few, most tests are not coded in Statistical Software (STATA) and are affected by insufficient observations. The current study primarily uses Westerlund [98] and Banerjee and Carrion-i-Silvestre [104] for models 1A–1C. However, to decide uncertain results, it also uses McCoskey and Kao [105] cointegration tests for model 1C. The rationale for using Westerlund’s [98] cointegration test is that most panel cointegration has failed to reject the null hypothesis of no cointegration due to the failure of common-factor restriction [106]. However, Westerlund [98] does not require any common factor restriction [107] and allows for a large degree of heterogeneity (e.g., individual-specific short-run dynamics, intercepts, linear trends, and slope parameters) [92, 107, 108]. Besides, its command is coded and readily available in STATA. However, it suffers from insufficient observations, especially when the number of independent variables increases. The present study employs the Banerjee and Carrion-i-Silvestre [104] and McCoskey and Kao [105] cointegration tests to overcome this limitation. The two Engle-Granger-based cointegration tests applicable when there is no CD and are widely used and available in STATA are Pedroni [109, 110] and Kao [111]. However, the Pedroni test has two benefits over Kao: it assumes cross-sectional dependency and considers heterogeneity by employing specific parameters [112]. Hence, this study uses the Pedroni cointegration test for model 1D.

Panel data estimation techniques

The panel data analysis can be conducted using different estimation techniques and is mainly determined by the results of basic panel econometric tests. Thus, this study mainly employs the Driscoll-Kraay [113] standard error (DKSE) (for models 1A and 1B), FE (for model 1C), and two-step GMM (for model 1D) estimation techniques to examine the impact of food insecurity on health outcomes. It also employs the Granger causality test. However, for robustness checks, it uses fully modified ordinary least squares (FMOLS), panel-corrected standard error (PCSE), and feasible generalized least squares (FGLS) methods (for models 1A and 1B). Moreover, it uses a random effect (RE) for model 1C and panel dynamic fixed effect (DFE) techniques for model 1D.

Even though several panel estimation techniques allow CD, most of them – such as cross-section augmented autoregressive distributed lag (CS-ARDL), cross-section augmented distributed lag (CS-DL), common correlated effects pooled (CCEP), and common correlated effects mean group (CCEMG) estimators – require a large number of observations over groups and periods. Similarly, the continuously updated fully modified (CUP-FM) and continuously updated bias-corrected (CUP-BC) estimators are not coded in STATA. Others, like the PCSE, FGLS, and seemingly unrelated regression (SUR), are feasible for T (the number of time series) > N (the number of cross-sectional units) [114, 115]. However, a DKSE estimate is feasible for N > T [114]. Therefore, depending on the CD, cointegration test, availability in STATA, and comparing N against T, this study mainly employs the DKSE regression for models 1A and 1B, FE model for model 1C, and GMM for model 1C.

Finally, to check the robustness of the main result, this study employs FMOLS, FGLS, and PCSE estimation techniques for models 1A and 1B. Furthermore, even though the Hausman test confirms that the FE is more efficient, the study employs the RE for model 1C. This is because Firebaugh et al. [116] note that the RE and FE models perform best in panel data. Besides, unlike FE, RE assumes that individual differences are random. In addition, this study uses panel DFE for model 1D (selected based on the Hausman test). Finally, the robustness check is also conducted using an alternative model (i.e., when a dependent variable is without a natural log and Granger causality test).

Results

Descriptive statistics and correlation analysis

Table 3 shows the overall mean of LNLEXP of the region is 4.063 years which indicates that the region can achieve only 57.43 (using ln(x) = 4.063 = loge(x) = e4.063, where e = 2.718) years of life expectancy. This is very low compared to other regions. Besides, the ranges in the value of LNLEXP are between 3.698 and 4.345 or (40–76 years), implying high variation. Similarly, the mean value of LNINFMOR is 3.969; implying SSA countries recorded 52 infants death per 1000. Moreover, the range of LNINFMOR is between 2.525 and 4.919 or (12 – 135 infant death per 1000), implying high variation within the region. The mean value of people’s prevalence for undernourishment is 21.26; indicating 21% of the population is undernourished. However, the mean value of AVRDES is 107.826, which is greater than 100, implying that the calorie supply is adequate for all consumers if the food is distributed according to the requirements of individuals. When we observe the skewness and kurtosis of the variables of the models, except for LNLEXP and LNINFMOR, all variables are positively skewed. In addition, all variables have positive kurtosis with values between 2.202 and 6.092.

Table 3 Descriptive statistics and correlation analysis

Table 3 also shows the degree of relationship between variables, such that most values are below the threshold or rule of thumb (0.7) for a greater association [117]. However, the association between LNINFMOR and LNLEXP, as well as between PRUNP and AVRDES, is over the threshold and seems to have a multicollinearity issue. Nevertheless, these variables did not exist together in the models, indicating the absence of a multicollinearity problem.

Cross-sectional and time-specific fixed effect

Table 4 shows whether the cross-sectional specific and time-specific FE in extended models (model 1A-1D plus Eq. 2) are valid. The result reveals that the null hypothesis of the captured unobserved heterogeneity is homogenous across the countries, and time is rejected at 1%, implying the extended models are correctly specified. Besides, to check the robustness of the two-way error component model relative to the pooled OLS estimator, this study conducted an additional poolability test. The result shows the null hypothesis that intercepts homogeneity (pooling) is rejected at a 1% level; thus, the FE model is most applicable, but the pooled OLS is biased.

Table 4 Test for individual cross-sections and time-specific effects

Detecting outliers and influential observations

Cooks D is an indicator of high leverage and residuals. The impact is high when D exceeds 4/N, (N = number of observations). A D > 1 implies a significant outlier problem. The Cooks D result of this study confirms the absence of outliers' problem (see supplementary file 1).

Normality, heteroscedasticity, serial correlation, and multicollinearity tests

The results in Table 5 indicate that the probability value of the joint test for normality on e and u are above 0.01, implying that the residuals are normally distributed. The heteroscedasticity results show that the probability value of the chi-square statistic is less than 0.01 in all models. Therefore, the null hypothesis of constant variance can be rejected at a 1% level of significance. In other words, the modified Wald test result for Groupwise heteroskedasticity presented in Table 5, rejects the null hypothesis of Groupwise homoskedasticity observed by the probability value of 0.0000, which implies the presence of heteroscedasticity in the residuals. Similarly, all models suffer from serial correlation since the probability value of 0.0000 rejects the null hypothesis of no first-order serial correlation, indicating the presence of autocorrelation in all panel models. Finally, the multicollinearity test reveals that the models have no multicollinearity problem since the Variance inflation Factors (VIF) values are below 5.

Table 5 Normality, heteroskedasticity, serial correlation, and multicollinearity results

Cross-sectional dependence test

Results in Table 6 strongly reject the null hypothesis of cross-sectional independence for models 1A – 1C. However, for model 1D, the study found mixed results (i.e., Pesaran [68] fails to reject the null hypothesis of no CD while Frees [72] strongly rejects it). Thus, to decide, this study employs the Friedman [71] CD test. The result fails to reject the null hypothesis of cross-sectional independence, implying that two out of three tests fail to reject the null hypothesis of cross-sectional independence in model 1D. Therefore, unlike others, there is no CD in model 1D (see Table 6).

Table 6 Cross-sectional dependence tests

Unit root tests

Table 7 shows that all variables are highly (at 1% level) significant either at level (I(0)) or first difference (I(1)), which implies all variables are stationary. In other words, the result fails to reject the null hypothesis of unit root (non-stationary) for all variables at a 1%-significance level, either at levels or the first differences. Thus, we might expect a long-run connection between these variables collectively.

Table 7 Unit root tests

Cointegration tests

The results in Table 8 show that both the Westerlund [98] and Banerjee and Carrion-i-Silvestre [104] cointegration tests strongly reject the null hypothesis of no-cointegration in models 1A and 1B. However, model 1C provides a mixed result, i.e. the Banerjee and Carrion-i-Silvestre [104] test rejects the null hypothesis of no cointegration, yet the reverse is true for the Westerlund [98] test. Therefore, this study conducted further cointegration tests for model 1C. Even though Westerlund and Edgerton [99] suffer from insufficient observation, it is based on the McCoskey and Kao [105] LM test [118]. Thus, we can use a residual-based cointegration test in the heterogeneous panel framework proposed by McCoskey and Kao [105]. However, an efficient estimation technique of cointegrated variables is required, and hence the FMOLS and DOLS estimators are recommended. The residuals derived from the FMOLS and DOLS will be tested for stationarity with the null hypothesis of no cointegration amongst the regressors. Since the McCoskey and Kao [105] test involves averaging the individual LM statistics across the cross-sections, for testing the residuals FMOLS and DOLS stationarity, McCoskey, and Kao [105] test is in the spirit of IPS (Im et al. [74]) [119].

Table 8 Panel cointegration test

Though FMOLS and DOLS are recommended for the residuals cointegration test, DOLS is better than FMOLS (for more detail, see Kao and Chiang [120]); therefore, this study uses a residual test derived from DOLS. The result fails to reject the null hypothesis of no cointegration. Two (Banerjee and Carrion-i-Silvestre [104] and McCoskey and Kao [105]) out of three tests fail to reject the null hypothesis of no cointegration; hence, we can conclude that there is no long-run relationship among the variables in model 1C.

Unlike other models, since there is CD in model 1D, this study employs the Pedroni [109] and Kao [111] cointegration tests for model 1D. The result strongly rejects the null hypothesis of no cointegration, which is similar to models 1A and 1B, that a long-run relationship exists among the variables in model 1D (see Table 5).

Panel data estimation results

Table 9 provides long-run regression results of all models employing appropriate estimation techniques such as DKSE, FE, and two-step GMM, along with the Granger causality test. However, the DKSE regression can be estimated in three ways: FE with DKSE, RE with DKSE, and pooled Ordinary Least Squares/Weighted Least Squares (pooled OLS/WLS) regression with DKSE. Hence, we must choose the most efficient model using Hausman and Breusch-Pagan LM for RE tests (see supplementary file 2). As a result, this study employed FE with DKSE for models 1A and 1B. Further, due to Hausman's result, absence of cointegration and to deal with heterogeneity and spatial dependence in the dynamic panel, this study employs FE for the model1C (see the supplementary file 2). However, due to the absence of CD, the presence of cointegration, and N > T, this study uses GMM for model 1D. Moreover, according to Roodman [121], the GMM approach can solve heteroskedasticity and autocorrelation problems. Furthermore, even though two-step GMM produces only short-run results, it is possible to generate long-run coefficients from short-run results [122, 123].

Table 9 DKSE, FE, Two-step GMM, and Granger causality results

The DKSE result of model 1A shows that a 1% increment in people's prevalence for undernourishment reduces their life expectancy by 0.00348 PPs (1 year or 366 days). However, in model 1C, a 1% rise in the prevalence of undernourishment increases infant mortality by 0.0119 PPs (1 year or 369 days). The DKSE estimations in model 1B reveal that people’s life expectancy rises by 0.00317 PPs with every 1% increase in average dietary energy supply. However, the GMM result for model 1D confirms that a 1% incrementin average dietary energy supply reduces infant mortality by 0.0139 PPs. Moreover, this study conducted a panel Granger causality test to confirm whether or not food insecurity has a potential causality to health outcomes. The result demonstrates that the null hypothesis of change in people’s prevalence for undernourishment and average dietary energy supply does not homogeneously cause health outcomes is rejected at 1% significance, implying a change in food insecurity does Granger-cause health outcomes of SSA countries (see Table 9).

In addition to the main results, Table 9 also reports some post-estimation statistics to ascertain the consistency of the estimated results. Hence, in the case of DKSE and FE models, the validity of the models is determined by the values of R2 and the F statistics. For instance, R2 quantifies the proportion of the variance in the dependent variable explained by the independent variables, representing the model’s quality. The results in Table 9 demonstrate that the explanatory variables explain more than 62% of the variance on the dependent variable. Cohen [125] classifies the R2 value of 2% as a moderate influence in social and behavioral sciences, while 13 and 26% are considered medium and large effects, respectively. Therefore, the explanatory variables substantially impact this study's models. Similarly, the F statistics explain all independent variables jointly explain the dependent one. For the two-step system GMM, the result fails to reject the null hypothesis of no first (AR(1)) and second-order (AR(2)) serial correlation, indicating that there is no first and second-order serial correlation. In addition, the Hansen [126] and Sargan [127] tests fail to reject the null hypothesis of the overall validity of the instruments used, which implies too many instruments do not weaken the model.

Robustness checks

The author believes the above findings may not be enough for policy recommendations unless robustness checks are undertaken. Hence, the study estimated all models without the natural logarithm of the dependent variables (see Table 10). The model 1A result reveals, similar to the above results, individuals’ prevalence for undernourishment significantly reduces their life expectancy in SSA countries. That means a 1% increase in the people's prevalence of undernourishment reduces their life expectancy by 0.1924 PPs. Moreover, in model 1B, life expectancy rises by 0.1763 PPs with every 1% increase in average dietary energy supply. In model 1C, the rise in infants’ prevalence for undernourishment has a positive and significant effect on their mortality rate in SSA countries. The FE result implies that a 1% rise in infants’ prevalence for undernourishment increases their mortality rate by 0.9785 PPs. The GMM result in model 1D indicates that improvement in average dietary energy supply significantly reduces infant mortality. Further, the Granger causality result confirms that the null hypothesis of change in the prevalence of undernourishment and average dietary energy supply does not homogeneously cause health outcomes and is rejected at a 1% level of significance. This implies a change in food insecurity does Granger-cause health outcomes in SSA countries (see Table 10).

Table 10 DKSE, FE, Two-step GMM, and Granger causality results

The study also conducted further robustness checks using the same dependent variables (as Table 9) but different estimation techniques. The results confirm that people’s prevalence of undernourishment has a negative and significant effect on their life expectancy, but improvement in average dietary energy supply significantly increases life expectancy in SSA countries. However, the incidence of undernourishment in infants contributes to their mortality; however, progress in average dietary energy supply for infants significantly reduces their mortality (see Table 11).

Table 11 FMOLS, PCSE, FGLS, RE, and DFE results

Discussion

The main objective of this study is to examine the impact of food insecurity on the health outcomes of SSA countries. Accordingly, the DKSE result of model 1A confirms that the rise in people’s prevalence for undernourishment significantly reduces their life expectancy in SSA countries. However, the FE result shows that an increment in the prevalence of undernourishment has a positive and significant impact on infant mortality in model 1C. This indicates that the percentage of the population whose food intake is insufficient to meet dietary energy requirements is high, which leads to reduce life expectancy but increases infant mortality in SSA countries. The reason for this result is linked to the insufficient food supply in SSA due to low production and yields, primitive tools, lack of supporting smallholder farms and investment in infrastructure, and government policies. Besides, even though the food is available, it is not distributed fairly according to the requirements of individuals. Moreover, inadequate access to food, poor nutrition, and chronic illnesses are caused by a lack of well-balanced diets. In addition, many of these countries are impacted by poverty, making it difficult for citizens to afford nutritious food. All these issues combine to create an environment where individuals are more likely to suffer malnutrition-related illnesses, resulting in a lower life expectancy rate. The DKSE estimation result in model 1B reveals that improvement in average dietary energy supply positively impacts people's life expectancy in SSA countries. However, the improvement in average dietary energy supply reduces infant mortality.

Based on the above results, we can conclude that food insecurity harms SSA nations' health outcomes. This is because the prevalence of undernourishment leads to increased infant mortality by reducing the vulnerability, severity, and duration of infectious diseases such as diarrhea, pneumonia, malaria, and measles. Similarly, the prevalence of undernourishment can reduce life expectancy by increasing the vulnerability, severity, and duration of infectious diseases. However, food security improves health outcomes – the rise in average dietary energy supply reduces infant mortality and increases the life expectancy of individuals.

Several facts and theories support the above findings. For instance, similar to the theoretical and conceptual framework section, food insecurity in SSA countries can affect health outcomes in nutritional, mental health, and behavioral channels. According to FAO et al. [128], the prevalence of undernourishment increased in Africa from 17.6% of the population in 2014 to 19.1% in 2019. This figure is more than twice the global average and the highest of all regions of the world. Similarly, SSA is the world region most at risk of food insecurity [129]. According to Global Nutrition [130] report, anemia affects an estimated 39.325% of women of reproductive age. Some 13.825% of infants have a low weight at birth in the SSA region. Excluding middle African countries (due to lack of data), the estimated average prevalence of infants aged 0 to 5 months who are exclusively breastfed is 35.73%, which is lower than the global average of 44.0%. Moreover, SSA Africa still experiences a malnutrition burden among children aged under five years. The average prevalence of overweight is 8.15%, which is higher than the global average of 5.7%. The prevalence of stunting is 30.825%—higher than the worldwide average of 22%. Conversely, the SSA countries’ prevalence of wasting is 5.375%, which is higher than most regions such as Central Asia, Eastern Asia, Western Asia, Latin America and the Caribbean, and North America. The SSA region's adult population also faces a malnutrition burden: an average of 9.375% of adult (aged 18 and over) women live with diabetes, compared to 8.25% of men. Meanwhile, 20.675% of women and 7.85% of men live with obesity.

According to Saltzman et al. [17], micronutrient deficiencies can affect people’s health throughout their life cycle. For instance, at the baby age, it causes (low birth weight, higher mortality rate, and impaired mental development), child (stunting, reduced mental capacity, frequent infections, reduced learning capacity, higher mortality rate), adolescent (stunting, reduced mental capacity, fatigue, and increased vulnerability to infection), pregnant women (increased mortality and perinatal complications), adult (reduced productivity, poor socio-economic status, malnutrition, and increased risk of chronic disease), elderly (increased morbidity (including osteoporosis and mental impairment), and higher mortality rate).

Though this study attempts to fill the existing gaps, it also has limitations. It examined the impact of food insecurity on infant mortality; however, their association is reflected indirectly through other health outcomes. Hence, future studies can extend this study by examining the indirect effect of food insecurity on infant mortality, which helps to look at in-depth relationships between the variables. Moreover, this study employed infant mortality whose age is below one year; hence, future studies can broaden the scope by decomposing infant mortality into (neonatal and postnatal) and under-five mortality.

Conclusions

Millions of people are dying every year due to hunger and hunger-related diseases worldwide, especially in SSA countries. Currently, the link between food insecurity and health status is on researchers' and policymakers' agendas. However, macro-level findings in this area for most concerned countries like SSA have been given only limited attention. Therefore, this study examined the impact of food insecurity on life expectancy and infant mortality rates. The study mainly employs DKSE, FE, two-step GMM, and Granger causality approaches, along with other estimation techniques for robustness checks for the years between 2001 and 2018. The result confirms that food insecurity harms health outcomes, while food security improves the health status of SSA nations'. That means that a rise in undernourishment increases the infant mortality rate and reduces life expectancy. However, an improvement in the average dietary energy supply reduces infant mortality and increases life expectancy. Therefore, SSA countries need to guarantee their food accessibility both in quality and quantity, which improves health status. Both development experts and political leaders agree that Africa has the potential for agricultural outputs, can feed the continent, and improve socio-economic growth. Besides, more than half of the world's unused arable land is found in Africa. Therefore, effective utilization of natural resources is essential to achieve food security. Moreover, since the majority of the food in SSA is produced by smallholder farmers [131] while they are the most vulnerable to food insecurity and poverty [132, 133]; hence, special focus and support should be given to smallholder farmers that enhance food self-sufficiency. Further, improvement in investment in agricultural research; improvement in markets, infrastructures, and institutions; good macroeconomic policies and political stability; and developing sub-regional strategies based on their agroecological zone are crucial to overcoming food insecurity and improving health status. Finally, filling a stomach is not sufficient; hence, a person's diet needs to be comprehensive and secure, balanced (including all necessary nutrients), and available and accessible. Therefore, SSA countries should ensure availability, accessibility, usability, and sustainability to achieve food and nutrition security.