Background

The United Nations (UN) sustainable development goal number three’s (SDG3) focus on ensuring healthy lives and promoting well-being for all at all ages, highlights the priority of health in the international development agenda [1]. The World Health Organization’s (WHO) global agenda on Universal Health Coverage (UHC) called for access of all people to quality, comprehensive and effective health services at affordable costs without financial hardship [2, 3]. However, many low- and middle-income countries (LMICs) rely extensively on out-of-pocket (OOP) expenditures to finance health care services with adverse consequences on the use of services and with the risk of financial catastrophic expenditure and impoverishment [4]. Expansion of health insurance coverage is one strategy to attain SDG3 and UHC goals. Economic theory suggests that people purchase health insurance not only to avoid the risk of financial loss, but also as a mechanism for gaining access to healthcare that would otherwise be unaffordable [5,6,7]. Healthcare financing through health insurance is gaining popularity in developing countries in their pursuit to achieve UHC [8, 9]. Cutler and Zeckhauser [10] posit that the primary purpose of health insurance and its delivery is to improve health. Good health increases productivity, lifetime earnings, and time spent in activities that maximize utility [11, 12]. People care about health insurance because they are concerned about their health [13] and evidence suggests that the effect of being without health insurance can be significant [14, 15]. Although a widely held expectation is that increase in health care finance and expenditure translates into health status improvements, the evidence is scarce and inconclusive [13, 16, 17] particularly for LMICs like Kenya.

Though studies examining the impact of health expenditures on health outcomes mostly document positive associations [18,19,20], experiences differ by region and health measures. Bein et al. [19] found a positive association between health expenditures and life expectancy while that of neonatal, infant and under five deaths was negative, in eight East African countries. Akinci et al. [21] found that increases in total healthcare expenditures reduce infant, under-five, and maternal mortality rates for Middle Eastern and Northern African countries. However, Anyanwu and Erhijakpor [22] documented region-dependent relationship between health expenditures and infant and under-five mortality whereby it was positive for Sub-Saharan African (SSA) countries and negative for Northern Africa. In countries with good governance, increased health care spending was associated with improvements in infant and under-five child mortality [23].

Most studies focus on the impact of health insurance on health care utilization and OOP, with evidence suggesting that utilization is higher among the insured [4, 9, 14, 24,25,26,27,28,29] and benefiting those of low socio-economic status significantly more, as was the case for Ghana [30]. However, in their systematic review, Acharya et al. [31] found no strong evidence of impact of insurance on utilization, protection from financial risk and health status in LMICs and caution that increased utilization may not always indicate welfare improvements. Rather, they note that a few schemes afford protection from high OOP, but the impact is weaker on the poor. In South Africa, though private health insurance increased health service utilization, it did not result in lower OOP payments for scheme members compared to nonmembers, raising questions on the effectiveness of insurance in protecting against financial risks and catastrophic expenditures [32].

Proponents of the health insurance strategy argue that health insurance certainly increases the quantity of health care consumed, and many medical interventions have proven to be greatly beneficial [16] especially to some population groups [17]. However, arguments abound that more medical care is not always better and could actually be worse, leaving the question of health benefits of insurance uncertain. Furthermore, there are also other initiatives such as EPI (Expanded Programs on Immunization) or direct investments for medical care and health systems improvements that improve health besides health insurance, evidently raising questions about its effectiveness in improving population health. Aron-Dine et al. [33] note that over three decades later, the Rand Health Insurance Experiment (HIE) results [17, 34] are still widely held to be the “gold standard” of evidence for predicting the likely impact of health insurance reforms on medical spending, health care utilization and health outcomes. Rand [17] states that cost sharing in general had no adverse effects on participant health outcomes though there were exceptions. The poor and sickest patients achieved better outcomes with blood pressure control, vision control and dental care in more generous plans [13, 34]. There was a 10% reduction in mortality for those with hypertension [17]. The study suggested that cost sharing should be minimal or nonexistent for the poor, especially those with chronic disease [7, 17]. Currie and Gruber [35, 36] found that extending public insurance to pregnant women and children was associated with significant decreases in infant and child mortality and with increase in the use of preventive care among children in United States of America (USA).

Culyer and Newhouse [13] cite studies which show that the introduction of universal health coverage in Canada was associated with a decrease in the infant mortality rate. Dor et al. [37] show that private health insurance significantly improved health of older adults (45–64 years) in USA. Levy and Meltzer [16] concluded that their review showed some evidence to support the view that health insurance significantly improves health for specific health measures (e.g., control of hypertension, vision and dental health) or subpopulations (e.g., children, pregnant women, older working age adults). In addition, Dor et al. [37] concluded that there appears to be a general agreement that insurance effects are positive and persist under a variety of disease conditions classifications (chronic, non-chronic, symptomatic, hypertension). The general observation from these studies is that the impact of health insurance on health status cannot be generalized to different settings and population groups, much less in LMICs.

Very few studies examine the impact of health insurance on health outcomes in LMICs [31], with mixed findings. Wang et al. [28] used EuroQol-5 Dimension (EQ-5D) to measure health and report that insurance had positive effects on the health status for all insured and for the poor in China while Mensah et al. [38] showed lower levels of infant deaths in Ghana, though not statistically significant. Stone et al. [14] analyzed all inpatients discharged from a Kenyan teaching and referral hospital to establish the relationship between in-hospital death (mortality) and health insurance status. They found that 21.3% of 956 patients discharged had insurance (mostly National Hospital Insurance Fund (NHIF)) and that the proportion that died was more than twice as high among the uninsured (24.7%) compared to the insured (11.4%). Their findings suggest that insurance coverage was associated with decreased in-hospital mortality and called for expansion of insurance coverage. Haushofer et al. [39] used a randomized controlled trial in Kenya to assess the impact of providing health insurance on self-reported stress and stress hormone cortisol and demonstrated that health insurance reduced both health indicators among informal workers. Insurance was also found to improve sleep and had larger effects on the poor and vulnerable [39]. The foregoing reveals that that there is a paucity of studies that explicitly examine the impact of health insurance on health outcomes at the population level and particularly in SSA. Despite widespread efforts to expand health insurance in developing countries, there is scant evidence as to whether doing so actually improves people’s health [28].

Kenya has made a commitment to achieve UHC. Indeed, one of “The Big Four” agenda priority policy objectives of the Kenyan Government is to focus on initiatives that guarantee universal health care to all Kenyans by 2022, through health insurance [40]. The Government of Kenya’s (GOK) decision to implement its UHC initiatives through NHIF was a major achievement towards UHC [41]. Another milestone towards achievement of UHC in Kenya is the 2021 NHIF amendment bill, assented into law in January 2022 [42]. This is part of the GOK’s plan for achieving UHC, based on need rather than, the ability to pay. The bill stipulates that, all Kenyans must be registered under NHIF, in the new Universal Health Coverage (UHC) program, with the government committing to cover the poorer households. The success of UHC will largely depend on a reformed NHIF that improves efficiency, transparency, accountability and service quality [43, 44]. Although enrollment in health insurance in Kenya is low, it has been expanding in recent years [45,46,47,48]. Evidence on the question of whether health insurance improves health outcomes at the population level is important in relation to policy questions on the value of initiatives to expand coverage. This evidence is limited for Kenya. Furthermore, the paucity of impact assessment studies, differences in impacts given study populations, settings and disease conditions studied, constitutes knowledge gaps for Kenya as it strives towards UHC.

To address this knowledge gap, we utilized the most comprehensive and nationally representative Kenya Integrated Household Budget Survey (KIHBS) 2015/16 dataset to analyze the impact of health insurance on health outcomes in Kenya. To contextualize the reporting of the results on the impact, we analyzed the characteristics of the insured and uninsured populations in Kenya. We used the instrumental variables (IV) and control function estimation approaches to address potential endogeneity due to omitted variables, selection bias as well as heterogeneity. Tests of significance of differences in means and proportions across health status by insurance status and other control variables were performed using pairwise comparison and t-test statistics.

Kenya’s healthcare financing through insurance

Kenya has a mixed health financing system that relies on several sources of financing. These include revenues collected by the government through taxes and donor funding, the NHIF through member contributions, private health insurance, and out-of-pocket spending by citizens at points of care [8, 49]. Kenya’s health system financing is predominantly tax-funded, through budgetary allocations to the Ministry of Health (MoH) [49]. The government caters for about a third of financing with the rest being private and mainly OOP payments [48]. Existing studies indicate high OOP spending in Kenya [50, 51], with attendant catastrophic and impoverishing expenditures [52,53,54].

Kenya has not yet implemented a universal social health insurance scheme to date. The President declined to assent to a national social health insurance fund (NSHIF) Bill introduced in 2004 [51]. The health insurance landscape in Kenya consists of public, private, and community-based insurance schemes. The NHIF, established in 1966, is the government mandatory insurance scheme for those in the formal employment sector (private and public) and voluntary for those in the informal sector [55]. NHIF dominates the health insurance landscape due to its mandatory scheme for all formal employees and the low premiums, especially for the informal sector enrollees. Membership in private health insurance offered by general insurance firms as one of their products and largely driven by profit motives rather than health promotion is voluntary. It is usually very expensive and only affordable to the middle- and high-income population groups [51]. Other forms of private insurance include community-based health insurance schemes whose growth has been minimal [51]. Health insurance coverage in Kenya has remained below 20% of the population to date [45,46,47,48, 55, 56]. Munge et al. [48] revealed that the majority (88%) of Kenya’s insured population is covered by the NHIF, while the rest are covered by private health insurance. The growth in NHIF enrollment in the last ten years is attributed to reforms largely aimed at increasing coverage among those employed in the informal sector [8]. Private insurance coverage is low (4% of the population) and mainly concentrated in urban areas and among the middle- and high-income groups [51]. In Kenya, it is common for individuals to have more than one insurance scheme. As such, there is no mutual exclusivity in enrollment into any scheme.

The NHIF operates three main schemes namely; the civil service scheme (CSS), the national scheme (SupaCover), and health insurance subsidy for the poor (HISP). Each scheme offers different benefit packages with considerable variation between inpatient and outpatient care [8, 57, 58]. NHIF also implements the free maternity, older persons and persons living with severe disabilities, and secondary school students’ insurance programs in order to enhance social protection and inclusivity [58].

Informal sector workers enroll into NHIF by paying a fixed annual premium of KES6,000 (USD 59.9)Footnote 1 for SupaCover. Coverage rates are low and attrition is high among this group [8, 48], posing universal coverage challenges. Formal sector employees pay a graduated monthly rate based on salary scale, ranging from Kenya Shillings (KES)150 (USD 1.5) to KES1,700 (USD 17) [55, 58]. Co-payments vary across public (category A), private faith-based (category B) and high cost private (category C) hospitals. NHIF outpatient benefits are based on a positive list of services and payments are made through capitation based on the number of persons registered at a particular facility [59]. The capitation rate ranges between KES 1000 (USD 10) and KES 1400 (USD 14) per beneficiary. Inpatient payment rates for category A and B facilities are 100%, and up to a maximum of KES4,000 (USD 39.9) per day in category C [58]. The CSS benefit package, launched in 2012 was negotiated by the government and NHIF and is delivered through a capitation model at a rate of KES1,500 (USD 15) per annum for public facilities, and KES 2,850 (USD 28.4) for private facilities. Benefits and health facility access vary for civil servant job-group cadres who enjoy a better and wider benefit package than SupaCover [8, 57]. The vulnerable and special groups are exempted from premium payments and enjoy the SuperCover benefit package. Though heterogenous in scheme types, premiums and benefit entitlements, Kenya’s health insurance model resembles that of many LMICs pursuing healthcare financing reforms geared towards UHC [30, 31].

Methods

Data source

We use data from the Kenya Integrated Household Budget Survey (KIHBS) 2015-16. The survey, conducted by Kenya National Bureau of Statistics (KNBS) in 2015-16, is nationally representative and comprehensive in coverage. The survey details are described in KNBS [60]. Through multistage sampling a total of 2,400 clusters (urban = 988; rural = 1,412) were sampled from national sample survey and evaluation programme-five (NASSEP V) sampling frame. The second stage involved selection of 16 households from each of the clusters and the third stage sub-sampling of 10 households. The sample size was determined independently for each of the 47 counties, resulting in a national sample of 24,000 households and 92,768 individuals. We extracted data by merging four sets of questionnaires and associated modules to obtain our variables of interest. This resulted in 21,773 individuals. The IV 2SLS and control function estimation is based on 2,789 individuals due to listwise deletion in Stata.

Outcome measures and independent variables

We used mortality and chronic condition experience as measures of health outcomes. Both are binary variables indicating presence of death in the household in the last 24 months prior to the survey and presence of chronic illness in the last four weeks respectively. Chronic illness variable was reconstructed by using information on the type of illness or injury suffered and recoded into a binary variable (chronic = 1 and non-chronic illnesses = 0) based on Bernell and Howard [61]. Health insurance status is a binary variable where 1 = insured and 0, otherwise. It comprised of enrolment in any health insurance scheme. The choice of explanatory variables was guided by theoretical and empirical studies. Literature reviewed suggested that demand for health insurance and subsequent impact on health outcomes differs by socio-economic and demographic characteristics [7, 11, 13, 17, 34, 37, 39]. The variables included age, age groups [9, 32, 37, 56] gender [32, 51], marital status [9, 32, 37, 51, 56], religion [62], education [9, 11, 37, 51, 62], media exposure [56], source of energy [9], drinking water [9], residence [9, 56, 62], household size [9], employment status [9, 32, 51, 56, 62], income quintiles and out-of-pocket expenditures [4, 32, 37, 51, 56, 62]. These were used as control variables. In addition to proportion of individuals with insurance within a cluster, we included community level characteristics such as distances to basic services, as additional instruments [63,64,65].

Identification Strategy

The gold standard for assessing causal effects is randomized control trial (RCT) such as in the classic Rand HIE [17, 66], but this is rare particularly in LMICs settings. In their review of the impact of health insurance on health, Levy and Meltzer [16] state that many of the studies claiming to show a causal effect of health insurance on health suffer methodological inadequacies because the observed correlation between insurance and good health may be driven by other, unobservable factors. Enrolment into any health insurance scheme may be based on specific population needs that may self-select into insurance schemes and thereby possess different characteristics and outcomes from the non-enrolled population, leading to selection bias. Selection bias arises when a factor affecting participation is associated with the outcome variable. For instance, the sickly may enrol due to a known need for health care services while the healthy may opt out, in both instances self-selecting whether to participate or not. It is plausible to assume that there is a bi-directional causality between health status and health insurance uptake. An individual’s health insurance status is potentially determined by some similar factors that determine health status. As such health outcomes for the insured and uninsured individuals, on one hand, may result from health insurance status or differences between individuals by insurance status both observable and unobservable. On the other hand, health status can also directly affect insurance uptake. This bi-directional causality raises estimation problems [16]. In Kenya, enrolment into insurance is not a random process as people chose whether or not to enrol. Some are enrolled through mandatory requirements in the NHIF whereby they are selected by virtue of being in formal employment while those employed in the informal sector voluntarily take up NHIF. In addition, those with NHIF may also voluntarily purchase private health insurance.

Instrumental variables estimation is one of the empirical strategies that exploit the exogenous variation created by natural experiments, such that, health insurance varies in a way that is unrelated to unobservable characteristics that also determine health, allowing isolation of the causal effect of insurance on health. Although the instrument introduces an element of randomness into treatment assignment, approximating the effect of an experiment, it is extremely difficult but possible to identify relevant, valid and strong instruments [4, 66,67,68]. Furthermore, the control function approach corrects for potential bias due to non-linear interactions of unobservable variables with the observed regressor [63, 64, 69]. Previous studies [13, 30, 70] suggest that the proportion of the population at a cluster level with insurance, can be treated as exogenous, and therefore a potential instrument for health insurance.

Analytical models

We used instrumental variable (IV) estimation and control function approach (CFA) to establish the causal effects of health insurance on health status. Potential endogeneity between health status and health insurance may arise from errors-in-variables, omitted variables and simultaneous causality [71]. If the error terms of the health (mortality, chronic illness) production equation and the linear projection of the endogenous variable (insurance status) are correlated it means, there is some unobserved trait that makes people who purchase insurance more or less likely to be healthy in a future period, thereby biasing the coefficients. Dor et al. [37] gives an example: if insurance is positively correlated with an unobserved trait, e.g. ‘‘awareness’’ which leads a person to take better care of their health, the error terms would be positively correlated, biasing the coefficient of insurance upwards. The opposite would hold if the unobserved trait causes a person to neglect their health. The final direction of the simultaneity bias cannot be ascertained a priori. Mwabu [64] observes that endogeneity and heterogeneity bias, compromise the validity of OLS estimators, requiring approaches that exogenize endogenous regressors such as health insurance status. The intent of the IV approach is to exogenize the endogenous regressors using valid, relevant and strong instruments [71]. The instrumental variable approach requires that the instruments are tested for relevance, strength and validity. In addition, in order to use the approach, it is necessary to ascertain that the endogenous variable is indeed endogenous and that the coefficients of the control variables are zero, signifying absence of heteroskedastic errors. The estimation technique uses a single equation approach using two-stage least squares (2SLS) estimators [64, 71, 72].

The IV method is one of the most powerful tools in econometrics, because it allows consistent parameter estimation in the presence of correlation between explanatory variables and disturbances i.e. endogeneity [73]. The method also identifies causal or treatment effects and it essentially assumes that some components of non-experimental data are random [65]. Instruments are variables thought to have no direct association with the outcome (health outcomes) [74] and are powerful predictors of treatment (health insurance) [72]. Though finding relevant, valid and strong instruments is difficult, techniques for testing for these qualities exist [75,76,77] as discussed under diagnostic tests below. Following the literature [13, 16, 30, 35, 37] we use the proportion of individuals with insurance within a cluster excluding the individual as an instrument for health insurance. The argument is that the individual does not affect this proportion and is therefore treated as exogenous.

Equations 1–3 summarise the empirical specification for IV 2SLS and CFA based on Mwabu [64] and Baye and Fambon [78] with adaptation from Bascle [71].

$$Yi\;=\;\beta_1X_1\;+\;\beta_2M_{li}+_{...}+\;\beta_{i+r}M_{ri}\;+\;\mu_i$$
(1)
$$X_{i}=\pi_{1}V_{1i}+{...}+\pi_{m}V_{mi}+\pi_{m+1}M_{1i}+{...}+\pi_{m+r}M_{mi}+\upsilon_{i}$$
(2)

Where: Yi represents health outcome (mortality, chronic illness); Xi is endogenous determinant of health status, i.e. health insurance; M1i….Mri are each of the r exogenous covariates or control variables, such as socio-demographics; Vi are a vector of instruments that affect the endogenous determinant of health status (health insurance) but have no direct influence on health status; β and π are the parameters to be estimated; ui, and υi are the disturbance terms. Equation 1, represents the health production technology and is the structural equation of interest. Unobserved complementarity between health insurance and other inputs omitted from this equation, may result in reverse causality. This is eliminated through the use of instrumental variables that do not belong to Eq. 1 but are strongly correlated with the endogenous determinant of health production; health insurance. Equation 2 is a linear projection of health insurance on all the exogenous variables, M1i….Mri and Vi. Equation 2 is a reduced form linear probability model of health insurance (the endogenous) input into health production. The predicted values of health insurance are used to compute residuals that enter Eq. 3, which is Eq. 1 augmented into a control function. To correct for potential endogeneity and non-linear interactions of unobservable variables with the observed regressor specified in Eq. 1, the equation is extended as indicated in Eq. 3.

$$Y_{i}+\beta_{1}X+\beta_{2}M+\delta\,R+\gamma\left(R\,\mathrm{x}\,X\right)+\mu_{i}$$
(3)

Yi is as previously defined, X is previous Xi,M is previous M1i, ….Mri. R is the residual of the endogenous input (i.e. observed value of X minus its fitted value), (R x X ) is the interaction of the residual with an endogenous input. δ and γ are parameters to be estimated. The terms R and (R x X) in Eq. 3 are the control function variables. They control for the effects of unobservable factors that would contaminate the OLS estimates of the structural parameters of health status measures. R serves as a control for unobserved variables correlated with X, thus allowing X to be treated as though they were exogenous during estimation. (R x X) controls for effects of neglected non-linear interactions of the unobservable variables with the health status measures. Equation 3 is the correct specification in the absence of a priori information on the econometric problems present [64]. This equation was estimated using the maximum likelihood estimation (MLE) procedure in Stata/SE 14.1 statistical software.

Diagnostic tests

The Durbin and Wu-Hausman test statistics determine whether the endogenous regressors in the model are in fact exogenous [75, 76, 79, 80]. If the test statistic is significant, then the variables being tested must be treated as endogenous. If the endogenous regressors are in fact exogenous, then the OLS estimator is more efficient, and depending on the strength of the instruments and other factors, the sacrifice in efficiency by using an instrumental variables estimator can be significant. Thus, unless an instrumental variable estimator is really needed, OLS should be used instead. Various statistics are used to measure the relevance and strength of the excluded exogenous variables depending on the number of endogenous regressors. The partial R2 and the first-stage F-statistic are reported in case of one endogenous regressor whereas Shea’s partial R2 and adjusted partial R2 are reported in case of two or more endogenous regressors. Higher values of R2 may indicate stronger instruments, though not always the case [74, 77, 79,80,81,82].

The partial R2 statistic measures the correlation between the endogenous variable and the additional instruments after partialling out the effect of the other exogenous variables in the equation, that may cause higher R2. Bound et al. [74] and others have promoted using the partial R2 statistic. The minimum eigenvalue statistic is a further test of weak instruments based on Cragg and Donald [81]. If the model contains one endogenous variable the minimum eigenvalue statistic and the F-statistic are identical. The Stock and Yogo [82] minimum eigenvalue statistic tests the hypothesis that the set of instruments are weak. If the test statistic exceeds the critical value, the instruments are deemed strong. The critical values are based on the largest relative bias of the 2SLS estimator tolerable or the largest rejection rate of a nominal 5% Wald test tolerable, and these are provided in Stata output [79]. The F-statistic tests for the joint significance of the coefficients on the additional instruments. If the p-value shows that the F statistic is not significant, then the additional instruments have no significant explanatory power for the endogenous variable after controlling for the effect of the exogenous covariates. Stock et al. [77] suggest that the F statistic should exceed 10 for inference, based on the 2SLS estimator, to be reliable when there is one endogenous regressor. Sargan’s and Basmann’s chi2 statistic test overidentifying restrictions [75, 76]. In addition to the requirement that instrumental variables be correlated with the endogenous regressors, the instruments must also be uncorrelated with the structural error term, thereby assuming that the errors are independent and identically distributed normal. Tests of overidentifying restrictions test two different things simultaneously. One is whether the instruments are uncorrelated with the error term, indicating if the errors are homoscedastic or heteroskedastic. The other, is that the equation is misspecified and that one or more of the excluded exogenous variables should in fact be included in the structural equation. Thus, a significant test statistic could represent either an invalid instrument or an incorrectly specified structural equation. Murray [75, 76] suggests the use of these tests, among other strategies, for supporting instrument validity. Another strategy is to rule out links between instruments and error term based on theoretical reasoning and evidence of non-correlation. Thirdly, use alternative instruments and compare the results i.e. do all the instruments tell the same story? Fourthly, use and check intuition including appealing to economic theory to establish validity. The F-test indicates whether the coefficients on the control variables are zero. If the p-value is not significant, then the hypothesis that the coefficients are zero is supported, signifying that the errors are homoskedastic (i.e. no heterogeneity).

Results

Sample description

Table 1 presents descriptive statistics for the sample. Among the 21,773 individuals, 18% were covered by any health insurance. For the majority (94%), NHIF was the first source of health insurance while other sources included contributory private insurance (3.3%), employer contributory (1.2%), and private non-contributory (0.6%). This is an indication of limited levels (4%) of private insurance uptake in Kenya, with negligible contribution from both contributory and non-contributory employer schemes and underscores the prominence of the state-run health insurance scheme. It is worth noting that 38.3% of those with health insurance were employed in the formal sector while about 22% were employed in the informal sector (Table 2). The rest were not employed and therefore presumed to be dependents. The instrument for health insurance, proportion of individuals with insurance in a cluster excluding the individual was 12%. About 23% of individuals experienced OOP expenditures, with mean OOP expenditures of KES15,619 (USD155.8). Mortality was 3.8%, with female-headed households experiencing higher mortality (4.6%) compared to male-headed households (2.6%). Of those that experienced mortality, the majority were not covered by any form of health insurance, suggesting some role for health insurance in mitigating mortality. Similarly, those that experienced mortality incurred OOP expenditures averaging KES12,001(USD119.7) compared to KES14,805 (USD147.7) of those that did not. This is an indication that ability to finance health care through OOP may potentially mitigate mortality through healthcare access. About 24% of individuals had an illness or injury in the last four weeks preceding the survey out of whom 4% suffered chronic condition. Of those who had an illness or injury and suffered chronic condition, 55 and 62.4% respectively were females.

Table 1 Sample description
Table 2 Comparisons among Insured and Uninsured Populations

Table 2 compares insured (n = 3920) and uninsured (n = 17,883) individuals using pairwise mean comparison tests of the differences between means for continuous variables and tests on equality of proportions for categorical variables. The uninsured experienced significantly higher mortality which may imply poor access and delays in seeking care, compared to the insured. There were no significant differences in the presence of illness or injury, though the magnitude was higher among the insured. Chronic illness experience was significantly higher among the insured, suggesting adverse selection whereby the healthier population opts out of insurance. This may be driven by perceptions of benefits and costs in relation to health-related risks and insurance premiums, partly explaining the low levels of private health insurance in Kenya. A significantly higher proportion of the insured experienced OOP expenditures. The difference in mean OOP expenditures was positive and significant suggesting that the insured experienced higher OOP expenditures. While this seems contrary to theory and arguments for health insurance which predicts lower OOP expenditures for the insured, several plausible explanations may be advanced for this finding in the Kenyan context. First is the possibility that the insured are more likely to use health care services due to ease of access thereby incurring higher OOP spending in the process, given the insurance package type and extent of co-payment. Secondly, the insured may also need to use more health services in the face of adverse selection and third, is the possibility that the existing health insurance benefit packages are inadequate, requiring the insured to incur OOP expenditures. As expected, a significantly higher proportion of those in formal employments are insured (39% vs. 15%) while a higher proportion of those in the informal sector are uninsured (35% vs. 22%). The proportions of the insured increase with increase in income for the insured, and decrease with increase in income for the uninsured, and the differences are significant. The mean differences between the insured and uninsured are significant for all the socio-demographic characteristics. The insured are older, male, married, educated, Christians, exposed to media, have smaller household sizes and are predominantly urban dwellers. They also have better water and sanitation conditions as well as better access to basic social and community level services.

Impact of health insurance on health outcomes

Tables 3 and 4 present the OLS, IV 2SLS and CFA results for mortality and chronic illness models respectively. At 5.39 and 5.34, the Durbin X2 score and Wu-Hausman F-statistic were respectively both significant at p < 0.05 indicating that health insurance is endogenous (Table 3). The R2 of the first-stage regression was moderate at 0.323 suggesting moderately relevant instruments. Though the adjusted R2 is 0.315, partial R2 is rather low at 0.171 signifying that the instruments may be weakly relevant. It is worth noting that since we have only one endogenous regressor, the partial R2 and the Shea’s partial R2 for multiple instrument are identical. That, the first-stage F-statistic (71.09; p-value = 0.000) for testing the joint significance of instruments is greater than 10 indicates that the instruments are strong. At 71.09 (critical value = 20.25) of 5% 2SLS relative bias and 33.84 for a 10% rejection rate of a 2SLS size of nominal 5% Wald test, the minimum eigenvalue statistic further supports the finding that the set of instruments are strong. Sargan’s (6.18) and Basman’s (6.11) test statistics of overidentifying restrictions were both not significant. This suggests that the instruments are valid and that the structural equation is correctly specified. We conclude that the instruments for health insurance status in estimating mortality are weakly relevant, valid and strong and therefore reliable for inference. Furthermore, we establish that health insurance status is endogenous, that the structural equation is correctly specified, and that the IV 2SLS technique is appropriate for estimating the impact of health insurance on health outcomes (mortality).

Table 3 OLS, IV 2SLS and Control Function Approach Parameter Estimates of impact of Health Insurance on Health Status (Dependent variable = Mortality)
Table 4 OLS, IV 2SLS and Control Function Approach Parameter Estimates of impact of Health Insurance on Health Status (Dependent variable = Chronic Illness)

The Durbin X2 score (3.91: p-value = 0.05) and Wu-Hausman F-statistic (3.87: p-value = 0.05) confirm that health insurance is endogenous in the chronic illness model (Table 4). The R2 of the first-stage regression was moderate at 0.323 suggesting moderately relevant instruments. The partial R2 is rather low at 0.171 signifying that the instruments may be weakly relevant. However, the first-stage F-statistic (71.09; p-value = 0.000) for testing the joint significance of instruments is greater than 10, an indication that the instruments are strong. At 71.09 (critical value = 20.25) of 5% 2SLS relative bias and 33.84 for a 10% rejection rate of a 2SLS size of nominal 5% Wald test, the minimum eigenvalue statistic further supports the finding that the set of instruments are strong. Sargan’s (4.89) and Basman’s (4.84) test statistics of overidentifying restrictions were both not significant, an indication that the instruments are valid, and that the structural equation is correctly specified. We conclude that the instruments for health insurance in estimation of the impact of health insurance on health outcomes (chronic illness) are weakly relevant, valid and strong. Given that health insurance is endogenous and the structural equation is correctly specified, we can use the IV 2SLS estimates for inference.

The linear projection of health insurance on the exogenous covariates and the vector of instruments indicates that age positively and significantly predicts health insurance status (Tables 3 and 4, column 3). Older individuals are more likely to have health insurance. Also, a primary, secondary or higher education compared to none significantly promotes health insurance uptake. Media exposure, better access to drinking water and cleaner sources of energy for cooking also promote health insurance uptake. Improvements in income from the poorest to the richest income quintile positively and significantly predicts insurance uptake. Living in a household cluster with higher proportion of insured individuals increases the likelihood of having health insurance. Those experiencing OOP were likely to have insurance coverage. Easy access to basic community level services such as nearest secondary school, road and bus stage also positively predicts health insurance uptake. Being in a single marital status, whether by choice or circumstances negatively predicts health insurance uptake. Similarly, employment in the informal sector, being unemployed and non-Christian negatively predicts health insurance uptake.

Recall that due to the endogeneity of health insurance status, the OLS parameter estimates are biased. The IV 2SLS produces consistent estimates of health status production (mortality and chronic illness). The test on the coefficients of the control variables in the CFA equation are mixed. In both models, the F-test (3.50 and 2.92) indicates that the coefficient on the residual of health insurance status is significant at p < 0.1, indicating a marginal possibility of heteroskedastic errors. However, the F-test (0.73 and 0.53) on the coefficient of the interaction term between health insurance status and its residual is not significant, and is an indication that any neglected non-linear interactions have been controlled for. This finding requires that we interpret the CFA findings with caution, due to possible heterogeneity of the disturbances.

Tables 3 and 4 column 2 present the OLS estimates, column 4 the IV 2SLS and column 5 the CFA estimates of the impact of health insurance on mortality and chronic illness respectively. Table 3 indicates that the OLS coefficient on health insurance status is small, positive, not statistically significant and likely to be biased. The IV 2SLS coefficient is negative and significant and is nearly five and three times larger than the OLS and CFA coefficients. The CFA coefficient, though not significant is consistent with the IV 2SLS in terms of direction of impact. The size of the IV 2SLS and CFA coefficient indicates that we are likely to underestimate the impact of health insurance on health status if we do not take endogeneity into account. We conclude that having health insurance reduces mortality. Health insurance coverage should therefore be enhanced and promoted as it confers positive health benefits to the population. The coefficients on IV 2SLS and CFA for individuals who experienced OOP expenditures indicate worse off health status. Increasing OOP expenditures leads to higher mortality, suggesting the need for mitigating measures to reduce OOP expenditures. The impact of income on health status is negative for individuals in the richest quintile for the IV 2SLS and CFA. This is an indication that improvements in incomes are associated with better health outcomes. However, for the lower income quintiles the health status impact is adverse. Marital status has positive and significant impact on health status and the size of the coefficient is almost the same in all three models. Those in special single marital status are highly and significantly likely to experience mortality. This is consistent with the finding that this category of individuals was unlikely to have health insurance. It also suggests that this may be a marginalized population group in terms of economic empowerment and subsequent health status compared to the married population.

Table 4 indicates that the IV 2SLS coefficient on health insurance is 1.2 times larger than the CFA coefficient and nearly five times larger than the OLS one. Both the IV 2SLS and CFA coefficient on health insurance status are positive and significant, contrary to the finding using mortality as the health status measure. The positive coefficient suggests that having health insurance is linked to chronic illnesses, indicating a certain degree of adverse selection. It also points to the fact that the impact of health insurance on health outcome may be sensitive to the health outcome measure used. The coefficient on age is positive, significant and similar in IV 2SLS and CFA and slightly higher in OLS models. This indicates that as people get older, chronic illnesses increase and this is consistent with both theory and empirical literature. The impact of income on chronic illness is positive and significant for all quintiles except the middle-income quintile suggesting that, the poor and the rich experience poor health as measured by the presence of chronic illnesses. Experiencing higher OOP expenditures was linked to being in a worse off health status. Being employed in the informal sector or being unemployed has a positive coefficient in all models, implying worse off health status. Increasing levels of education are associated with better health status in the IV 2SLS and CFA models. Results also indicate that those in single marital status have worse off health status in terms of chronic illnesses compared to the married.

Discussion

An understanding of the differences between insured and uninsured populations and the impact of health insurance on health outcomes is vital for policy formulation on the value of initiatives to expand health insurance coverage. However, this is a largely under-investigated aspect in Kenya. To our knowledge, this is the first article that has utilized nationally representative data to understand the causal links between health insurance and health outcomes in Kenya. Furthermore, the estimation approach adopted accounts for potential reverse causality and heterogeneity biases. Our findings indicate that less than one-fifth of individuals were covered by any health insurance and private health insurance coverage is particularly low. This is consistent with Acharya et al.’s [31] observation that low enrollment persists for many insurance schemes in LMICs. Low insurance coverage is not conducive for financial risk protection and achievement of UHC. In addition, the extent of coverage in terms of the comprehensiveness of the insurance is important. Indeed, nearly one-quarter of individuals experienced OOP of a mean magnitude of 11% of per capita GDP (USD1410) in 2016 [83], clearly a significant proportion. In addition, the insured had higher chronic illnesses and OOP.

Both mortality and chronic illness experience were less than 4%, although nearly a quarter experienced any illness. This suggests that majority had good health status. The insured experienced lower mortality but higher illness and chronic illness suggesting the presence of adverse selection. That a higher proportion of the insured experienced OOP expenditures, though contrary to theory and arguments for health insurance, has several implications. It is plausible that ease of access to care for the insured results in higher OOP given the insurance package type and extent of co-payment. Barasa et al. [8] and Chuma et al. [84] highlight the problems with fragmentation of schemes and the adverse effects that service providers create through discriminating clients from different NHIF schemes, thereby occasioning OOP costs. It is imperative to note that the NHIF benefit packages for different population groups have exclusions and do not fully cover the healthcare costs at the point of use and for the full continuum of care even for disease conditions covered, e.g. kidney failureFootnote 2. Ataguba and Goudge [32] observe that insurance schemes with high co-payments due to low-cost minimum-benefits choices may lead to high OOP payments and create financial burden. Secondly, the insured may also use more health services [14] in the face of adverse selection and hence incur higher OOP costs. Third, is the possibility that the existing health insurance benefit packages are inadequate requiring the insured to incur OOP expenditures. These raise the question of whether health insurance in Kenya adequately shields against OOP costs and hence financial risk protection, requiring further empirical investigation. In their systematic review, Acharya et al. [31] document that few insurance schemes afford protection from OOP spending and Kenya appears to be one of those that do not. This calls for health insurance benefit packages that are comprehensive and adequate to reduce OOP expenditures. As expected, significantly more of those in formal employment and with higher incomes are insured. Insurance uptake differed significantly across all the socio-economic and demographic characteristics. The insured are older compared to the younger population, pointing to an area that should be targeted when trying to achieve UHC. In general, more of the insured were male, living in smaller households, urban dwellers, married, Christians, with secondary and higher education and exposure to media. Improving the levels of education and limiting family sizes should be key areas of focus for influencing the socio-economic and demographic characteristics associated with higher likelihood of health insurance uptake in Kenya.

We established that there is a bi-directional causality between health insurance status and health outcomes, namely mortality and chronic illness. Though our instruments as a set were weakly relevant, they were valid and strong and therefore reliable for inference. The finding that health insurance reduces mortality is valuable for Kenya, in its quest to improve population health and the achievement of UHC goals. Health insurance coverage should therefore be enhanced and promoted as it confers positive health benefits to the population. Our finding support Stone et al. [14] in Kenya, though their study was hospital-based. Other studies with similar findings include Rand [17], Currie and Gruber [35, 36] and Culyer and Newhouse [13], though in developed countries settings. Though our findings are Kenyan specific, they can be generalized to other LMICs with similar characteristics, albeit with caution. Using chronic illness as a health outcome measure resulted in unexpected impact of health insurance on health outcomes. The finding suggests that having health insurance is linked to chronic illnesses reinforcing the observation of a certain degree of adverse selection. It also points to the fact that the impact of health insurance on health status may be sensitive to the health status measure used. This calls for primary research using valid and reliable health status measurement and valuation tools rather than relying on limited secondary data in assessing the impact of health insurance and other health interventions on health outcomes.

The finding that the young and special-single were mostly uninsured and experienced higher mortality indicates key marginalized sections of the population requiring targeting for health insurance coverage in order to improve their health status. Conversely, though the older population had insurance, chronic illness experience increased with age, consistent with both theory and empirical literature. Since, health insurance reduces mortality and chronic illnesses mostly affect the older people, we suggest that the government should establish and strengthen comprehensive health insurance programs for the elderly. This is in light of age cut-offs for most private health insurers that leave this population in need and vulnerable. Individuals from larger households had worse health status, an indication that policies to reduce household sizes may indirectly contribute to UHC and other health system goals.

We have demonstrated that though health insurance coverage is low in Kenya, it is beneficial in reducing mortality. In view of the fragmented nature of the health insurance landscape, the government should in the short-term institute policy to consolidate different health insurance schemes in order to serve different population groups more effectively and equitably. To increase coverage, NHIF should design comprehensive benefit packages within the consolidated scheme with differentiated premiums, allowing individuals to choose the benefit package and premiums within the voluntarily component, while retaining the tax funded and social security health insurance component to cushion the poor and vulnerable. This brings in the question of the informal sector who are largely without insurance. The informal sector is widely heterogeneous and it is possible that sections of it may be characterized by higher income levels. The Chilean health insurance system allows those who can afford to opt out of the publicly funded health insurance to purchase private insurance [85] with requirement for everyone to have some form of insurance. This option can be explored in Kenya in the long-term as the economy grows to allow those in the formal and informal sectors of the economy, the choice of public or private scheme, but with mandatory requirements for all to have coverage. Such a policy could work if NHIF comes up with attractive schemes for the able informal sector segment. Indeed, there is a growing trend of like-minded people coming together as groups to negotiate for attractive group cover with NHIF (e.g. retirees and community-based organizations) pointing to opportunities for and an unmet demand for health insurance.

The fact that a higher proportion of the insured experienced OOP expenditures raises concern about the adequacy of the current health insurance arrangements in mitigating catastrophic health expenditures and impoverishment. This calls for a thorough re-examination and research on the adequacy and impact of health insurance on OOP expenditures. There is need to build a further evidence base to support the argument that health insurance protects against financial risks and impoverishment in Kenya, through government support.

This study had a number of limitations. Data limitations hampered the empirical assessments of the impact of health insurance on health status. Firstly, the dominance of NHIF, low levels of private insurance and nature of health insurance whereby individuals can have more than one type of insurance constrained the analysis of impact by insurance type. Further, health insurance data collected in routine surveys (e.g., KIHBS) are not disaggregated by type of enrolment (voluntary, mandatory), contributions (risk-rated, community-rated, income-based) and management (for-profit, non-profit, public). Secondly, other than mortality measures, there is a paucity of data on health measurement and valuation in Kenya. Though valid and reliable health valuation and measurement questionnaires have not been developed for Kenya, an EQ-5D version for Kenya is available. This study recommends that in order to inform the current efforts towards UHC through health insurance, the government should institute routine measurement and valuation of population health using the available health measurement and valuation instruments. In addition, during the national surveys, data on health insurance should be disaggregated. Moreover, an immediate policy should entail routine reporting of such data to a centralized health insurance agency in coordination with KNBS to facilitate empirical studies on the benefits of various health production inputs. Thirdly, the ideal analytical techniques to infer impact would be to use either experiments, panel or time series data that would allow for lags in treatments and effects. The cross-sectional nature of the data used in this study poses a limitation in impact assessment. Future studies should explore the generation and use of such data.

Conclusions and policy recommendations

Less than one-fifth of Kenyans are covered by any health insurance and private health insurance coverage is particularly low. The insured experienced lower mortality, more chronic illnesses and higher OOP expenditures. Experience of higher OOP expenditures may imply ease of access to care and more use of healthcare services for the insured while indicating shortcomings given the insurance package type and extent of co-payment, resulting in a financial burden. Kenya thus needs to design health insurance benefit packages that are comprehensive and adequate in coverage to reduce OOP expenditures, more so for the elderly who suffer more chronic illnesses. Targeting the young, special single and the vulnerable who were mostly uninsured and experienced higher mortality would be a desirable feature in the design of the insurance model, given the positive benefits of health insurance on health outcomes. Our finding that health insurance reduces mortality is valuable for Kenya in its quest to improve population health and the achievement of UHC goals.

We make the following policy recommendations to increase health insurance coverage as it confers positive health benefits to the population. In view of the fragmented nature of health insurance landscape, the government should in the short-term institute policy to consolidate different health insurance schemes in order to serve different population groups more effectively and equitably. To increase coverage, NHIF should design comprehensive benefit packages within the consolidated scheme with differentiated premiums, allowing individuals to choose the benefit package and premiums within the voluntary NHIF component, while retaining the tax funded and social security health insurance component to cushion the poor and vulnerable populations.

Policy reforms should also be instituted to reduce OOP and address the inadequacy of health insurance in ensuring access and protecting against financial risks. This calls for health insurance benefit packages that are comprehensive, adequate and accessible. This can be achieved through health systems strengthening to improve access to health services, institutional and governance reforms at the NHIF (the government’s preferred vehicle in the achievement of UHC) to improve and enhance benefits.

Recognizing the heterogeneous nature of the informal sector and that sections of it may be characterized by high income levels, a long-term policy recommendation is to allow those in the formal and informal sectors the choice to enroll in a public or private scheme, but with mandatory requirements for every citizen to have insurance coverage. The government should however maintain programs targeted at the poor and vulnerable populations.