Introduction

Income inequality refers to the uneven distribution of income between people, assessed across regions, states, or countries [1, 2]. It is generally agreed that income inequality is associated with health outcomes at an ecological level. Our focus is on the more disputed hypothesis that after accounting for the impact of individual income, there is a relationship between income inequality and health [3]. This relationship is of great interest to public health and policymakers [3,4,5], though research evaluating the relationship has produced mixed results [6]. A recent systematic review of reviews evaluating the relationship between income inequality and health found considerable variation in the findings of thirteen systematic reviews, none of which were considered high-quality [2]. The extensive statistical heterogeneity observed by these reviews indicates that a causal relationship between income inequality and individual health remains unconfirmed. Understanding reasons for different effect estimates observed across studies (i.e. statistical heterogeneity or transportability as it is known in causal inference) is an important part of causal assessment [7, 8].

Systematic reviews, because of the scientific approach to selecting, appraising, and synthesising evidence, are useful for transparently bringing together a relevant body of evidence, evaluating statistical heterogeneity, and critically assessing risks of bias [9, 10]. A rigorous, systematic, and transparent evaluation of the evidence and causality is necessary to be certain of the relationship between income inequality and health, which can then clarify the need for policymakers to intervene on the exposure [11, 12]. The meta-analysis by Kondo and colleagues [13], though it was considered the highest quality identified in the review of reviews and incorporated multilevel data, did not include a critical appraisal of the literature nor assess the certainty of their findings (e.g. via a GRADE assessment). Pickett and Wilkinson [14] (also included in the review of reviews [2]) incorporated Bradford Hill viewpoints to their review assessing the relationship between income inequality and mortality. However, they also did not use a systematic approach to searching, identifying, analysing, evaluating, and synthesising evidence. Thus, we have not identified reviews that have incorporated a rigorous and robust systematic review process that incorporates causal assessment.

Income inequality and health

Early understanding of the relationship between income inequality and health was largely based on ecological studies [15,16,17]. The unit of analysis in these early studies was populations (usually countries or states), and though it is not described in this way, these studies also appear to argue that the relationship between income inequality and aggregate health is confounded by country/state-level income (e.g. GDP, GDP per capita). The directed acyclic graph (DAG)Footnote 1 in Fig. 1 illustrates this relationship. There is a theorized non-linear relationship between previous area-level income and current aggregate health such that increasing area-level income increases aggregate health until a threshold where area-level income has no effect on aggregate health [20].

Fig. 1
figure 1

Directed acyclic graph (DAG) illustrating the relationship between income inequality (X), aggregate health (Y), and area-level income (N). Subscript t indicates the time

Area-level income confounding

Many ecological studies (and reviews of income inequality and health) address area-level income confounding (including related concepts such as GDP) by limiting the analysis to high-income countries (HIC) [16, 17] though it may not be appropriate to do so. These early ecological studies argued that the effect of area-level income confounding is hypothesized to be greater in low- and middle-income countries (LMIC) than in HIC [3]. However, recent evidence suggests the effect may be comparable in both HIC and LMIC [21]. In addition, considering the effect of income inequality on individual health in only HIC limits our understanding of the effect of income inequality on individual health in LMIC [22]. The extent to which area-level income is a confounding variable has also been debated. According to political economy theory, governmental policies affect income inequality which in turn affects future economic growth, regardless of the baseline level of inequality prior to government actions [23]. Economic growth affects current area-level income, such that area-level income may mediate the relationship between income inequality and aggregate health. However, the putative effect of income inequality on economic growth may depend on relative area-level income; the inverted-U hypothesis (known as the Kuznets curve [24]) suggests that the relationship between economic growth and income inequality is positive in LMIC and negative in HIC [25]. The empirical evidence regarding both the direction [25,26,27] and the sign of the relationship between income inequality and economic growth is mixed, with some arguing that the latter may be impacted by many other factors such as technological advancements [23, 25]. Thus, there is disagreement over the appropriateness of conditioning upon area-level income is an over-adjustment if it is a mediator rather than a confounder [22, 27].

Individual income confounding

With all other things being equal, a country with high-income inequality will likely have more low-income individuals (including a greater number of people in poverty i.e. falling substantially below the median income [28]) than a country with low-income inequality. The country with more low-income individuals will have disproportionally worse population health outcomes than the country with fewer low-income individuals. That is, the relationship between income inequality and individual health could entirely be explained by individual incomes [29]. Individual health, which has a non-linear relationship with individual income where the benefit of income for individual health depreciates with increasing income (known as the absolute income hypothesis) [30]. In other words, a £100 increase of income for low-income individuals (e.g. from £1000 to £1100) will produce a greater increase in health than a £100 increase for high-income individuals (e.g. from £100,000 to £100,100) [29]. Thus, individual income is a cross-level confounding variable [31] since it affects both the ecological exposure (i.e. income inequality is derived from individual incomes) and the individual-level outcome (individual-level health) [32] (see Fig. 2). Multilevel data, which include both aggregate- and individual-level variables, are necessary to disentangle the impact of individual income on income inequality and health and to evaluate the potential causal link [33].

Fig. 2
figure 2

Directed acyclic graph (DAG) illustrating cross-level confounding of an individual’s income on the relationship between income inequality and individual health, annotated with subscript t to account for time. Area-level variables (e.g. income inequality, X) are capitalized while individual-level variables are lower-cased (e.g. individual-level income i) [31]. Individual income should be conditioned upon (represented by the square around individual-level income) to remove the confounding effect of individual income on the relationship between income inequality and individual health

What is the mechanism by which income inequality causes individual health?

The psychosocial and neo-material frameworks offer two different, and often polarizing, explanations for the mechanisms through which income inequality affects individual health [5, 34]. While these frameworks are positioned as competing explanations [34], it is unlikely that either framework perfectly describes the mechanism between income inequality and individual health. While our systematic review does not aim to examine mechanisms, these can offer insight into the debate over which variables should be conditioned upon in any analysis. We use DAGs to articulate the different explanatory mechanisms described by each of these frameworks as they relate to our unit of analysis, individual health. Importantly, the DAGs are based both on our interpretation of key texts relating to each framework as well as our current understanding of the relationship structures. These frameworks were likely not written with causal thinking, as we know it today, in mind and thus illustrating these mechanisms through DAGs may not reflect the frameworks as they were originally intended.

Psychosocial framework

According to the psychosocial framework (popularized by Richard Wilkinson and Kate Pickett in their book, The Spirit Level [35]), social status and hierarchies amongst individuals are more noticeable in countries with high-income inequality than in those with less income inequality [36]. Income inequality leads to social differentiation and comparisons, which has adverse psychological consequences (e.g. reduced social cohesion and trust; stress from comparing yourself to people who have more incomes and material goods; frustration and despair that could lead to crime and violence) which in turn negatively affects health [37,38,39]. The heightened awareness of hierarchy in high-income inequality societies is also hypothesized to impact individual behaviour, such that attenuated social cohesion increases the likelihood that individuals will smoke, consume alcohol and have diets that increase their risk of mortality [36]. The relationship between income inequality, psychosocial factors (such as social cohesion and behaviour), mortality, and individual income is summarized in Fig. 3.

Fig. 3
figure 3

Direct acyclic graph (DAG) illustrating the relationship between income inequality and individual health mediated by psychosocial factors and confounded by individual income. Subscript t indicates time while area-level variables are capitalized and individual-level variables are lower-cased. This DAG reflects our general understanding of the psychosocial literature and is not intended to reflect the framework as described by any individuals [35]. According to the literature, psychosocial factors are theorized to mediate the effect of income inequality. Individual income is theorized to be a proxy for socioeconomic position, and some argue that conditioning upon individual income may be an over- adjustment that will underestimate the effect of income inequality on health. However, particularly as our outcome under study is individual health, we argue that individual income will not completely account for the individual social position (hence the line from individual income to psychosocial factors) and should be conditioned upon

As noted above, this DAG was developed to reflect how the psychosocial framework may be applied to our unit of analysis and our understanding of the impact of individual income on the relationship, and as a result, slightly diverges from some key elements of the original framework. Firstly, the outcome under study here is individual health, not aggregate health as originally described [38, 40]. Wilkinson, both in his early work on income inequality and life expectancy [17] and in his later work with Pickett when the psychosocial framework was formalized, has been criticized [41,42,43] for relying on ecological studies to make assumptions about individual health. They both disputed this claim and insisted that findings from ecological studies were only used to understand the relationship between income inequality and population health [14, 44].

The second key element of the psychosocial framework altered in our DAG is conditioning upon individual income. The psychological framework posits that individual income should not be conditioned upon, even if the unit of analysis is individual health, because it is a proxy for psychosocial consequences of social differentiation and comparison (which mediate rather than confound the relationship between income inequality and health) [44]. However, we argue that individual incomes (and other socio-economic characteristics such as socioeconomic position (SEP)) are poor proxies for psychosocial factors and should be conditioned upon as candidate confounders (as has been done in several studies [45,46,47] and reviews [13, 48]).

Neo-material framework

The neo-material framework hypothesizes that political and economic actions affect income inequality and public resources, such that individuals with fewer resources are disproportionately disadvantaged in countries with high-income inequality [5]. Political and economic actions can include taxes, cash transfers, political structure, and power of organized labour. In our interpretation of early work on the neo-material framework [5] (see Fig. 4), political and economic factors are an upstream confounder of the relationship between income inequality and individual health and while individual income is also a confounder, it is downstream of political and economic factors. In addition to income inequality, political and economic factors affect both public service provisions (on the pathways between income inequality and health) and individual income (e.g. cash transfers). The neo-material framework highlights why time lags may be necessary to understand the effect of income inequality on individual health [3]. For example, it may take as much as fifteen years [49] for the effect of limiting organized labour’s power to increase income inequality and adversely impact health.

Fig. 4
figure 4

Directed acyclic graph (DAG) of the simplified relationship between income inequality and individual health, confounded by individual income, political and economic factors and mediated by public service provisions. Time is indicated with a subscript t while area-level variables are capitalized and individual-level variables are lower-cased. This DAG reflects our general understanding of the neo-material literature, though the structure of the relationship (including whether public service provisions are a co-exposure and thus should be conditioned upon, not shown) remains debated

We also argue that political and economic actions can modify the effect of income inequality on individual health through public service provisions (e.g. strength labour unions to improve access to healthcare). However, it is important to note that the interpretation in Fig. 4 may not accurately represent the relationship; it may be that public service provisions are co-exposures of individual health and occur alongside income inequality [50] (i.e. without the arrow to income inequality shown in the DAG).

Methods

Information in this systematic review protocol is reported according to the Preferred Reporting Items for Systematic Review and Meta-Analysis Protocols (PRISMA-P) guidelines [51]. The systematic review was submitted to PROSPERO (CRD42021252791) and any subsequent amendments to the protocol will be documented and published in the systematic review.

Aims and objectives

The main aims of the systematic review are to understand if there is a relationship between income inequality and individual mortality and/or SRH, and, if so, if that relationship is causal. While our aims do not include understanding the putative causal mechanism, we will consider evidence of a causal mechanism to strengthen our certainty that a causal relationship exists. To address these aims, we will attempt to address the following questions:

  1. 1.

    What evidence is there of a relationship between income inequality and individual mortality and/or SRH?

  2. 2.

    What is the magnitude of the association between income inequality and individual mortality and/or SRH?

  3. 3.

    To what extent does the available evidence support a causal relationship between income inequality and individual mortality/SRH?

Study design

To answer our research questions, this systematic review will incorporate approaches to causal assessment. We will apply adapted Bradford Hill viewpoints where two viewpoints (coherence and analogy) have been excluded. Our interpretation and approach to applying the viewpoints, including which viewpoints to use, is based on earlier research comparing different approaches to causal assessment [52] and a scoping review of causal systematic reviews (unpublished). These interpretations were used to inform our search strategy and inclusion and exclusion criteria. The viewpoints, their interpretation, and the evidence we will use to determine if a viewpoint is met are in Table 1.

Table 1 Causal assessment approach

Eligibility criteria

The inclusion and exclusion criteria for our review are summarized in Table 2.

Table 2 Inclusion and exclusion criteria

Search, screening, data extraction, and critical appraisal

Search strategy

We developed our search strategy with the support of an information scientist. Our strategy was based on two categories of keywords: income inequality (e.g. “income”, “inequality”, “income inequality”, “Gini”) and health (e.g. “mortality”, “all-cause”, “life-expectancy”, “death”, and “health”). We will search MEDLINE, ISI Web of Science, EMBASE, and the National Bureau of Economic Research. The search strategy is in Appendix. We do not plan to search grey literature but references in all included studies will be checked to retrieve any additional articles missed by our search strategy.

Study selection process

Studies identified from the search will be imported into Covidence where they will be assessed against the inclusion and exclusion criteria. One reviewer (MS) will screen all titles to identify and exclude references that obviously do not meet both the exposure and outcome criteria. An initial “title only” screening is a time-efficient approach that is unlikely to affect the amount of relevant included studies [59]. Subsequently, two reviewers will independently screen titles and abstracts of all remaining references. Studies will be included if the study design is unclear but all other criteria are met. Of those that potentially meet the inclusion criteria, full-texts will be identified and independently screened by two reviewers. The reasons for excluding full-text studies will be recorded and a third reviewer will resolve disagreements.

Data extraction

One reviewer will extract data from included studies. A second reviewer will check all of the extracted data. We will extract study information, the indicators used to measure income inequality and morality/SRH and baseline information in the population include sex, socioeconomic status, ethnicity, and age. We will note if the studies explain why certain variables were conditioned upon. A sample data extraction form is provided in Table 3.

Table 3 Data extraction information

Risk of bias assessment

Two reviewers will independently assess the included studies for bias using the ROBINS-I tool for observational data [60]. For ROBINS-I, we have developed a target trial that considers exposures, comparisons, confounding, and co-exposures. Confounding variables are those that cause both the exposure and outcome under study, while co-exposures are those that may be received alongside the exposure understudy and may be associated with (though not necessarily the cause of) the exposure and outcome under study [60]. This is helpful when considering if studies conditioned upon variables that we have considered mediators (and should not be conditioned upon) and those that we have considered confounding variables (and should be conditioned upon) [61]. Though ROBINS-I has been criticized for low interrater reliability [62] and is often misapplied [63], it facilitates thorough and systematic methodological evaluation of non-randomized studies using principles of causal thinking [64]. This methodological evaluation will be used to inform considerations of certainty through GRADE and causal assessment using the adapted Bradford Hill viewpoints.

Strategy for synthesis and causal analysis

Our method for data analysis includes both a meta-analysis as well as causal analysis. As it is unlikely we will be able to test for political and economic factors, we will narratively describe the differences across the included studies.

Meta-analysis

We will conduct a meta-analysis to determine the overall association between income inequality and mortality as well as a separate meta-analysis of and income inequality and SRH. Where possible, we will synthesize the outcome measures for mortality (mortality rates, life-expectancy) and SRH (SRH, wellbeing). All meta-analyses will be random-effects, given the considerable heterogeneity expected across contexts, study designs, and populations. If possible, we will convert all of the indicators into Gini coefficients and standardize the effect estimate of income inequality on mortality/SRH. as this is likely to be the most commonly used indicator for income inequality [13]. In addition, we will explore factors we believe may account for statistical heterogeneity (see Table 4) as part of understanding the causal relationship.

Table 4 Target trial characteristics for ROBINS-I risk of bias

GRADE

We will perform a GRADE assessment, a widely adopted approach for assessing the certainty of evidence across a body of evidence, on both outcomes (mortality and SRH) [65]. The GRADE assessment of certainty, which describes the confidence in the effect estimates [65] is based on the assessment of risk of bias [66], indirectness of included evidence [67], imprecision [68], inconsistency [69], and publication bias across the body of evidence [70]. Certainty from non-randomized studies (NRS), which are the most likely types of studies included in this review, are automatically rated at the same level (“high”) as randomized controlled trials when using ROBINS-I, and then subject to downgrading based on a set of domains [71]. Large associations, dose-response relationships, and adjusting for plausible confounding upgrade certainty of the evidence for NRSs.

Causal assessment

We will assess the studies to explore the causal relationship between the income inequality and mortality. Table 1 provides an overview of the evidence we will consider for each Bradford Hill viewpoint. Together with the meta-analysis, we will narratively describe our confidence, based on the evidence, that there is a relationship between income inequality and individual health and our confidence that it is causal.

Conclusion

This protocol describes the methods for conducting a causal systematic review on income inequality and individual health. In the first section, we highlighted the debate over the nature of the relationship and explain consequences for observed associations. While testing the two explanatory, and often competing, mechanisms are beyond the scope of this systematic review, we are explicitly favouring the neo-material framework by both including multilevel studies that condition upon individual income and considering political and economic factors as co-exposures. We expect the complexities of standardising and synthesising evidence from a wide range of studies using different income inequality measurements and statistical methods to be amongst the most challenging aspect of this review. We hope this review will not only elucidate the relationship between income inequality and health but also act as an exemplar for transparently performing causal assessment in evidence synthesis.