Introduction

The purpose of this paper is to empirically explore the relationship between the quality of institutions and population health at Italian regional level. The issue of institutions has recently drawn considerable attention in the health economics literature, as it acknowledges the fact that health outcomes depend not only on socioeconomic, political, and cultural factors but also on the capacity of institutions to offer public services efficiently, effectively and on time [1]. For this reason, it is obvious that, for example, universal health coverage—free at the point of use—can be established by law, but if the healthcare system is not adequately financed, governed and monitored, the effect on the health of the population might be detrimental. Good health system governance also requires civil society participation and government transparency and accountability [37].

These considerations lead to the issue of extending the traditional set of health explanatory variables to also include institutional quality elements to shape and develop an intersectional framework in which these two orthogonal sets of determinants are instead treated as two complementary determinants [47]. Within this framework, Knowles and Owen [61] study the role of institutional variables in explaining cross-country variations in life expectancy in a sample of 73 high- and low-income countries. Using indicators related to both formal institutions (ruled by law and regulation) and social capital (ruled by conventions, social norms and codes of behaviours), they show that an improvement in institutions has a statistically and quantitatively significant positive effect on life expectancy. Similarly, Hadipour et al. [48], using a panel dataset from 158 high- and low-income countries between 2001 and 2020, find that institutional quality has a positive impact on life expectancy and a negative impact on infant mortality rates. Narrowing to the EU context, Sharma et al. [83] find a positive association between the quality of institutions and infant mortality rate and life expectancy at birth. This kind of relationship is also confirmed by Holmberg and Rothstein [50], whose analysis highlights a positive correlation between several variables of government quality (rule of law, corruption and government effectiveness) and life expectancy and a negative correlation with infant and maternal mortality rates. Narrowing the analysis of institutional quality to the specific aspect of corruption, several studies point out that low healthcare system performance in terms of efficiency, effectiveness and equity can be found in a highly corrupted context [46], with a consequent adverse impact on health outcomes such as life expectancy and mental health [1], general mortality and infant mortality [49].

Our contribution can be collocated in this strand of literature, introducing some elements of originality.

First, compared to the existing studies, our approach adopts a new measure of health status represented by a multidimensional composite indicator. This allows us to account for various dimensions of health, shifting the analysis to a more general level.

Second, we have identified only very few studies validating a significant relation between quality of institutions and observed or self-perceived physical health of people in the Italian context [42]. Therefore, this paper contributes to the literature assessing the relationship between institutional quality and a broad indicator of physical health in Italy, where healthcare is managed by region but subject to guidelines of the central government setting national targets both in terms of healthcare service provision and budget accountability.

The aim of our paper is thus to disentangle the socioeconomic context characteristics, the regional healthcare system resources (staff and beds) and the policy/institutional explanatory variables (compliance with the national standards and quality of local institutions) and to investigate how Italian regional population health is related to these macro-measures.

Many empirical analyses point out a positive association between socioeconomic factors, variously defined (income, education, wealth) and a wide range of health indicators (such as, for example, mortality, life expectancy and morbidity). This thesis has been supported by different perspectives of analysis. Most studies focus on the relationship between individual socioeconomic standing and health indicators within single countries [20, 60, 62, 63, 85, 88, 90]. Fewer studies have examined the relationship across countries. In this framework, a set of research is microdata-based and analyses the relationship between individual socioeconomic variables and health showing that people with more economic resources (typically income and wealth) tend to be healthier than people with less resources in a comparative cross-national perspective [7, 13, 69, 82]. A further body of studies is represented by cross-national or cross-regional macro-level ecological analysis investigating the relationship between structural characteristics of territorial areas (nations, macro-areas, regions, counties) and health indicators. Such socio-ecological approach basically highlights that better health characterizes societies with better socioeconomic context and more egalitarian distribution of income [8, 24, 78].

Our analysis is carried out at Italian regional level with macro-variables, and it draws in the last research approach.

We use a cross-sectional sample of 21 Italian territorial units (19 regions plus 2 autonomous provinces, Bozen and Trento)Footnote 1 observed for 9 years (2011–2019) to test the empirical relationship between health population and institutional quality at regional level also controlling for local socioeconomic variables and healthcare service features. The regional-level analysis is particularly interesting for Italy for at least two reasons: first, Italy has a fairly pronounced disparity in socioeconomic context (in terms of income, income inequality, education) and performance of institutions between regions and macro-areas; second, while some guidelines and targets of health policy are determined by the central government, the management of healthcare is entrusted to the regions.

We find that at Italian regional level, higher institutional quality is associated with higher health status of the population. Moreover, regional socioeconomic factors appear not to be a relevant driver for overall population health when local institutions are well-functioning and regions present a higher level of compliance with national standards in terms of public healthcare services.

The paper is organized as follows. In Sect. Institutions and health: an overview, we provide an overview of the literature on the relationship between institutions and health and state our research hypotheses. In Sect. Assessing health: The Regional Health Status Indicator, we introduce the Regional Health Status Indicator and its computation methodology. Section Data and variables describes the data and the variables used in the empirical analysis. Section Empirical strategy introduces the empirical strategy and Sect. Empirical results discusses the results. Finally, Sect. Conclusions concludes the paper.

Institutions and health: an overview

A long tradition in the socioeconomics literature focuses on the positive relationship between socioeconomic status and health. From this perspective, population health status is related to socioeconomic dimensions as income, wealth, education, occupation, gender, and ethnicity such that people lower in the social hierarchy have poorer health than people higher in the social hierarchy (for a review [30, 68]). However, several studies adopt a wider perspective of analysis by examining the relationship between health and institutions [47, 61]. One of the most important contributions on the theory of institutions comes from Douglas North who defines the institutions as “the humanly devised constraints that structure political, economic and social interaction” [75] p.97). They consist of both formal and informal rules corresponding to formal and informal institutions. Formal institutions include the written constitution, laws, policies, rights and regulations enforced by official authorities and influencing individual well-being [18],informal institutions are usually unwritten norms of behaviour, codes of conduct, customs, conventions that shape thought and behaviour[17, 66].Footnote 2 Formal institutions are easier to identify because they are based on codified rules that define a framework within which human interaction takes place, informal institutions are typically not codified and harder to observe and classify.Footnote 3

Given the heterogeneity of institutions operating within a society, the interaction between institutions and health is a complex issue that has been addressed by researchers and policy makers not only at national but also at international [48] and European level [83].

Against this background, a field of socioeconomic studies consider the relationship between the health and the welfare state regimes by conceiving the latter as a set of formal institutions- or formal rules-governing the distribution of resources and opportunities among citizens [56, 71]. Health institutions are included in this general scope and concern the formal rules affecting the opportunities of care as the establishment of a minimum standard of care for citizens, the access to the healthcare services and the extent of co-payment for treatments, the location and the organization of health facilities etc.Footnote 4 Beckfield et al. [14] propose a theoretical framework that emphasizes the role of the welfare state institutions in distributing population health. They argue that welfare states can stratify health through two macro-channels.Footnote 5 The first one concerns health institutions. They present different features across welfare state regimes (rules concerning the financing of healthcare and their decommodificationFootnote 6 degree, a minimum level of prevention and healthcare, the spatial localization of health facilities, the regulation of private providers etc.), thus distributing health differently. A second way is related to the distribution policies for income and other valuable goods (employment security, minimum wage, wage replacement rate, pensions, working security, housing, education) that vary by welfare state regime and affect the social determinants of health.

In a different perspective of analysis and looking at population health level indicators, Beckfield and Bambra [15] implement a time-series cross-section analysis from 1970 to 2010 for the US and 17 other high-income countries to assess the association between generosity of welfare state institutions (for unemployment insurance, sickness benefits, and pensions) and life expectancy. Their analysis provides evidence of a strong role for social policy shortcomings in explaining the negative gap of life expectancy for US with respect to other rich democracies. In the same strand of research, Jacques and Noël [53] observe a negative relationship between welfare state decommodification and the age-standardized death rate for 21 OECD countries from 1971 to 2010. Their findings confirm that social programmes providing better social protection and making individuals less dependent form the market, are associated to healthier lives.

Moving on to consider other formal institutions, law and the justice sector also play an important role for citizens’ health. Environmental protecting laws, as well as the norms prohibiting the marketing of harmful foods impact on the health of individuals. In a broader perspective, Dingake [37] highlights that only a well-functioning rule of law makes effective the health right and the healthcare access. Access to justice, whether to courts, alternative dispute resolution mechanisms, or traditional justice systems can improve access to healthcare services, in particular for marginalized populations [37] p. 296). Well-functioning legal systems also provide control over contextual factors such as corruption, bureaucratic inefficiency, protection from crime and government accountability, that the literature has identified as significant for population health [61]. Using a cross-sectional sample of 185 countries, Achim et al. [1] show—for the period 2005–2017—that the level of corruption significantly affects physical health (measured as life expectancy and mortality) as well as mental health. Corruption determines the misappropriation of funds and medical equipment, making access to health services more difficult, reducing their effectiveness and leading to worse health outcomes [46]. Robinson and Keithley [79] provide evidence that widespread crime directly affects physical and psychological health of victims but also leads to a substantial increase of the medical care demand, placing additional pressure on services and diverting resources from other patients with potential negative effects in terms of population health outcomes.

The connection between health and the institutional framework also concerns the informal institutions. Informal institutions are basically identified with social capital which is considered a byproduct of social organization. As a multidimensional concept, social capital includes several dimensions: social and civic participation, political participation, trust relationships, perceived social support, sense of belonging. High levels of social capital have been reported to be associated with better health and a lower risk of mortality [51, 58, 59, 80]. The idea is that social capital contributes to mental well-being through a trusting environment or through the benefits of socializing,it also improves physical health through the diffusion of information on the effectiveness of healthcare services and on health behaviours, promoting the mutual assistance, the sense of responsibility and thus reducing health-risky behaviours [43].

Nieminen et al. [72] analyses the associations between individual-level social capital and all-cause mortality among working-age and ageing people in Finland for the period 2000–2009, also controlling for socio-demographic, behavioural and biological factors. They find that the mortality rate was smaller among people with higher social participation activities compared with those socially inactive. Fiorillo and Sabatini [42] provide an empirical assessment of the causal relationship between social capital and health in Italy, finding that individual structural social capital, as measured by the frequency of meetings with friends, is strongly and positively correlated with self-perceived health. The same results characterize the analysis of Yuan et al. [91] carried out for China. Using cross-sectional data from the China Family Panel Studies 2016, the authors find that social relationships (measured by gift income) and organization membership have positive effects on self-relate health, from a micro-perspective.

In this variegated strand of literature, our paper investigates, at regional level, the empirical relationship between a broad indicator of institutional quality, including elements of formal and informal institutions, and overall population health, also controlling for socioeconomic and healthcare system characteristics.

In particular, we state the following hypotheses:

  • Hp1: At regional level, higher institutional quality is associated with higher health status of the population.

  • Hp2: Regional socioeconomic factors do not affect overall population health when local institutions are well-functioning and regions present a higher level of compliance with national standards in terms of public healthcare services (i.e., prevention, use of medical care, vaccinations and other health treatments etc.).

Assessing health: the regional health status indicator

Our investigation into the relationship between institutional quality and health is based on a multidimensional composite indicator summarizing several components affecting the health status of the population.Footnote 7 We propose a Regional Health Status Indicator (RHSI) calculated at the local level for 21 territorial units for the period of 2011–2019. It is a combination of elements relating to both objective measures of health status and self-reported health.Footnote 8

The RHSI summarizes 21 elementary variables representing core aspects of both quantity and quality of life and health status concerning three domains: ‘life expectancy’, ‘mortality’ and ‘morbidity’. Figure 1 describes the variables included in each domain and the data sources.

Fig. 1
figure 1

Domains, variables and data sources of the RHSI

Of the 21 variables, 4 are related to qualitative aspects of health status (‘Life expectancy at birth in good health’, ‘Life expectancy at 65 without functional limitations’, ‘People aged 75 + with three or more chronic conditions’ and ‘People aged 65 + without functional limitations’),Footnote 9 and they are assessed with the self-reported measures of health included in the RHSI.

The remaining 17 variables are represented by the objective measures of health included in the RHSI: ‘Life expectancy at birth’ and ‘Standardized mortality rates’ that relate to 15 causes of clinically related deaths and 1 cause of death due to trauma, poisoning, homicide or suicide (classified as nonclinical deaths).Footnote 10

The choice of the variables included in the RHSI is literature driven. The socioeconomic literature largely focuses on life expectancy [29, 63] and/or general mortality [54, 67, 70] treated, in some analyses, together with infant mortality [76, 84]. A more specialist (medical) literature relies on specific components of health status as multi-morbidity [28]. Most clinical studies place emphasis on reducing mortality rates and increasing life expectancy, both in terms of the length of life and the number of healthy life years [16, 86]. Fanshel and Bush [40] emphasize the need for indicators based on morbidity, while Segovia et al. [81] include chronic diseases functional limitations in their analysis of self-assessed health, among other variables.

From a computational perspective, we basically follow the methodology proposed by international organizations (United Nations,Footnote 11 World Economic ForumFootnote 12) for the computation of composite multidimensional socioeconomic indicators and applied by the economic literature [2, 5, 36]. First, as mortality rates and the indicator ‘People aged 75 + with three or more chronic conditions’ are naturally negatively oriented (i.e., the higher the indicator is, the worse the health condition is), we transform these two variables by taking their complementary value to have all components of the final RHSI positively oriented (i.e., such that higher scores are associated with better health status).Footnote 13 After the transformation, the three domains can be newly defined as ‘Life expectancy’, ‘Survival’ and ‘No morbidity’. Then, to make comparison possible, each elementary variable is standardized by its national average:

$${x}_{v,j, i,t}^{\prime}=\frac{{x}_{v,j,i,t}}{{\overline{x} }_{v,j,t}}$$
(1)

where \({x}_{v,j,i,t}\) represents the value of elementary variable \({x}_{v}\) (v = 1, …, 21) in domain j (j = 1, …, 3) for region i (i = 1, …, 21) at time t (t = 2011, …, 2019).

Finally, the last step of the computation is the aggregation of the variables and domains. As in Di Bella et al. [36], we apply the following aggregation rules:

unweighted arithmetic mean of the \({x}_{v,j, i,t}^{\prime}\) variables within each domain j:

$${I}_{j,i,t}=\frac{{\sum }_{v=1}^{n}{x}_{v,j, i,t}^{\prime}}{n}$$
(2)

where \({I}_{j,i,t}\) is the synthetic measure of domain j (j = life_exp, surv, no_morb) for region i at time t and n represents the number of variables included in each domain;

unweighted geometric meanFootnote 14 of the \({I}_{j,i,t}\) measures in the final \({RSHI}_{i,t}\) for region i at time t:

$${RSHI}_{i,t}={{(I}_{life\_exp,i,t}\cdot {I}_{surv,i,t}\cdot{I}_{no\_morb,i,t})}^\frac{1}{3}$$
(3)

under the hypothesis of assigning equal weights to the three components (\({I}_{life\_exp,i,t}\), \({I}_{surv,i,t}\) and \({I}_{no\_morb,i,t}\)) of the final \({RSHI}_{i,t}.\) Table 5 in Appendix A provides the RHSI values for 2011–2019.

Data and variables

Our data are longitudinal, available annually for a period of 9 years (2011–2019) and for 21 territorial units with a total of 189 observations.

The main source of our data is ISTAT through the operating system Health for All-Italia, the data warehouse (http://dati.istat.it/) and the Benessere Equo e Sostenibile (BES) project.Footnote 15 Other sources are the Ministry of Health and the Institutional Quality Index (IQI) dataset by Nifo and Vecchione [73], updated up to 2019.Footnote 16

A description of the variables used in the empirical analysis is reported in Table 1.

Table 1 Variables description

Dependent variable

Our dependent variable is the RHSI calculated at the regional level and introduced in Sect. Assessing health: The Regional Health Status Indicator. Figure 2 represents the RHSI trend over time (2011–2019) by macro-areas (northern, central and southern Italy). Territorial heterogeneity emerges across the macro-areas of the country, with the highest levels of the indicator for the northern regions and the lowest values for the southern regions. The central regions are in an intermediate position. However, the RHSI is rather constant over time. The standard deviation is approximately 0.006 for each of the macro-areas, meaning that data are clustered around the mean.

Fig. 2
figure 2

RHSI over time (2011–2019) by macro-area

The territorial disaggregated analysis (Figs. 3, Panels A, B, C) also shows a certain heterogeneity among territories. In all three-year periods, the lowest value characterizes southern regions (Calabria, Campania and Sicily for 2011–2013; Calabria, Sardinia and Sicily for 2014–2016; Basilicata, Calabria and Sicily for 2017–2019), while the autonomous provinces of Trentino-Alto Adige record the highest level (Aosta Valley, Bozen and Trento for 2011–2013 and 2017–2019; Bozen, Piedmont and Trento for 2014–2016).

Fig. 3
figure 3

RHSI over time (2011–2013, 2014–2016, 2017–2019) by region

Nonetheless, the comparison of the RHSI for 2011 and 2019 (Fig. 4) shows that among the regions facing an improvement in the indicator (those placed below and to the right of the diagonal), some southern regions, such as Campania, Sardinia and Calabria, recorded the largest increase.

Fig. 4
figure 4

Regional comparison between RHSI for 2019 vs. 2011

Explanatory variables

Explanatory variables used in the empirical analysis are divided into four categories: the socioeconomic component, the healthcare system, the quality of institutions and instrumental variables.

The socioeconomic component

The empirical evidence supports a positive association between good health outcomes and socioeconomic condition. The latter is an extended and multidimensional concept that is not only dependent on the availability of material resources (typically income and wealth) but also employment condition, cultural-behavioural factors (education, behavioural habits as drinking, smoking, sedentary lifestyle, diet) and other elements (gender, ethnicity) that give a better picture of social context also highlighting the possible exposure to social exclusion.Footnote 17

Regarding the Italian context, Franzini and Giannoni [44] point out that populations living in regions with more poverty, more unemployment, more income inequality are more likely to report poor health. These results are supported by other empirical studies. In particular, Landi et al. [64] find evidence of a negative correlation between the socioeconomic context and waiting times for Italian National Healthcare Service (NHS) services. Their analysis suggests that, despite the Italian NHS statutory obligation to provide equal access according to needs to all Italian citizens, some population groups (low educated, less wealthy and low skill workers) are more vulnerable than others to experiencing excessive waiting times for healthcare services. Such a negative feature can induce patients with higher income and ability to pay to address their demand towards private healthcare services thus generating, in the Italian context, a pro-rich inequity in the access and use of healthcare services [26].

Against this background and to provide an in-depth picture of the socioeconomic regional Italian context, we consider a broad set of variables (described in Table 1) that, in our opinion, allows us to better delineate the economic characteristics of the regional context and to take into account some elements of social inclusion (or exclusion). To this end, in addition to variables widely used in the literature and accounting for the economic dimension (per capita income), the labour market performance (unemployment) and cultural features (education), we also control for the degree of differentiation of social status (income inequality),Footnote 18 demographic characteristics (population over 65)Footnote 19 and behavioural features (drinking habits).Footnote 20

The healthcare system variables

A well-established part of the economic literature associates health outcomes with a production process carried out by the healthcare system through the use of productive factors such as capital and labour (among others, for Italy, [22, 31]).

From this perspective, we consider staff employed in public healthcare facilities and beds in public and private accredited facilities as explanatory variables in our analysis. Both can be considered proxies for labour and capital inputs financed by public resources. The variable ‘Staff’ includes both medical (doctors and dentists and nursing staff) and nonmedical staff (technical health personnel and rehabilitation staff) only employed in public healthcare facilities and paid by the public sector, while the variable ‘Beds’ refers to ordinary beds (i.e. acute care, long-term care and rehabilitation beds) employed in either public or private accredited facilities, as both types of beds are financed by the public sector in the Italian NHS.

In addition, we also consider some other elements introduced by the reforms that have interested the Italian NHS over the years since 1978.Footnote 21 Originally a vertically integrated system of production and delivery, the Italian NHS has exhibited a progressive decentralization process over time. Regions were given greater power in the administration and organization of healthcare services in exchange for their acceptance of tighter budget constraints on healthcare expenditure. Many analyses have been devoted to assessing the effects of such reforms in terms of public healthcare expenditure [34], healthcare services provision [25] and citizens’ well-being [21, 77].

The 2001 constitutional reform introduced Livelli Essenziali di Assistenza (LEA), i.e. a set of public healthcare services to be provided and guaranteed to all citizens, either free of charge or on payment of a cost-sharing fee (co-payment), with public resources collected through general taxation. A national fund was established to provide the necessary resources to the regions to deliver the LEA. Any care provided above the LEA had to be funded through the regional budget. However, soon after the constitutional reform, some regions, due to weak managerial capacity and poorer government accountability, failed to reach the set goals, and the regional health budgets quickly ran into severe deficits. As a result, the central government had to adopt strict controls on regional healthcare spending to monitor and contain regional budgets. If the regional budget deficit exceeded 5% of total funding, regions formally committed themselves to designing an industrial reorganization programme and implementing a financial recovery plan, known as the Piano di Rientro (PdR) programme. Under the PdR, regions must identify the inefficient areas causing the deficits and take appropriate measures to recover from them. Within this framework, empirical evidence shows systematic regional heterogeneity both in the management of the budget and in the performance of the provision of public healthcare services. To account for these features of the Italian NHS, we include in the analysis the dummy variable ‘PdR’, which equals 1 if the region is subject to a financial recovery plan, zero otherwise, and the variable ‘LEA’ which measures regional degree of compliance with a nationally set target in terms of adequate level of public healthcare services. More explicitly, a region can be classified as: compliant (when the LEA score is > 160) or compliant with reserve (when the LEA score is > 130 but < 160) when public healthcare services are adequately provided, and critical (i.e., not compliant when the LEA score is < 130) when public healthcare services are not adequately provided.Footnote 22

Variables measuring the quality of institutions

We use the IQI designed by Nifo and Vecchione [73] as a proxy for institutional quality at the local level. The structure of the IQI is inspired by the World Governance Indicator (WGI) proposed by Kaufmann et al. [57], and it is aligned with other initiatives for broader contexts, such as the European Quality of Government index (EQI) by Charron et al. [23]. It is based on data from ISTAT and other national research institutes and it is designed on five dimensions: (1) civic engagement, social cooperation, political participation and cultural liveliness (labelled Voice and Accountability), (2) quality of public services and policies in terms of public expenditure, waste management and environment policies formulated and implemented by the local government (labelled Government Effectiveness); (3) the rule of law measured in terms of crime against persons or property, magistrate productivity, trial times, tax evasion and shadow economy (labelled Rule of Law); (4) the degree of corruption as crimes committed against the Public Administration (labelled Control and Corruption) and (5) the ability of local government to promote policies fostering firms (labelled Regulatory Quality). The IQI is computed in such a way that higher values are associated with higher institutional quality.Footnote 23

The overall IQI includes both elements of formal (from dimension two to five) and informal institutions (first dimension), as defined in Sect. Institutions and health: an overview.

Our choice of the IQI as an indicator of local institutional quality is corroborated by the economic literature that extensively employs it in various contexts of analysis [3, 33, 35, 38, 41],Footnote 24 while the choice of ‘Government effectiveness’ and ‘Rule of law’ components stems from the intention to focus exclusively on two relevant aspects for population health:Footnote 25 the ability of a government to guarantee the effectiveness of some public policies (public expenditure, waste management and environment policies) and the ability to provide an efficient legal system and compliance with the laws.

Figure 5 shows that the RHSI is positively correlated to the IQI and the components ‘Government effectiveness’ and ‘Rule of law’. The Pearson correlation coefficients confirm the statistical significance of the correlations (Table 2). In particular, the data reveal values of correlation coefficients between the health status and IQI or its component ‘Rule of law’ higher than 0.8, while the correlation between RHSI and ‘Government effectiveness’ is much lower (0.4) but still significant.

Fig. 5
figure 5

RHSI and Institutional Quality (average values 2011–2019)

Table 2 Pairwise correlation coefficients

Instrumental variables

The Instrumental Variable (IV) ‘Good transports’ is defined as the percentage of people declaring to be satisfied with public transports. This variable is created starting from the number of people who use public transports (bus/trolley bus/tram) and are very much and quite satisfied with: (1) timetables; (2) punctuality; (3) possibility of finding a seat; (4) speed of the bus; (5) cleanliness of the cars; (6) comfort of bus stops; (6) connectivity with other municipalities; (7) convenience of timetables; and (8) fares. These data are from ISTAT and are expressed as number of people (aged 14 and over) among 100 people with the same characteristics (basically percentages) at regional level. We obtain the IV ‘Good transports’ by computing the unweighted arithmetic mean of the quantities labelled with numbers (1) to (8).

The other IV ‘Clean streets’ is the percentage of households declaring to live in an area where streets are very much or quite clean. These data are from ISTAT and are expressed as number of households among 100 households with the same characteristics (basically percentages) at regional level.Footnote 26

Several reasons address our interest towards these variables. First of all, they are widely used in literature as proxies for the efficiency and the effectiveness of the public services [19, 27] testing, in an indirect way, the quality of the formal institutions governing them. Secondly they are also correlated to the civic engagement of citizens (especially for the ‘Clean streets’ variable) representing a component of informal institutions.

The validity and power of these instruments is proved in Sect. Empirical results in which we report and discuss the Angrist–Pischke test of excluded instruments, the underidentification and the weak identification tests the weak-instrument-robust inference tests and the Hansen J statistic for overidentification of all instruments.

Descriptive statistics

The descriptive statistics are reported in Table 3. On average, the RHSI is rather stable over time, as already discussed above with the support of graphs (see Figs. 2 and 3). Regarding the variables of the regional context, on average, per capita gross disposable income is almost 18,000 euros, with northern regions richer (20,000 euros) than southern regions (14,000 euros). On average, 60% of the individuals aged 25–64 have received at least a secondary school certification, with peaks of 65–70% in Lazio (centre) and Trento (north) but also very low percentages (less than 50%) in Apulia, Sicily and Sardinia (south). On average, 11 individuals out of 100 with the same characteristics are unemployed, with peaks of almost 23 in Calabria and Sicily (south). On average, income inequality, measured by the ratio between the income received by the richest 20% of the population and the income received by the poorest 20% of the population, is equal to 5, meaning that for every rich individual at the top of the income distribution, there are 5 poor individuals at the bottom of the distribution, with peaks of 10 in Campania and Sicily (south) and generally values higher than average in all southern regions. On average, 18 individuals out of 100 with the same characteristics may present drinking habits, with peaks of more than 20 individuals, especially in northern regions. Twenty-two percent of the total population is aged 65 + , with Liguria (north) being the oldest region: on average, almost 28% of the total population is classified as old. Regarding healthcare system variables, on average per 10,000 inhabitants, there are 95 doctors and dentists, nursing staff, technical health personnel and rehabilitation staff employed in public healthcare facilities and almost 33 hospital beds. Between 2011 and 2016, eight regions were subject to a financial restructuring programme (PdR), namely, Abruzzo, Apulia, Calabria, Campania, Lazio, Molise, Piedmont and Sicily, while from 2017 onwards, Piedmont exited the programme, having successfully recovered. Despite being on average fully compliant with LEA (173 scores), compliance is not stable over time, and many regions become complaint with reserve or critical over the period of 2011–2019, as also proven by the high standard deviation (30). Regarding quality of institutions variables, perception concerning law enforcement is generally higher than endowment of social and economic structures and of the administrative capability of regional governments in terms of health policies, waste management and environment (0.588 vs. 0.402), while overall quality of public institutions at the local level (including elements of formal and informal institutions) is higher (0.607). Finally, we report descriptive statistics of IVs: on average, around 57 out of 100 people with the same characteristics are satisfied with public transport, while more than 70 out of 100 households with the same characteristics declare to live in areas where the streets are clean.

Table 3 Summary statistics. Italy, 2011–2019

Empirical strategy

Since multidimensional and socioeconomic indices, based on social and economic determinants, should be used in ecological setting [4], we use the principal component analysis (PCA), as suggested by Friesen et al. [45], applied to our variables of regional context also to reduce data dimensionality and overcome potential multicollinearity issues.

The basic idea of PCA is to describe the variation of a multivariate dataset through uncorrelated linear combinations of the original variables. It is a technique that reduces the number of variables involved in the analysis and is thus widely used to summarize multiple indicators of socioeconomic contexts. Generally, the first few components represent most of the variation in the original dataset. We include in our empirical analysis the first principal component -the only one with an eigenvalue greater than one (3.8)- explaining around 65% of the variation of the six original variables. It can be interpreted as a proxy of socioeconomic well-being. We also ran the Kaise-Meyer-Olkin (KMO) test to check the overall consistency of the PCA in representing our data. The test gives an overall KMO value of 0.79 with partial values for each variable greater than 0.77. Given the overall PCA validation threshold of 0.6 [55], we conclude that the method can be applied to the original data without information loss. Detailed results on PCA are reported in Appendix B.

The fitted scores of the first component of the PCA, are then used in the second part of the empirical analysis in which we estimate the effects of institutions on the RHSI, controlling for healthcare system productive factors and policy variables.

The baseline model specification has the following form:

$${y}_{i,t}={\beta }_{0}+{\beta }_{1}{SE}_{it}+{\beta }_{2}{HS}_{it}+{\beta }_{3}{HP}_{it}+{\beta }_{4}{INST}_{it}+{\gamma }_{i}+{\varepsilon }_{it}$$
(4)

where \({y}_{i,t}\) is the RHSI observed in region i at year t, and \({\beta }_{1}\)\({\beta }_{4}\) are our coefficients of interest that capture the effect of change in the explanatory variables on the health status for region i at year t. In particular, \({SE}_{it}\) refers to the socioeconomic component, measured by the fitted scores of the first component of the PCA; \({HS}_{it}\) refers to the healthcare system variables, i.e. personnel employed in public healthcare facilities and hospital beds; \({HP}_{it}\) refers to the healthcare system policy variables, i.e. PdR dummy and LEA score; and \({INST}_{it}\) represents the variables measuring the quality of the institutions, i.e. the IQI and its components ‘Government effectiveness’ and ‘Rule of law’. Finally, \({\gamma }_{i}\) is a discrete variable taking values 1 to 21 and identifying the regions, and \({\varepsilon }_{it}\) is the overall error term for region i at year t.

The empirical specification (4) is first estimated using the pooled OLS estimator with robust standard errors clustered at the regional level. However, due to the heteroskedasticity issue and to choose the most efficient estimation strategy, we perform both the Breusch‒Pagan Lagrange multiplier test and the Hausman test. In particular, the Hausman test reveals that the random effect estimation model can be considered appropriate (\(Prob>{\chi }^{2}=0.5\)). Random effects (RE) models, however, could suffer from cross-sectional dependence in the errors caused by possible common unobserved factors [32]. To overcome this problem, given that the number of groups (21) is greater than the number of time periods (9), we improve our empirical strategy by applying the panel-corrected standard error (PCSE) technique, which assumes heteroskedastic errors across groups [12].

However, as regional healthcare systems may be under greater pressure when the population is less healthy, leading to a deterioration in the quality of institutions, this potential reverse causal relationship between RHSI and IQI and its components may be a source of possible endogeneity.

We thus combine an IV approach with a two-way RE model to control for these potential problems and provide consistent estimates for Eq. (4). The IVs must be directly correlated with variables IQI and its components but uncorrelated with the dependent variable RHSI. The variables that we adopt, the percentage of people satisfied with local public transports (GT) and the percentage of families satisfied with cleanness of the streets (CS), have these characteristics. They indirectly measure the quality of institutions of the region -as the IQI (and its components) is conceived and calculated- through the ability of local authorities to efficiently and effectively meet the needs of the local population (GT) and also through the civic engagement of the citizens (CS). Formally, we estimate the following first-stage equation

$${INST}_{it}={\alpha }_{0}+{\alpha }_{1}{IV}_{it-1}+{\alpha }_{2}{Z}_{it}+{\varepsilon }_{it}$$
(5)

where INSTi,t represents the institutional variables, either IQI or its components ‘Government effectiveness’ and ‘Rule of law’, for region i at year t; IVi,t−1 represents the instrumental variables, either CS (when Eq. (5) refers to IQI, Column (13) of Table 4) or CS and GT (when Eq. (5) refers to ‘Government effectiveness’ and ‘Rule of law’, Column (14) of Table 4), for region i at year t−1;Footnote 27Zi,t refers to the set of area-level characteristics as discussed above; and \({\varepsilon }_{it}\) is the error term randomly distributed. We consider the lagged value of IVs to generate exogenous variation in variables IQI, ‘Government effectiveness’ and ‘Rule of law’ (INST). The validity and power of the instruments is tested through the following validation tests: the Angrist–Pischke test of excluded instruments, the under identification and the weak identification tests, with cutoffs provided by Stock and Yogo [87], the weak-instrument-robust inference tests and the Hansen J statistic for overidentification of all instruments.

Table 4 Empirical results, 2011–2019

The predicted values of the institutional variables (IQI, ‘Government effectiveness’ and ‘Rule of law’) derived from (5) (\(\widehat{INST}\)) are then used in the second-stage regression of the Eq. (4).

Empirical results

We run our regressions on a balanced panel of 21 territorial units over the period of 2011–2019. The results are reported in Table 4, in which columns (1–8) present OLS estimation with controls introduced by blocks.

In Column (1), we simply focus on the effect of the socioeconomic component. The socioeconomic component is positively correlated with health status. This result is in line with the literature on the effect of socioeconomic status on health status [44].

In Columns (2) and (3), we add variables describing the national healthcare system. Both personnel employed in public healthcare facilities and ordinary hospital beds have a positive effect on RHSI, in line with the economic literature [10] modelling the healthcare system as a “production system” using capital (‘Beds’) and labour (‘Staff’) as inputs.

In Column (4), we also account for policy variables. The coefficient of the PdR variable is positive and significant. However, this finding requires a cautious interpretation. The empirical literature on this issue highlights that the impact of PdR programmes on citizens’ health is quite sensitive to the well-being or health indicator adopted, and some results appear to be contradictory [77]. In this complex framework, we are aware that our findings may be dependent on the composite health indicator used and that more in-depth analyses are needed to investigate in detail the impact of financial recovery plans on disaggregated health status measures.Footnote 28 This is not, of course, the focus of our research, which aims to evaluate instead the relationship between institutional quality and health. Therefore, PdR variable is introduced as a mere control, and in our analysis, it captures the possible effect of rationing inefficient healthcare spending on the health status. The results on the LEA variable indicate that the more compliant the region is with the national healthcare targets, the more adequate the level of public healthcare services is and the higher the RHSI [21].

Finally, in Columns (5–8), we also introduce variables measuring the quality of the institutions. We first look at the effect of the overall quality index (IQI): the higher the quality of institutions is, the greater the effect on health status is. We then use IQI components as a robustness check of this positive correlation between quality of the institutions and health. In particular, we focus on the ‘Government effectiveness’ component, including the endowment of social and economic facilities in Italian regions and the administrative capability of regional governments in terms of health policies (the regional healthcare deficit), waste management and environment factors affecting health (i.e., the separate waste collection and the urban environment index including dimensions related to air quality, water quality, public parks etc.). We also consider the ‘Rule of law’ component containing elements of formal institutions (ruled by law and regulation) and summarizing data on crime, activities of the magistracy, tax evasion and submerged economy. In all cases, we find that the quality of the institutions is highly significant with a positive sign, meaning that the RHSI increases with increasing quality of regional institutions.

Our results confirm the first hypothesis set in Sect. Institutions and health: an overview (Hp1: At Italian regional level, higher institutional quality is associated with higher health status of the population.), even when we introduce a variable identifying the region: health worsens as we move from the north to the south of the country. Our results also confirm the second hypothesis (Hp2: Regional socioeconomic factors do not affect overall population health when local institutions are well-functioning and regions present a higher level of compliance with national standards in terms of public healthcare services.): when introducing institutional quality variables, the socioeconomic component loses significance. This is probably because when the regional provision of public health services corresponds to national standards and the institutions are well-functioning and guarantee their effectiveness, the health needs of the target population are better met, regardless of socioeconomic background.

Results are also confirmed using the RE estimator (Columns (9) and (10)) and the PCSE regression (Columns (11) and (12)).

The last two columns ((13) and (14)) show the estimations of the IVs approach. We only comment here the first-stage results and tests of the IV-RE model, reported at the bottom of Table 4, as second-stage results are similar to those reported in Columns (9) and (10).

We expect that an increase in the share of families declaring to live in areas where streets are clean (CS) and an increase in the percentage of people satisfied with public transport (GT) show a positive significant correlation with the quality of institutions (IQI, ‘Government effectiveness’ and ‘Rule of law’).

In the first stage all instruments are statistically significant and with the expected sign. The IV CS fall within the 1% confidence level, while the IV GT is within the 5% level.

The Angrist–Pischke test of excluded instruments is passed for all institutional variables meaning that the excluded instruments do not influence the dependent variable. Since the p-values of the Kleibergen-Paap test are very low (for both the Wald and LM statistics), the under-identification hypothesis is rejected: the model is identified. Besides being relevant, the instruments are also valid. The Sargan-Hansen test does not reject the null hypothesis that all instruments are exogenous and therefore the model is exactly identified. The strength of the instruments is verified through the F statistic of the Kleibergen-Paap weak identification test. Based on the critical values of Stock and Yogo [87],Footnote 29 the F statistic relative to Column (13) with a value of 8.22 is between the critical values of 15% and 20% (closer to 15% value), while the F statistic relative to Column (14) with a value of 8.59 is over the critical values of 10%. We can conclude that the relevance of our IVs is good in Column (13) and very good in Column (14). Last, according to the Anderson-Rubin Wald F and χ2 tests and the Stock-Wright LM statistic, the hypothesis of joint significance of endogenous regressors is rejected at 5–10% level.

Conclusions

This paper assesses the relationship between the health status of the Italian population measured at the regional level and quality of institutions.

Since in Europe health is a primary responsibility of the member states, not of the EU, which only provides a coordinated approach at both EU and global level, we present Italy as a case study to show the relevance of our hypothesis. We believe that the empirical setting proposed in our paper can be used to explain the role of institutions also in other federal/regional contexts characterised by unevenly distributed resources, inequalities, and non-homogeneous quality of institutions such as Austria, Germany, Belgium and/or by sub-national level of health management such as Finland, Spain, Sweden. Moreover, the issue of institutional quality and health outcomes might be particularly relevant for the international socio-political debate in which the question of the variables affecting health and possible policy measures are a growing concern, also considering the Sustainable Development Goal 3 of the 2030 Agenda (Ensure healthy lives and promoting well-being for all at all ages).

The analysis departs from the construction of a multidimensional composite indicator of the health status, built on the combination of elements relating to both objective measures of health status and self-reported health, and then moves to investigating the role of socioeconomic factors, healthcare facilities, health policies and quality of institutions on such multidimensional indicator.

Our analysis shows that institutional quality is an important driver of population health status in the Italian regional context. In particular, the analysis points out that when local institutions are well-functioning and regions present a higher level of compliance with national standards in terms of public healthcare services, the role of socioeconomic factors becomes less important. In other words, when institutions are efficient and effective and the regional healthcare services provision is coherent with national guidelines, then socioeconomic differences become secondary drivers for health.

Our findings suggest that more effort should be made to increase the quality of local institutions in regions where this is lower. Such an increase can be achieved, for example, by investing in the quality of human capital engaged in local public administrations and by training and focusing on more educated and skilled personnel.

Policymakers should therefore rethink the institutional agenda on health disparities and set investments in intersectionality, i.e., on an integrated ground to consider the multiple factors involved in shaping health conditions, from individual socioeconomic characteristics and social position to the role of institutions. Thus, investing in the quality of regional institutions and ensuring regional compliance with the national targets could be a further policy instrument to fight inequalities from a different angle.

Our analysis could be further delved into, and it calls for future research. A possible extension of the analysis concerns the investigation of the role of public institutions in the COVID-19 era with the purpose of understanding whether the pandemic has affected the relationship between health status and institutional quality.