Introduction

For a long time, people have paid attention to economic growth but neglected the value of environmental resources, even at the expense of the environment to a certain extent, resulting in increasingly prominent environmental pollution problems. In the last decade, the cost of environmental pollution control in China has reached 10% of GDP (Liu et al. 2017; Zeng et al. 2019b). As the largest developing country, the serious air pollution problem faced by China has become one of the major threats to the health status of the residents (Guan et al. 2016; Hunter, 2020). Global Burden of Diseases, Injuries, and Risk Factors Study reveals that PM2.5 ranks sixth among the level 4 of mortality risk factors (Murray et al. 2020). Despite the remarkable results of the implementation of China’s “Air Pollution Prevention and Control Action Plan (2013–2017),” there are still 17 provinces where PM2.5 concentrations are not up to standard (Zheng and Xu, 2020). Numerous studies have confirmed that long-term exposure to PM2.5 pollution is associated with higher rates of COVID-19 transmission and mortality (Vasquez-Apestegui et al. 2021; Wu et al. 2020). Severe haze pollution is exposing people to major health risks. Moreover, the mobility of air makes the externality of pollution even more significant, and the regional pollution characteristics are obvious (Wang, 2014). It becomes very meaningful to consider the spatial relationship between the region and the surrounding areas when studying the environmental health effects. Therefore, the problem of air pollution in urban agglomerations is of great social relevance. It is necessary and urgent to deeply investigate the impact of PM2.5 pollution on public health in urban agglomerations.

As the most economically developed urban agglomerations in China, the Yangtze River DeltaFootnote 1 (YRD) (Fig. 1) is one of the most severely polluted regions in China, and its air quality has always been a concern. With the booming economy of the region, severe haze pollution threatens public health as a by-product (Liu et al. 2020). PM2.5, the primary air pollutant in the Yangtze River Delta region, decreased from an annual average concentration of 67 μg/m3 in 2013 to 46 μg/m3 in 2016, but it is still higher than the national standard, and the number of days exceeding the standard for PM2.5 in 2020 accounted for 50.7% of the total number of over-standard days (China, 2013; 2016; 2020). Long-term exposure to PM2.5 pollution caused 95,500 premature deaths per year in the Yangtze River Delta region from 2001 to 2017 (Lu et al. 2019). Therefore, severe haze pollution remains one of the urgent air pollution problems in the Yangtze River Delta urban agglomerations, and also one of the serious threats to the public health of residents. It is of great significance to study the impact of haze pollution on public health in the Yangtze River Delta urban agglomerations in China.

Research on the theme of environment and health has been paid more attention by the fields of medicine and environmental science. The economic loss caused by the impact of environmental pollution on health has also aroused the interest of policymakers and economists. The impact of the environment on health takes the health production function pioneered by Grossman (1972) as a starting point for research. Currently, studies are mainly carried out from the aspects of environmental health effects assessment, environmental health relationship, and environmental health differences. The assessment of environmental health effects includes two main aspects, namely the short-term (Chen et al. 2020b) and long-term (Lu et al., 2017) adverse health effects caused by air pollution and the economic losses caused by air pollution (Gao et al. 2015; Guan et al. 2019; Zheng et al. 2017). At the same time, a large number of scholars have focused on the relationship between the environment and health. Many studies have investigated the relationship between different air pollutants and related disease mortality using systematic review and meta-analysis (Fang et al. 2016; Lu et al. 2015; Shah et al. 2015; Shang et al. 2013) and the Poisson distribution model (Liang et al. 2021; Shi et al. 2016), and found a significant positive correlation between the two (Jo et al. 2021; Peng et al. 2008; Pope, 1999). However, the Poisson regression.

model is not rigorous in calculating the health effects of air pollutants, and the endogenous problems of air pollution may cause certain errors (Knittel et al. 2016; Lleras-Muney, 2010). Scholars have thus used methods such as two-stage least squares or three-stage least squares to solve the endogenous problems (Knittel et al. 2016; Zhang et al. 2020). Moreover, air pollution is fluid (Cheng et al. 2017), and ignoring the spillover effects of air pollution will lead to serious deviations in the estimation results. Accordingly, scholars have analyzed the relationship between air pollution and health from the spatial perspective (Peng et al. 2021; Sun et al. 2021). Chen et al. (2017) and Zeng et al. (2019a) explored the direct and indirect effects of air pollution and its spillover effects on public health by constructing the spatial Durbin model. With the spread of COVID-19 around the world, some scholars have turned their attention to the study on the relationship between COVID-19, air pollution, and public health (Nakada andUrban, 2020; Nie et al. 2021). Benmarhnia (2020) and Travaglio et al. (2021) suggest that short-term and long-term exposure to air pollution, especially particulate matter pollution such as PM2.5, may increase the risk of COVID-19 infection and mortality, and may increase vulnerability and have harmful effects on the prognosis of patients affected by COVID-19 infections (Ali and Islam, 2020). Iqbal et al. (2021) found that an average 1% increase in PM2.5 concentration resulted in an 11.67% increase in COVID-19 infections and an 18% increase in the number of COVID-19 deaths. In terms of research on regional differences in the health effects of air pollution, different scholars have conducted studies on different age groups, socio-economic factors, living environment, and other aspects, all of which found heterogeneity in the health performance of air pollution (Cakmak et al. 2016; Othman et al. 2014; Yang et al. 2021). Based on the Grossman health production function (Grossman, 1972), Feng et al. (2019) also found significant regional differences in the health effects of air pollution. Consequently, the heterogeneity of environmental health performance seems to revolve around individual differences (Moretti andNeidell, 2011), but in fact, it is closely related to factors such as economic development level and living environment.

It can be seen that a great deal of theoretical and empirical research has been done by predecessors, which has provided the basis and help for environmental governance in China and improvement of public health. However, there is still room for improvement and depth in the existing research. Previous multi-regional studies that have considered the mobility of air pollution have tended to focus on the provincial level, with fewer studies based on the urban scale. At the same time, the imbalance in economic development between regions and the huge differences between elements such as resource endowment, health care expenditure, air pollution, and policy systems make the issue of spatial and temporal heterogeneity of public health between regions impossible to ignore. In addition, existing studies tend to consider spatial dependence separately from spatial heterogeneity, and the research perspective is not yet comprehensive. Health is both the cornerstone of society and the result of social progress (Lancet, 2021). The impact of haze pollution on health deserves attention and discussion. Therefore, based on the fact that haze pollution has caused considerable damage to the health of the residents of the Yangtze River Delta urban agglomerations, the paper uses the macro health production function to comprehensively and systematically investigate the impact of haze pollution on public health in the Yangtze River Delta urban agglomerations from both the global and local perspectives. Moreover, through the construction of the spatial Durbin model and the panel spatiotemporal geographic weighted regression model, the paper analyzes the spillover effects of haze pollution in depth. In particular, the analysis of the temporal and spatial heterogeneity of environmental health between regions is not only a useful supplement to previous studies but also in line with the reality of the uneven distribution of regional economies and resources. The study found that haze pollution had a significant negative impact on the public health level of residents in the Yangtze River Delta urban agglomerations, with significant spillover effects. It also found that the urbanization rate, the number of health technicians, and the green space per capita all had a significant positive effect on the level of public health. Moreover, the spatial and temporal heterogeneity of the impact of haze pollution, economic, social, medical and health care, and living factors on public health was obvious. On this basis, we put forward more scientific and targeted policy insights for different cities in the Yangtze River Delta urban agglomerations, to solve the outstanding environmental problems of haze pollution endangering public health and to contribute to the promotion of the strategy named “Health China 2030.”

Material and methods

Methods

Health production function

Grossman (1972) constructed a health production function based on the health micro-demand model to illustrate the relationship between health input and output. The model takes into account the impact of factors such as lifestyle, income level, public goods consumption, education, time investment, personal endowment, and environmental endowment on health. Since then, numerous scholars have devoted themselves to the study of factors affecting health. Cropper et al. (1981), Gerking and Stanley (1986) introduced environmental pollution into the health production function and demonstrated the effect of pollution on the health depreciation rate. At the same time, considering China’s national conditions, and referring to the studies of Wang and Chang (2007), Zhao and Hou (2005), and Qu and Yan (2015), from a macro perspective, it is determined that the factors affecting the health of Chinese residents are economic, social, health care, life, and environmental aspects. Therefore, the health production function of this paper is set as follows:

$$H = F ( {Eco,Soc,Hc,Life,Envi})$$
(1)

where H stands for public health level and Eco, Soc, Hc, Life, and Envi represent economic, social, health care, life, and environmental factors respectively. Based on the selection of variable indicators, the following health production function expression is obtained in combination with Eq. (1).

$$lnH = ln\omega + \alpha_{1} lnPM_{2.5} + \alpha_{2} lnX_{control}$$
(2)

Among them, ω refers to the estimated value of the initial public health level, αi, is the elastic coefficient of each variable, \({\text{PM}}_{2.5}\) is the annual average concentration of environmental \({\text{PM}}_{2.5}\), and \({\mathrm{X}}_{\text{control}}\) represents a series of control variables Eq. (2).

Global spatial econometric model

Considering that air pollution does not exist independently, the mutual flow of air pollution will pose a certain threat to the health of residents in neighboring cities. Traditional econometrics does not consider the spatial correlation between subjects, and the estimation results may be biased. Therefore, this paper uses the global spatial econometric model to consider the impact of haze pollution on public health at the macro level. Different spatial econometric models are set up with different spatial transmission mechanisms, and their indicators represent different economic meanings. To make the research process more scientific and reasonable, this paper starts with the spatial Durbin model (SDM), which is a nested form of the spatial error model (SEM) and the spatial lag model (SLM), and its expression is displayed as follows:

$$lnH_{it} = \beta_{0} + \theta WlnH_{it} + \beta_{1} lnPM_{2.5it} + \beta_{2} lnX_{control} + \gamma_{1} WlnPM_{2.5it} + \gamma_{2} WlnX_{{\text{control}}} + a_{i} + \eta_{t} + \varepsilon_{it}$$
(3)

where Hit is the explanatory variable, indicating the public health level of the residents, and \({\mathrm{X}}_{\text{control}}\) represents economic, social, health care, and life factor variables related to health level. β0θ, and εit denote the constant term, the spatial autoregressive coefficient, and the independent and identically distributed random error term, respectively. \({\mathrm{a}}_{\text{i}}\) and ηi indicate spatial and temporal fixed effect respectively, in the meantime, and γi correspond to the estimated coefficients of spatial variables. W means the spatial weight, and this paper chooses the geographic Queen adjacency matrix for analysis.

Local spatial econometric model

The study of the impact of haze pollution on the public health of residents is still one-sided only from a global perspective. What cannot be ignored is the temporal and spatial non-stationarity of the data. Considering the issue of heterogeneity in the relationship between local point variables in public health level accounting, it is necessary to choose an empirical tool to describe the heterogeneity of time and space. Fan and Guo (2021) identified the inherent flaws in the traditional modeling process of the spatial–temporal geographic weighted regression model (GTWR) (Huang et al. 2010). By constructing an endogenous spatiotemporal weight matrix adapted to the local analysis of panel data spatial measurement, a holographic mapping-based spatiotemporal geographic weighted regression model capable of reflecting direct and indirect paths of influence effects between local points in space was established. As a result, this paper starts with the panel spatiotemporal geographic weighted regression model (PGTWR) to consider the spatiotemporal heterogeneity of the impact of haze pollution on the public health of residents, and its expression is applied as:

$$lnH ( {u_{j} ,v_{j} ,t_{j} } ) = \rho_{0} ( {u_{j} ,v_{j} ,t_{j} } ) + \rho_{1} ( {u_{j} ,v_{j} ,t_{j}}) lnPM_{2.5} + \rho_{2} ( {u_{j} ,v_{j} ,t_{j} }) lnX_{control} + \mu ( {u_{j} ,v_{j} ,t_{j} })$$
(4)

Among them, \(\text{H and X}_{\text{control}}\) have the same meaning as formula (3). \(({\mathrm{u}}_{\mathrm{j}}, {\mathrm{v}}_{\mathrm{j}}, {\mathrm{t}}_{\mathrm{j}})\) denotes a local point and \({\mathrm{u}}_{\mathrm{j}}\) is the longitude of local point j,\({\mathrm{v}}_{\mathrm{j}}\) is the latitude of local point j, and \({\mathrm{t}}_{\mathrm{j}}\) is the time of local point j. ρ0(uj,vj,tj), ρ1(uj, vj, tj), and ρ2(uj,vj,tj) are exogenous parameters at the specific spatial and temporal point (uj, vj, tj), ρ1(uj, vj, tj) and ρ2(uj,vj,tj) are the local effects of PM2.5 concentration and related control variables on the public health level, respectively. µ(uj, vj, tj) is the random disturbance term Eq. (4).

Spatial autocorrelation test model

The spatial autocorrelation test is mainly divided into spatial correlation test and spatial heterogeneity test. The spatial correlation of air pollution and public health between regions is reflected in the mutual influence of the PM2.5 concentration value and the number of deaths from respiratory diseases between adjacent regions. In addition, the spatial heterogeneity of air pollution and public health between regions is reflected in the lack of spatial homogeneity in PM2.5 concentration and the number of deaths from respiratory diseases. Therefore, the global Moran’s I and the local Moran’s I are selected for the correlation test in this paper. The global Moran’s I is used to test whether there is a spatial autocorrelation effect in socio-economic phenomena. The formula is as follows:

$${\text{Moran}} \prime {\mathrm{s}} \;I = \frac{{n\sum\limits_{i = 1}^{n} {\sum\limits_{j = 1}^{n} {W_{ij} (x_{i} - \bar{x})(x_{j} - \bar{x})} } }}{{s^{2} \sum\limits_{i = 1}^{n} {\sum\limits_{j = 1}^{n} {W_{ij} } } }}$$
(5)

where \(\overline{\mathrm{x}} {=} \frac{1}{{\mathrm{n}}}{\sum }_{\mathrm{i}=1}^{\mathrm{n}}{{\mathrm{x}}}_{\mathrm{i}}\), \({\mathrm{s}}^{2} {=} \frac{\sum_{\mathrm{i}=1}^{\mathrm{n}}({\mathrm{x}}_{\mathrm{i}}-\bar{\mathrm{x}})^{2}}{\mathrm{n}}\). n denotes the number of spatial administrative districts investigating the economic phenomenon, and the corresponding regional feature value is represented by \({\mathrm{X}}_{\mathrm{i}}\). \({\mathrm{W}}_{\text{ij}}\) is a spatial weight to measure the spatial correlation of economic phenomena. If the value of Moran’s I is in the range (0, 1), it means that the variable has a positive spatial correlation, and the more it converges to 1, the stronger the correlation is. Similarly, if Moran’s I takes a value in the range (− 1, 0), then the variable has a negative spatial correlation, and the closer to − 1, the stronger the negative correlation. If the value of Moran’s I is 0, it suggests that the variable is randomly distributed and there is no spatial effect.

The global spatial autocorrelation of socio-economic phenomena is measured by the global Moran’s I index. Since economic phenomena tend to be spatially unbalanced (Xu andLin, 2021), to reveal this imbalance, the use of local Moran’s I is apt to show spatial heterogeneity (Anselin, 1995). The formula is as follows:

$${\text{Local}} \;Moran\prime {\mathrm{s}} \;I = \frac{{(x_{i} - \bar{x})}}{{s^{2} }}\sum\limits_{j = 1}^{n} {W_{ij} (x_{j} - \bar{x})}$$
(6)

Among them, the indicators are the same as formula (5). In this paper, the geographic Queen adjacency matrix is selected for correlation analysis Eq. (5) and Eq. (6).

Data

In this paper, data from a sample of 26 cities in the Yangtze River Delta region of China from 2005 to 2018 is selected to analyze the impact of environmental and other factors on public health. Therefore, the variables that define each factor are shown in Appendix Table 1.

Selection of explained variables

The selection of health indicators in the health production function is crucial. At present, a large amount of literature selects indicators such as mortality, respiratory disease mortality, lung cancer mortality, and perinatal infant mortality to characterize public health. Studies have shown that respiratory diseases caused by haze pollution are long-term chronic diseases, and people with respiratory diseases are extremely susceptible to the effects of particulate matter (PM) (Manisalidis et al. 2020). PM2.5 can cause serious damage to multiple systems in the body, with a significant increase in the incidence of respiratory symptoms (Guo and Wei, 2013). Consequently, this paper chooses the number of deaths from respiratory diseases that is more representative of PM2.5 pollution as a proxy indicator of public health.

Selection of core explanatory variable

In this paper, the annual average PM2.5 concentration is selected as a proxy indicator for environmental factors and at the same time as the core explanatory variable. The main reasons are as follows: firstly, compared with other environmental factors, the air environment is more exogenous and overflowing, which reduces the endogenous problem to a certain extent. Secondly, the probability of the national population as a whole being exposed to the air environment is higher than in other environmental factors, which to some extent compensates for the possible shortcomings of using macro data (Lu and Qi, 2013). Thirdly, a large number of epidemiological studies have confirmed that haze pollution is harmful to public health, and PM2.5 is the primary pollutant in the frequent occurrence of atmospheric haze nationwide, showing typical regional characteristics (Zhou et al. 2017).

Selection of control variables

It is mainly considered from the four aspects of the economy, society, health care, and life. They are characterized by per capita disposable income of urban households (Miao, 2008), urbanization rate (Zhao, 2012), population density (Chen and Chen, 2014), number of health technicians per ten thousand people (Feng et al. 2019), per capita expenditure on health care in urban households (Sun et al. 2021), and green space per capita (Ruan et al. 2020).

Among them, the PM2.5 concentration comes from the PM2.5 concentration dataset obtained by the Atmospheric Composition Analysis Group of Dalhousie University in Canada (van Donkelaar et al. 2019). The remaining variables are obtained from “China Statistical Yearbook,” “China City Statistical Yearbook,” statistical yearbooks of various provinces and cities, statistical bulletins of national economic and social development of various cities, the official websites of statistical bureaus of various provinces and cities, “China Health Statistics Yearbook,” Jiangsu provincial Center for Disease Control and Prevention (jscdc.cn), and EPS database. After that, the scale model of the exposure–response relationship and Poisson distribution is used to calculate the number of deaths from respiratory diseases in the exposed population (Zhou et al., 2021). For indicator data that cannot be directly obtained, it is acquired by the conversion of the original data. Economic data have been adjusted with 2005 as the base period to eliminate the impact of price inflation. Also, all variables in this paper are taken as natural logarithms to control the influence of heteroscedasticity.

Results and discussion

Spatial autocorrelation test

Global spatial autocorrelation test

With the help of GeoDa software, the global Moran’s I index was used to explore the clustering or correlation characteristic of the spatial distribution of respiratory disease deaths and PM2.5 pollution, and the results are shown in Fig. 2. The test results show that the Moran's I index of public health indicator is positive, and passes the 10% significance test. The Moran’s I index over the years ranges from 0.110 to 0.236, indicating the spatial clustering of cities with a similar level of public health. On the other hand, the Moran’s I index for PM2.5 pollution is positive from 2005 to 2018 and passes the 1% significance test. The Moran’s I index fluctuated upwards between 0.585 and 0.763 from 2012 to 2018, implying that haze pollution presented the strong spatial clustering characteristic that continued to increase. That is, cities with more serious haze pollution tend to be spatially clustered, and cities with lighter haze pollution tend to be clustered in space, which is consistent with the research of Li et al. (2018). To sum up, the spatial correlation of haze pollution and public health are both positive, and the spatial clustering of haze pollution is relatively stronger.

Local spatial autocorrelation test

To further detect the local clustering of respiratory disease deaths and PM2.5 pollution, the local Moran’s I test was used. Using ArcGIS and GeoDa software, Fig. 3 demonstrates the concentration maps of 2005, 2011, and 2018. According to the local Moran’s I scatter plots, the local clustering characteristic of both respiratory disease deaths and PM2.5 pollution are dominated by high–high clustering and low–low clustering. In terms of public health, high–high clustering cities are located in Nantong, Suzhou, and Shanghai, and the clustering trend has not changed significantly during the study period. High-high clustering cities with PM2.5 pollution are mainly distributed in the northwest part of YRD such as Wuhu, Hefei, Nanjing, and other cities. It may be due to the diffusion of air pollution caused by the development of heavy industry in the region, which in turn causes the intensification of haze pollution in neighboring regions. Meanwhile, low-low clustering cities show an upward trend in the time dimension, with 4, 4, and 6 cities in 2005, 2011, and 2018, respectively, mainly in part of the southeast of the YRD such as Shaoxing, Jinhua, Ningbo, and Taizhou. It may depend on the fact that industrial activity is less developed and polluting enterprises are few and scattered, which is conducive to the formation of low-level PM2.5 pollution clustering areas. Therefore, the public health level and PM2.5 pollution are not randomly distributed among cities in the Yangtze River Delta urban agglomerations but are characterized by spatial correlation and spatial clustering.

Regression results without considering spatial feature

Before the parameter estimation, first of all, this paper set up the ordinary panel model for regression analysis without considering spatial effect. Then, the F test was performed, along with the classic LM test proposed by Anselin and Griffith (1988) and the robust LM test proposed by Anselin and Smirnov (1996). By judging the spatial correlation of lag or residual terms, the SEM was deemed more appropriate. The joint significance of spatial and temporal fixed effect was also examined and it confirmed that model with time and space fixed effect should be expanded through the likelihood ratio test result. Based on the estimation results in Appendix Table 2, it is found that the core explanatory variable PM2.5 pollution had a significant positive effect on the number of deaths from respiratory diseases. And the coefficient is between 1.3478 and 1.6933, which is relatively stable. The results of the estimation of the control variables show that the majority of the estimates pass the significance test. Among them, the number of health technicians per ten thousand people and the green space per capita both have significant positive effects on the deaths from respiratory diseases, with elasticity coefficients of − 0.5455 and − 0.0816 for dual fixed effect of time and space, respectively.

Spatial correlation study

Estimation and analysis of spatial Durbin model

The tests in “Regression results without considering spatial feature” section all point to SEM with dual fixed effect. However, according to LeSage and Pace (2009), even if the non-spatial model is rejected in favor of SLM or SEM based on the LM test, Wald and LR tests are still required and the SDM model is preferred. Thus, this paper used the SDM model under the dual fixed effect as a paradigm for Wald estimation and LR test to determine whether the SDM model can be degraded to a simple SEM model. The test results in Appendix Table 3 show that the null hypotheses of γ = 0 and γ + θβ = 0 are rejected for both the Wald and LR statistics, indicating that the SDM is more reasonable. Afterwards, the SDM model was also subjected to the Hausman test and the null hypothesis was significantly rejected, so the SDM model under fixed effect was chosen for analysis. According to the results of the spatial Durbin model with different fixed effect in Appendix Table 3, it is found that the statistical properties of the model with dual fixed effect in time and space are optimal. To obtain more accurate parameter estimation, the spatial Durbin model with the dual fixed effect (including error correction) was chosen for the estimation analysis, taking into account that Lee and Yu (2010b), Lee and Yu (2010a), and Lee and Yu (2014) pointed out that the parameters in the fixed effect model may cause estimation errors.

The effect of the core explanatory variable on public health

The impact of PM2.5 pollution on the number of deaths from respiratory diseases passed the 1% significance test and the elasticity coefficient was positive, indicating that the increase in haze pollution had a significant positive impact on the increase in the number of deaths from respiratory diseases among the population. The essential reason for this is still the serious air pollution problem in the region. For a long time, the coal-based energy structure in the Yangtze River Delta has not changed fundamentally, and the region's heavy industry accounts for a relatively large proportion, with the establishment of a large number of heavy industrial plants leading to a continuous decline in air quality(He et al., 2019). Coupled with the emission of vehicle exhaust, the aggravation of atmospheric pollution such as haze, acid rain, etc., poses a greater threat to the public health of residents, which greatly increases the risk of residents suffering from respiratory diseases(Chen et al. 2020a).

The effect of control variables on public health

The urbanization rate, the number of health technicians per ten thousand people, and the green space per capita all passed the 1% significance test, while other variables were not significant. In particular, the urbanization rate had a negative effect on the number of deaths from respiratory diseases, which is consistent with Wang and Chang (2007). The urban–rural gap has led to large differences in medical and health facilities between urban and rural areas, which in turn has a significant impact on the differences in public health between urban and rural residents. The increase in the number of health technicians per ten thousand people is conducive to the improvement of public health. The continuous improvement of medical equipment and the overall quality of health technicians have created a good treatment environment, thereby providing great convenience for patients to see and treat. Therefore, the increase of health technicians plays an important role in the improvement of public health. The effect of green space per capita on the number of deaths from respiratory diseases showed a significant negative effect, which is in line with the findings of Sun and Li (2017). Good environmental quality is not only favorable for purifying the air but also makes residents physically and mentally happy and helps to improve their health levels. The negative effect of per capita disposable income of urban households on the number of deaths from respiratory diseases was not significant. It may be on account of a series of unhealthy lifestyle habits caused by the growing economy and the rising standard of living, which has led to the decline in public health. Additionally, the positive effect of population density on the level of public health of the population was not significant. The agglomeration effect of population density was stronger than its scale effect, which facilitated local residents to make full use of health facilities and other services within the constraints of a certain health budget, thus exerting the positive externality effect of the agglomeration effect. In the meantime, per capita expenditure on health care in urban households had a negative but non-significant effect on the number of deaths from respiratory diseases. In the context of rapid economic development, despite increasing health care expenditure, the limited health care expenditure has made little improvement in health in the environment of scarce high-quality medical facilities and rising health care cost.

From the perspective of the spatial lag term, it characterized the impact of the explanatory variables of a certain city’s neighboring cities on the level of public health in the region. According to the statistical results in Appendix Table 3, the coefficients of PM2.5 pollution and green space per capita were significantly negative, and the coefficient of urbanization rate was significantly positive, while the spillover of other independent variables was not significant. It suggested that the interactive effect of pollution in urban agglomerations was beneficial in reducing the number of deaths from respiratory diseases caused by haze pollution among residents. The increase in the green area in neighboring areas also had a beneficial impact on the health of residents, indicating that the planting of green plants not only beautifies the environment but also purifies the air for human beings and creates a pleasant living environment. In addition, the rising in the urbanization rate in neighboring regions increased the burden on the health level of the residents, which may be related to the continuous generation of pollutants such as exhaust gases and sewage in the process of urbanization.

Decomposition of spatial effect

Given the presence of spatial spillover effects, the impact of each factor on the number of deaths from respiratory diseases was further decomposed and its direct, indirect, and total effects are shown in Table 4.

According to the results of direct effect, the impact of PM2.5 pollution on the health level of the residents was still significantly negative. At the same time, the coefficient of 1.5057 under the direct effect is inconsistent with its estimated coefficient of 1.5740, because the direct effect takes into account both the impact of PM2.5 pollution in the neighboring area on the public health of the region and the feedback effect of the transmission of PM2.5 pollution in the region on the health effect in neighboring regions back to the region. The urbanization rate, the number of health technicians per ten thousand people, and the green space per capita all had a significant negative effect on the number of deaths from respiratory diseases, while the other variables were not significant. Of these, the number of health technicians had the greatest impact, indicating that the training of medical and health personnel should be taken into account even more in conjunction with vigorously advancing the process of urbanization.

As a result of the indirect effect, PM2.5 concentration had a significant positive effect on the public health level of residents in neighboring cities. It means that the PM2.5 pollution in a certain city was aggravated, and then the air pollution in its neighboring cities was relatively reduced, which had a beneficial impact on the health of residents in neighboring cities. The rise in the number of health technicians and the green space per capita was good for the public health of the neighboring cities. It shows that the improvement of medical services and green areas in the region had not only improved the quality of life of residents in neighboring cities, but also attracted neighboring residents to seek medical treatment in the region, in the case of inadequate medical resources in neighboring cities, and ensured that people are treated for their illnesses. Additionally, the increase in the urbanization rate may inhibit the improvement in the health of residents in neighboring cities, while the effects of other factors were not significant. The reason for this may be that under the traditional urbanization development model, environmental problems such as heavy traffic and increased air pollution have adversely affected the health of residents in neighboring cities.

The results of the total effect show that the increase in PM2.5 concentration has a significant negative effect on public health. The number of health technicians per ten thousand people and the green space per capita both passed the 1% significance test for the positive impact of public health, while the effects of other factors were not significant.

Robustness test

In this study, the geographic Queen adjacency matrix was used to analyze the impact of PM2.5 pollution on the public health of residents. However, to avoid errors caused by the selection of the spatial weight matrix, a nested spatial weight matrix based on the Queen space nearest neighbor relations and economic scale was selected for analysis, taking into account economic factors and using GDP per capita as its proxy variable. The results of estimating the spatial spillover effects of variables such as PM2.5 concentration on the public health level of the residents under this spatial weight matrix with a dual fixed effect SDM model (including error correction) are further reported. It can be seen from Appendix Table 5 that the concentration of PM2.5 still presented a significant negative effect on public health. And yet the effect of per capita expenditure on health care in urban households was opposite to the above result, but its effect was not significant. Moreover, the other control variables only differ in the size of the overflow, the direction of the overflow does not change. On this ground, the robustness analysis results are consistent with the above results, indicating that the results are stable and reliable.

Spatial–temporal heterogeneity study

Estimation analysis of panel spatiotemporal geographic weighted regression model

To empirically demonstrate the temporal and spatial distribution of deaths from respiratory diseases under the influence of different environmental, economic, social, health care, and life factors, this paper constructed the PGTWR model to investigate the spatial and temporal heterogeneity of public health of cities in the Yangtze River Delta urban agglomerations. The variance inflation factor was also used to know that there was no multicollinearity in the indicators selected in this paper. Due to the weak diagnosis results of the spatiotemporal geographic weighted regression model, this paper chose to incorporate OLS and GWR models for analysis and to compare the accuracy of the results. The annual average of each indicator from 2005 to 2018 is selected as the new cross-sectional data for GWR analysis, and we perform Gaussian, exponential, and tricube analyses according to the different choices of the kernel function. As shown in Appendix Table 6, the adjusted goodness of fit of the panel spatiotemporal geographic weighted regression model is higher, indicating that the PGTWR model is more suitable for the dataset in this paper.

The PGTWR model was used and the corresponding standardized program was run through MATLAB R2018a. Based on the AICc criterion, the GCV criterion, and the RSS criterion, the optimal spatial bandwidth for this dataset was determined to be 26 and the optimal temporal bandwidth to be 14. It reveals that in the estimation of the public health level of the residents in the Yangtze River Delta urban agglomerations, the number of deaths from respiratory diseases in a given city is not only affected by the health of the residents from other cities, but also by the number of deaths from respiratory diseases in that city in previous years, with an effective influence period time of 14 years. Given the above determination of the optimal spatial and temporal bandwidths, the overall statistical properties of the PGTWR model are shown in Appendix Table 7. It is worth pointing out that this paper discusses the impact of haze pollution on public health in the Yangtze River Delta urban agglomerations, which incorporates all spatial units, so the PGTWR model with random effect is not considered. According to the results of the overall statistical properties of the models with different effects, the model with spatial fixed effect embodied good statistical properties. The model under spatial effect has a significant ratio of 67.27% for the local coefficient estimates, the adjusted goodness of fit of 0.9902, and its estimated value of the variance of random disturbance term, CV criterion value, GCV criterion value, and AICc criterion value are smaller, and the F statistic passes the hypothesis test with a significance level of 1%. In consequence, the spatial fixed effect is more appropriate for the dataset in this paper, and the analysis is carried out according to its parameter results.

The calculation results of PGTWR with spatial fixed effect are shown in Appendix Table 8. Based on the absolute value of the mean of the regression coefficients of each indicator, it is found that PM2.5 pollution has the greatest impact on the level of public health, followed by per capita expenditure on health care in urban households, per capita disposable income of urban households, urbanization rate, and the number of health technicians per ten thousand people. By contrast, the green space per capita and population density have a less intense impact on public health.

Spatial and temporal differences in the impact of various factors on public health

The PGTWR model was used to study the spatial and temporal heterogeneity of the influence of environmental factors, economic factors, social factors, health care factors, and life factors on the level of public health. The revealed temporal and spatial differences are shown in Figs. 4 and 5.

Figure 4a presents PM2.5 pollution regression coefficients that are all positive, consistent with the results in the global analysis, indicating that the intensification in PM2.5 pollution has an inhibitory effect on the improvement of residents' health. Moreover, the public health level of residents from 2006 to 2009 was greatly affected by haze pollution. The average coefficient for each city was 1.274 in 2005, rising to 1.383 in 2018, manifesting that the impact of PM2.5 pollution on health level has increased in the time dimension. Fig.  5a plots the spatial distribution of the average coefficient of PM2.5 pollution and shows that there are significant regional differences in the impact of PM2.5 pollution on health. The cities with the highest correlation between PM2.5 pollution and public health level are mainly located in the eastern region of the YRD, including Taizhou* (1.5421), Ningbo (1.5104), Yancheng (1.4999), Taizhou (1.4779), and Shanghai (1.4764). It may be related to the distribution of PM2.5 pollution in the Yangtze River Delta urban agglomerations, with higher PM2.5 concentration in northwestern cities such as Hefei, Maanshan, Wuhu, Nanjing, and Yangzhou, and lower PM2.5 pollution level in eastern cities. Once air pollution intensifies, the public health of residents in eastern cities is at greater risk and will be more sensitive to change in PM2.5 pollution.

According to Fig. 4b, the regression coefficients of per capita disposable income of urban households are all positive, which is in accordance with the results of the global analysis, demonstrating that there was a negative effect of per capita disposable income of urban households on public health. The impact of per capita disposable income of urban households on public health was greater in Chizhou, Anqing, Tongling, and Hefei from 2009 to 2018, and given that the average coefficient for each city was 0.632 in 2005, rising to 0.713 in 2018, the impact increased in the time dimension. Fig.  5b presents clear regional differences in the impact of per capita disposable income of urban households on public health. The cities with the highest correlation between the two are mainly located in western YRD cities such as Anqing (0.8937), Chizhou (0.8782), Hefei (0.8720), Tongling (0.8646), and Wuhu (0.8184).

Figure 4c indicates that the increase in the level of urbanization rate is conducive to the improvement of residents’ health. In 2005, the average coefficient of each city was − 0.406, and in 2018 it dropped to − 0.383, reflecting the weakening of the impact of urbanization rate on health level in the time dimension. According to the spatial distribution of the average coefficient of Fig  5c, it is found that the cities with the higher correlation between urbanization rate and public health level are mainly located in the northeastern region of YRD, consisting of Yancheng (− 0.5641), Taizhou (− 0.4834), Yangzhou (− 0.4650), Zhenjiang (− 0.4478), Nantong (− 0.4356), and other cities.

Figure 4d shows that the regression coefficients of population density are negative except for the coefficient of Ningbo, Zhoushan, and Shanghai in 2007, indicating that the increase of population density has a certain contribution to the improvement of public health. The average elasticity coefficient of population density to health level was − 0.223 in 2005 and − 0.068 in 2018, revealing that the intensity of the effect of population density on health level has weakened over time and that the intensity of the effect was greater in 2005 and 2006 during the study period. From Fig.  5d, it can be seen that the northern YRD cities of Yancheng (− 0.1492), Chuzhou (− 0.1353), Yangzhou (− 0.1334), Nanjing (− 0.1250), and Taizhou (− 0.1243) are the regions with a high correlation between population density and public health level.

Figure 4e displays the regression coefficients of the number of health technicians per ten thousand people are all negative, in line with the results of the global analysis, which describe that the increase in the number of health technicians makes for the improvement of residents’ health. It is worth noting that the average coefficient for each city in 2005 was − 0.264, and it decreased to − 0.411 in 2018. The positive effect of the number of health technicians on the public health level is not only enhanced in the time dimension, but its enhancement reached 0.147. Fig. 5e shows significant regional differences in the impact of the number of health technicians on the level of public health, with a trend of weakening from west to east. Moreover, the cities whose public health level is greatly affected by the number of health technicians are mainly located in the cities of Anhui Province, such as Anqing (− 0.6774), Chizhou (− 0.6419), Hefei (− 0.6339), Tongling (− 0.6151), and Wuhu (− 0.5324).

It can be seen from Fig. 4f that the elasticity coefficients of per capita expenditure on health care in urban households are all negative, supporting the results of the global analysis. And the intensity of the impact of per capita health care expenditure on health levels was − 0.118 in 2005, rising to − 0.102 in 2018, with a weak change overall. According to Fig.  5f, it can be learned that the impact of per capita health care expenditure on public health has regional differences, and the major cities with a greater impact are mainly in Anhui Province, such as Anqing (− 0.2588), Chizhou (− 0.2399), Hefei (− 0.2332), Tongling (− 0.2257), and Wuhu (− 0.1870).

Figure 4g details regression coefficients of green space per capita. The regression coefficients of green space per capita in the analysis by the PGTWR model are all negative except for Anqing in 2006 and 2007, revealing that good quality of the living environment is conducive to the improvement of residents’ public health. On the whole, the level of public health of residents in most cities was influenced to a greater extent by green space per capita from 2009 to 2018. The average coefficient for municipalities was − 0.0523 in 2005 and decreased to − 0.0860 in 2018. Fig.  5g shows the spatial distribution of the average coefficient of green area per capita. It is also found that the cities with the highest correlation between green space per capita and public health level were mainly distributed in the northeast region of YRD, consisting of Nantong (− 0.0938), Suzhou (− 0.0921), Wuxi (− 0.0919), Shanghai (− 0.0899), Changzhou (− 0.0898), and so on.

Conclusions and policy implications

Based on the above empirical analyses, the main conclusions of this paper are as follows:

  1. 1.

    PM2.5 pollution and public health level are not randomly distributed among cities in the Yangtze River Delta urban agglomerations but are characterized by spatial positive correlation and spatial clustering. And the local clustering of public health level is not significant. Relatively speaking, PM2.5 pollution has the obvious characteristic of local clustering. The cities of Wuhu, Hefei, and Nanjing are high-high clustering areas for PM2.5 pollution, while cities such as Shaoxing, Jinhua, Ningbo, and Taizhou are low—low clustering areas for PM2.5 pollution.

  2. 2.

    Based on the results of SDM estimation, it is found from the global perspective of spatial econometrics that PM2.5 pollution has a significant impact on the level of public health. Taking into account environmental, economic, social, health care, and life factors, PM2.5 pollution is the main factor affecting the health level of residents, regardless of whether the spatial effect is included. Furthermore, the indirect effect coefficient of PM2.5 pollution is significant at − 0.8818, indicating that the increase of PM2.5 pollution in the city can have a beneficial effect on the public health of neighboring cities, and the spillover effects are significant. Among them, the urbanization rate, the number of health technicians per ten thousand people, and the green space per capita have a significant positive effect on the public health level, with each 1% increase being associated with 0.5680%, 0.4984%, and 0.0851% reduction in the number of deaths from respiratory diseases respectively. Besides, the effects of per capita disposable income of urban households, population density, and per capita expenditure on health care in urban households on public health were limited and not significant.

    As a result, based on the above conclusions, this paper puts forward the following suggestions: (a) Pay attention to inter-city collaboration, improving the coordination mechanism of regional air pollution joint prevention and control, actively building cross-regional air pollution prevention and control command institutions, establishing the air pollution hotspot grid supervision system, and strengthening the whole process of collaborative supervision in order to reinforce the effective management of haze pollution. (b) Increase the investment in health care and actively train high-quality health technicians to improve the quality and efficiency of medical services, and improve and implement the diversified and comprehensive regulatory system for the health care industry to create a good medical environment for patients. (c) Strengthen the protection of the urban and rural ecological environment, strictly prohibit deforestation, actively expand the green area, while reinforcing the management of landscaping construction and improving the supervision of landscaping to create a good living environment for residents. (d) Lay stress on alleviating the trend of environmental pollution in the process of promoting urbanization and realizing environmental regulation of new-type urbanization to promote high-quality, healthy, and green development of urbanization.

  3. 3.

    In the light of the estimation results of the PGTWR model, a spatial–temporal analysis of the regression coefficients of each indicator is carried out. From the local spatial econometric perspective, it reveals that the impact of environmental, economic, social, health care, and life factors on public health level presents obvious temporal and spatial heterogeneity, leading to distinctly different public health problems in different cities. Among them, in terms of environmental factors, the public health level of cities such as Taizhou*, Ningbo, Yancheng, Taizhou, and Shanghai are greatly affected by PM2.5 pollution. As for social factors, changes in the level of urbanization have a greater impact on the public health of cities such as Yancheng, Taizhou, Yangzhou, Zhenjiang, and Nantong. Additionally, concerning health care factors, cities where the number of health technicians has a greater impact on public health, include Anqing, Chizhou, Hefei, Tongling, and Wuhu. Referring to life factors, public health is strongly influenced by the green space per capita mainly in the cities of Nantong, Suzhou, Wuxi, Shanghai, and Changzhou. It is worth noting that the impact of PM2.5 pollution, the number of health technicians, and the per capita green area on public health has increased over time. Of these, the strongest enhancement was in the number of health technicians, whose impact intensity increased by 0.147, implying that more attention should be paid to the training of medical and health personnel.

Moreover, since the coefficients of various indicators show temporal and spatial heterogeneity, the differences in the degree of influence of different indicators should be considered when developing countermeasures against haze pollution. (a) Firstly, for cities such as Yancheng, Taizhou, Yangzhou, Zhenjiang, and Nantong, the city governments should actively promote new-type urbanization to take advantage of the population gathering and industrial agglomeration effects it brings. At the same time, the governments are supposed to promote the development of public services such as medical care and education to further improve the public health level of residents. (b) Secondly, in cities such as Anqing, Chizhou, Hefei, Tongling, and Wuhu, the city governments should strengthen the training of health care professionals and encourage the development of health care to gradually improve the level of medical and health services due to the imperfect medical facilities and services in the cities. (c) Thirdly, the city governments of Nantong, Suzhou, Wuxi, Shanghai, and Changzhou should step up environmental protection efforts, and pay attention to the improvement and effective implementation of air governance policies. At the same time, the governments are ought to strengthen the construction of urban and rural parks, and rationalize the arrangement of green space to form the park system with a reasonable layout. In addition, municipal governments should continuously improve urban resilience. The relevant functional departments should actively fulfill their regulatory responsibilities, strengthen the supervision of mobile source pollution control, and enhance their regulatory capacity by improving the legal standard system and increasing penalties.

This paper explores the spatial correlation and spatial–temporal heterogeneity of the impact of PM2.5 pollution and other factors on residents’ public health through the SDM and the PGTWR model, providing a scientific direction for 26 cities in the Yangtze River Delta urban agglomerations to cope with environmental health issues. Moreover, a comprehensive analysis of the impact of haze pollution on public health from both global and local perspectives makes up for the limitations of the existing literature, and the parameter estimation results are conducive to proposing targeted strategies for improving the public health level of different cities. Considering the spillover effects of haze pollution, each city government should take into account regional environmental governance and protection while implementing differentiated policies. Importantly, the government should pay attention to the realization of cross-regional environmental cooperation, effectively promote the joint prevention and control of regional air pollution, and timely evaluate the effect of collaborative regulation of air pollution prevention and control, to maximize the effect of collaborative governance.

Fig. 1
figure 1

Study area

Fig. 2
figure 2

Recognition of spatial correlation between air pollution and public health. H stands for public health level. The 10%, 5%, and 1% significance levels are represented by the *, **, and ***, respectively

Fig. 3
figure 3

Local Moran’s I index of a respiratory disease deaths in 2005; b PM2.5 concentration in 2005; c respiratory disease deaths in 2011; d PM2.5 concentration in 2011; e respiratory disease deaths in 2018; f PM2.5 concentration in 2018

Fig. 4
figure 4

Coefficients for each region at different times. a Coefficients of PM2.5 pollution (PM2.5); b coefficients of per capita disposable income of urban households (PDI); c coefficients of urbanization rate (urban); d coefficients of population density (pd); e coefficients of number of health technicians per ten thousand people (num_tech); f coefficients of per capita expenditure on health care in urban households (hpe); g coefficients of green space per capita (green)

Fig. 5
figure 5

Results of PGTWR of factors in 16 cities in YRD. a Average coefficient of PM2.5 pollution (PM2.5). b Average coefficient of per capita disposable income of urban households (PDI). c Average coefficient of urbanization rate (urban). d Average coefficient of population density (pd). e Average coefficient of the number of health technicians per ten thousand people (num_tech). f Average coefficient of per capita expenditure on health care in urban households (hpe). g Average coefficient of green space per capita (green)