Keywords

1 Introduction

The northern regions have been the most affected area by the first wave of the COVID-19 epidemic in Italy. The great speed and intensity with which disease has spread to these regions has led to the hypothesis, in some preliminary studies, that high levels of pollution may play a role in epidemic expansion and in determining the severity of the infection (Coccia 2020; Conticini et al. 2020; Fattorini and Regoli 2020; Martelletti and Martelletti 2020). In fact, Northern Italy is one of the most heavily polluted area in Europe in terms of smog and air pollution (Carugno et al. 2016) because it is characterized by a high concentration of densely populated urban areas, as well as by a strong presence of industrial activities. In addition, the particular geomorphological conformation and the weather-climatic characteristics of some areas do not favor the recirculation and release of pollutants with their consequent stagnation.

This contribution intends to verify these hypotheses at a more detailed geographical scale than the previous studies (Fattorini and Regoli 2020; Pignocchino et al. 2022), that is to say at the level of local systems, considering the northwestern regions of Italy, that are characterized by different environmental conditions and socio-demographic patterns. The results should therefore have a greater geographical consistency and, therefore, scientific validity.

2 Materials and Methods

This research project focused on the period of the first COVID-19 epidemic wave (February–May 2020) in the Northwest regions. We worked on the data at the local level by considering Local Labor Systems (LLS), which are inter-municipal territorial units, independent from any administrative subdivision, and defined by the daily commuting flows recorded in the general population and housing censuses (ISTAT 2015a).

In order to analyze this data at the local level, the first necessary step was to create a cartography of LLS updated to the most recent changes in the administrative boundaries of the municipalities using the data referring to January 1, 2020, obtained by ISTAT—Italian National Institute of Statistics (ISTAT 2020a). The total number of LLS in Piedmont, Valle d'Aosta and Liguria regions is 55 (Fig. 7.1).

Fig. 7.1
A map and a table. A map of Piedmont region marks the region delimitations and marks and labels 55 local labor systems code. The table has the 55 local labor systems code and its denomination.

LLS of Piedmont, Valle d'Aosta, and Liguria

From the first months of the pandemic, a discrepancy is emerged between official data on COVID-19 deaths and the significantly higher number of deaths compared to previous years (Ciminelli and Garcia-Mandicó 2020). Therefore, to estimate the real impact of COVID-19, we used excess mortality data, defined as the percentage of change in the mortality of the year 2020 compared to the average number of deaths in the same period of the previous five years (2015–2019). These data were retrieved from ISTAT (ISTAT 2021a), temporally aggregated bi-weekly and then for the overall period (February-May).

Nitrogen dioxide (NO2) concentrations data were derived from the space satellite Sentinel 5 Precursor (S5P) managed by the European Space Agency (ESA) under the Copernicus program. According to Eum et al. (2019), there is an association between one year of exposure to nitrogen dioxide and the increase of mortality from several pathologies (cardiovascular and respiratory, in particular). Therefore, we decided to consider the time period of one year as proxy for the pollution data. The data were retrieved from the Sentinel-5P OFFL NO2 dataset through Google Earth Engine platform (GEE), obtaining a single grid image defined by the annual mean of NO2 concentrations from July 1, 2018 to June 30, 2019 with a resolution of 1 km2 per pixel.Footnote 1 Pollution data of nitrogen dioxide for the first period of the epidemic wave (February to May) were also collected in the form of nine grids, each with the average concentration referenced over a two-week interval.

Next to this, population-weighted average was calculated using a grid image from the Gridded Population of the World dataset of the Center for International Earth Science Information Network (CIESIN 2018). This grid was also used to derive the population density (population/km2) of each local system.

The Italian National Institute of Statistic was the source of the data on the resident population as of January 1, 2020 (ISTAT 2020b) and for LLS quality indicators (ISTAT 2015b). Among many quality indicators, three were used in this study: the number of jobs, from which it was derived the number of jobs per capita (jobs/population), and two self-containment indices (SCI), one referred to the jobs demand and the other to the jobs supply. The self-containment index of the demand (SCID) is calculated as the number of workers who work and live in the LLS divided by the number of workers who are working there, while the self-containment index of the supply (SCIS) is calculated as the rate between the number of workers who work and live in the LLS and the number of workers who are living there. These indices vary from 0 to 1 and quantify the level of isolation of the local systems, i.e., higher values of SCID index correspond to fewer workers who come from outside the LLS to work, while higher values of SCIS index correspond to fewer residents who work outside the LLS.

All data not available at the local system level were obtained at the municipality level and then aggregated to LLS.

An exploratory and descriptive analysis was conducted for each variable considered in this study, using histograms and boxplots to investigate the distributions of the values and detect the presence of outliers, and finally cartograms to evaluate the spatial distribution. To assess the effect of emergency measure taken by the Italian government (lockdown) on air pollution, a series of maps of the bi-weekly period of NO2 concentrations was created. In addition, for each bi-weekly period, a cartogram on excess mortality data was also created to visualize the spatial and temporal evolution of the pandemic.

Spearman’s correlation coefficientFootnote 2 was used to examine the relationship between nitrogen dioxide concentrations and excess mortality. Next to this, three quasi-Poisson regression models were implemented to exploring further the relationship between the two variables. Poisson’s multivariate regression models are generalized linear models based on a logarithmic scale. They are useful with counting variables (number of deaths, in this case) and for modeling rates. A generalization of Poisson models was used in this study: the quasi-Poisson regression, which allows to take in account the overdispersion of the data by adjusting the variance to a specific dispersion parameter (Berk and MacDonald 2008).

Within these models, other variables were considered for a possible confounding effect on the associations between the two main variables: the population density, the proportion of the population over the age of 65 (population over 65/total population), the proportion of the male population (male population/total population), the number of jobs per capita, the LLS self-containment index of the demand, and the LLS self-containment index of the supply. These confounding factors were chosen as the data show higher mortality among males and among the elderly people. More deaths are expected in areas with higher population density while higher number of jobs and the level of isolation measured by the self-containment indices could represent a limitation to the spread of the virus within local systems.

Results obtained from the quasi-Poisson multivariate regression models are expressed as estimated rate ratio (RR), a relative difference used to compare different observed incidence rates due to exposure to a given risk factor over a given time period. In this study, the rate ratio is calculated as the ratio of the incidence rate in an exposed group and the incidence rate in a less exposed group. The risk factor is the NO2 concentrations.

All cartograms were created with QGIS platform and all statistical analysis were carried out with RStudio software.

3 Results

3.1 Descriptive Analysis

All the variables used in this study have a tendency toward a normal distribution, as shown in the histograms in Fig. 7.2 The most extreme values, numerically distant from the rest of the data collected, are identifiable as outliers and only the two self-containment indices are the variables that do not present with one, as also confirmed by the boxplots in Fig. 7.3.

Fig. 7.2
8 histograms represent trends which increase then decrease, with outliers.

Histograms of variables

Fig. 7.3
8 box plots. The estimated median values are. Excess of mortality, 1.7. N O 2 concentration, 45. Population density, 180. Proportion of population over 65, 0.275. Proportion of male population, 0.49. Jobs per capita, 0.34. Self-containment indexes of demand and offer, 0.825 and 0.76.

Boxplots of variables

The effects of the limitations to circulation, imposed by the Italian government with a national lockdown, in LSS considered are visible in Fig. 7.4. The reduction in NO2 concentrations starts from the 9th week, and there is a constant decreasing trend of the values recorded for the whole lockdown period (weeks 11–18). The end of the national restrictions marks the return to higher values as shown in weeks 19–20.

Fig. 7.4
9 maps of Piedmont region highlight the average concentrations of N O 2 for L L S. The concentration of N O 2 for L L S is the highest in map 2, weeks from 7 to 8, and lowest in map 7, weeks from 17 to 18. The concentration of N O 2 is the highest in 2 regions from 150 to 175 of map 2.

Average concentrations of NO2 for LLS

A good visualization of the first evolution wave is represented in Fig. 7.5. The cartograms show the excess mortality in the nine bi-weekly periods considered from weeks 5 and 6 (January 27–February 9) to weeks 21 and 22 (May 18–31) in all LLS used in this study. It is evident that the excess mortality started in weeks 9–10 when the number of local systems with excess mortality was greater than 50% (Fig. 7.6). The peak was reached in weeks 13–14, with only one local system without excess mortality, and began to fade after weeks 15–16. It is noticeable that the most affected period, which runs from weeks 11 to 18, corresponded to the national lockdown that started on March 9 (DPCM 2020a) and ended on May 4 (DPCM 2020b).

Fig. 7.5
9 maps of Piedmont region highlight excess mortality for L L S. It is the highest in map 5, from weeks 13 to 14, followed by map 6, from weeks 15 to 16, and the lowest in map 1, from weeks 5 to 6. In map 5, the excess mortality for L L S is the highest in 2 regions from 280 to 500.

Excess mortality for LLS

Fig. 7.6
A line graph plots excess mortality percentage versus bi-weekly period. The line starts from 20% in fifth to eighth week, rises to 100% in thirteenth to fourteenth week, and drops to 58% in twenty-first to twenty-second week. Values are estimated.

Excess mortality trend

Considering the excess mortality for the whole period of the first wave (Fig. 7.7a), a pattern of higher values emerges close to the border with Lombardy region, although not well defined since some border LLS report only medium values. Other clusters of high values far from it are the local system of Garessio and the axis Tortona–Turin. This pattern becomes more evident in the cartogram of NO2 concentrations (Fig. 7.7b) where a group of local system with medium–high values on the eastern border is clearly visible. The values tend to decrease toward West except for Turin LLS which corresponds to the outlier value.

Fig. 7.7
2 maps of the Piedmont region. a. It highlights excess mortality, which is the highest in Tortona and Garessio from 250% to 300%. b. It highlights the average concentrations of N O 2. The average concentrations of N O 2 is the highest in Turin from 100 to 120.

a Excess mortality; b average concentrations of NO2

Not all variables present a clear pattern like the previous; however in the cartogram of the population density (Fig. 7.8a), it is possible to see a spatial correlation with the NO2 concentrations, which is confirmed by a Pearson correlation index of 0.51 with a p-value of 5.12e−05. Another pattern is shown in the cartogram of the proportion of the elderly population where the local systems of Liguria region are all part of the upper classes (Fig. 7.8b).

Fig. 7.8
4 maps of Piedmont region. a. Population density is the highest in Genoa from 700 to 900. b. The proportion of elderly population classes are very high in Garessio. c and d. The jobs demand and supply indexes are the highest in Aosta, Turin, and Genoa.

a Population density (2020); b proportion of elderly populationFootnote

The classes were defined using the mean value and adding or subtracting one and two standard deviations.

; c self-containment index of the jobs demand (SCID); d self-containment index of the jobs supply (SCIS)

While, in the two cartograms of the self-containment of the job market indices (Fig. 7.8c, d) no scheme emerges but the local system of the regional capitals have the highest values in both indices, which is predictable as they are strong centers of attraction for labor.

3.2 Statistical Analysis

Spearman’s correlation coefficient revealed that excess mortality during the first wave is moderately correlated to nitrogen dioxide concentrations (ρ = 0.43, p-value < 0.05), as evidenced by the regression line in the scatterplot in Fig. 7.9.

Fig. 7.9
A scatter plot of excess mortality versus N O 2 concentrations. The regression line increases from (10, 1.3) to (105, 2.3). Values are estimated.

Scatterplot: excess mortality versus NO2 concentrations

Subsequently, this correlation was also investigated using three quasi-Poisson regression models. Since the resulting coefficients of the regression models are reported on a logarithmic scale, they must be antilog to obtain rate ratios in the original scale for a better interpretation. Therefore, in the conversion process with the exponential function, some variables have been further transformed. The coefficients of NO2 concentrations were multiplied for 10 so the rate ratio expresses the percent variation in excess mortality for each variation of 10 µmol/m2. Similarly, other coefficients were transformed: the population density was multiplied for 100 so the rate ratio indicates the percent variation in excess mortality for a variation of 100 inhabitants/km2. While, the coefficients of the two population proportions and the two self-containment indices (SCI) were divided by 100 so the rate ratios measures the percentage change in excess mortality for a one percentage point change in these indices.

In the first model, only the two main variables (NO2 concentration and excess mortality) of the study were considered. As given in Table 7.1, each increase of 10 µmol/m2 in NO2 concentrations is associated with a 1.49%Footnote 4 (95%CI: 0.33 ÷ 2.67) increase in excess mortality.

Table 7.1 Values resulted from the regression in the first model (already retransformed)

Next to this, other variables were also considered for a possible confounding effect. These other two models differ for just one variable: one model considers the self-containment index of demand (“demand model”) and the other the self-containment index of supply (“supply model”). In the “demand model” (Table 7.2), among all the variables, the only one statistically associated with the excess of mortality is the demand SCI variable.

Table 7.2 Values resulted from the regression in the “demand model” (already retransformed)

Table 7.3 shows the rate ratios of the “supply model” in which four variables result associated with excess mortality. The associated percentage increase in excess mortality is 1.88% for each 10 µmol/m2 increase in NO2 concentration and 2.38% for one percentage point increase in the elderly population proportion. While, for an increase of one job per capita, the excess mortality is sixfold.

Table 7.3 Values resulted from the regression in the “supply model” (already retransformed)

Both self-containment indices have an inverse association with the excess mortality indicated by the fact that both rate ratios are less than one. This inverse association means that each increase in one percentage point of the index is associated with a decrease in the excess mortality, respectively, of—0.65% for the demand SCI and—0.64% for the supply SCI.Footnote 5

Moreover, in both two models, the variables of population density and the proportion of the male population do not find a significant association with excess mortality.

4 Discussion and Conclusions

The relationship between NO2 concentration and excess mortality investigated in this study partially emerges from the comparison of the respective cartograms. As previously described, these two variables tend to show higher values along the border with Lombardy region, particularly evident for the nitrogen dioxide pollutant. The zoning of NO2 values follows the spatial distribution of the population density. This result is an expected consequence since more polluting sources (i.e., industries and vehicular traffic) tend to reside in the most populated areas.

The pattern of the highest excess mortality values near the Lombardy region needs more clarification: Lombardy region was the epicenter of the pandemic in Italy, and this explains how the local systems closest to the border have a higher excess of mortality, with a decreasing gradient toward West. However, this gradient is nonlinear as proximity is not the only contributing factor to the spread of the virus. In facts, an important element to consider is the degree of relations and connections that each local system has with other local systems. The more distant but more connected local systems, such as the Turin one, could be more affected by the spread of the virus than the systems closer to Lombardy but less connected, i.e., Casale Monferrato. For other local systems such as Garessio, the reason of high excess mortality value could be linked to the remarkably high value in the proportion of the elderly population. However, this kind of observations based on the comparison between only two variables could lead to misinterpretation of the real relationship; hence, statistical analyses are needed to consolidate preliminary observations. The nitrogen dioxide concentrations result associated to the excess mortality in two of the three regression models implemented. Our finding is therefore in accordance with previous studies (Filippini et al. 2021; Fattorini and Regoli 2020; Ogen 2020). In the “supply model”, other variables, considered as confounders, showed a statistically significant association with the excess mortality. In the case of the proportion of the elderly population this result is congruent with the high incidence rate of COVID-19 in elderly people reported by the Italian National Institute of Statistic (ISTAT 2021b). Among the variables associated with excess mortality, the rate ratio with the greatest multiplicative effect is the one referred to the jobs per capita: since the period with higher excess mortality values coincides with the national lockdown, when the only movements allowed were mainly for work reasons. The number of jobs in a local system was critical for the spread of the virus because it increased the intensity of travels and the amount of close contacts between coworkers. Therefore, it can be assumed as the cause of such a great multiplicative effect. An unexpected result is the non-association of population density with excess mortality; in fact, a greater population density should correspond to a greater possibility of interpersonal contacts and therefore more infections. An interesting interpretation of this is provided by the study of Cremaschi et al. (2021) which suggests the adoption of a relational density (McFarlane 2016) to justify the spatial spread of the virus. Density can be considered a measure of how close social relations are in the resident population. The use of this variable would allow to highlight substantial differences in socio-spatial relationship that characterize small- or medium-sized territorial systems compared to those of large cities.

The two self-containment indices were used as a measure of the isolation of the local systems. Higher values correspond a smaller number of workers who arrive from outside the system or who leave the system in which they live to work outside. The inverse association with the excess mortality demonstrated by both indices means that a system with a higher SCI, net of other variables, recorded a lower excess mortality during the first wave of the pandemic. This relationship could be explained as lower exchanges of workers with other local systems. In an initial phase of the spread of the virus, when it was not yet homogeneously present on the territory, exchanges with external systems were essential for its transmission. Furthermore, considering that the only travels allowed during the lockdown period were those related to work, this fact strengthen the role of these indices in the interpretation of the spread of the disease.

The multivariate regression models carried out confirmed the existence of a significant statistical association between NO2 concentrations and excess mortality values for local systems of the northwestern regions during the first COVID-19 epidemic wave in Italy.

The results obtained are thus part of a broader panorama of studies aimed at evaluating the effects of air pollution on human health (Carugno et al. 2016; Eum et al. 2019; Fattorini and Regoli 2020; Filippini et al. 2020; Ogen 2020) but also highlight the importance of conducting such analyses on a more appropriate geographic scale such as that of local systems.