Introduction

From the combination of demographic changes (urbanisation, increase of the share of older people in total population), climate change (increase of mean and extreme temperature) and local urban effects (urban heat island, concentration of air pollution), considerable health risks related to climate and air substances emerge. A specific concern applies to ground-level air temperature and ozone (O3) concentrations, due to the physical relationships between these variables, the single and combined effects of both variables on human health and the anticipated substantial changes in the scope of climate change.

Temperature extremes like hot days and heat waves can have a negative impact on human health and prolonged exposure to high air temperature can cause heat-related diseases like heat exhaustion and heat stroke. Epidemiological studies have shown that high air temperatures are related to increased mortality (Hajat and Kosatky 2010). Heat exposure was found to be associated with increased risk of cardiovascular, cerebrovascular and respiratory mortality (Song et al. 2017). However, the authors also noted that health effects of temperature depend on the definition of indicators and thresholds and that definitions of temperature exposure at the regional level are necessary. With respect to thermal load, an official warning is issued by the German Weather Service when the apparent temperature exceeds 32 °C. Also, the definition of a “hot day” with maximum temperature >30 °C is frequently used in studies (e.g. Vescovi et al. 2005).

Exposure to ground-level O3 can also have significant health impacts. Ground-level O3 is a secondary air substance, which is primarily built by photochemical reactions under solar radiation with the involvement of precursor gases including nitrogen oxides, carbon monoxide, methane and non-methane volatile organic compounds. It has to be distinguished from stratospheric ozone, which acts to protect against harmful UV radiation. With respect to long-term O3 exposure, an effect was found on respiratory and cardiorespiratory mortality, especially in people with potential predisposing conditions (WHO 2013). In a study of effects of short-term exposure to O3, a 0.35% (95% confidence interval 0.12–0.58) increase in cardiovascular mortality per 10 μg/m3 increase in daily maximum 1-h O3 concentrations was found (Katsouyanni et al. 2009). For the protection of human health from O3 exposure, information and alarm thresholds (1-h O3 concentration of 180 μg/m3 and 240 μg/m3, respectively) and target values (maximum 8-h mean of 120 μg/m3 at no more than 25 days per year) exist. The WHO (2006) even recommends the use of 100 μg/m3 8-h mean as guideline. In a study relating myocardial infarction (MI) frequencies to O3 exposure for the city of Augsburg, Bavaria, significantly enhanced MI frequencies were found above 96 μg/m3 daily maximum 1-h O3 concentration (Hertig et al. 2019).

There is also evidence of synergistic effects between temperature and O3 pollution. Thus, health effects of O3 are worse on hot days (Pattenden et al. 2010). Also, Katsouyanni and Analitis (2009) found that the effects of heat-wave days on mortality were larger at days with high O3 concentrations, particularly in the age group 75–84 years.

Climate change is associated not only with an increase of mean temperature but also with an increase of temperature extremes in Europe (Jacob et al. 2014) and, via changes of the synoptic conditions and the chemical environment, of ground-level O3 (Katragkou et al. 2011). Increases in summertime O3 are assessed for mid-Europe until the end of the twenty-first century (Colette et al. 2015), particularly in polluted environments (Schnell et al. 2016). Changing patterns of disease are occurring in response to the changing environmental conditions (De Sario et al. 2013). With respect to changes of the temperature-mortality relationships under climate change, for Central Europe, a strong increase of excess mortality of 3.5% (95% confidence interval 0.4 to 7.1) under RCP8.5 scenario conditions was assessed (Gasparrini et al. 2017). A recent study by Orru et al. (2019), analysing changes of premature deaths due to temperature and O3 exposure, found that under RCP4.5 scenario conditions future heat-related deaths will increase, while O3-related deaths will increase when considering only climate change, but will decrease when assuming reductions in O3 precursor emissions. In contrast, Hendriks et al. (2016) note that the effects of increasing temperatures on O3 concentrations and associated health impacts might be higher than the reduction that is achieved by lowering ozone precursor emissions.

In summary, there are still few combined temperature and O3 assessments relevant for human health and there is an incomplete picture about the future changes of temperature and O3 exposure, particularly on the regional to local scale. Thus, the current contribution addresses relationships between air temperature and ground-level O3. Local projections of temperature and ozone events under the constraints of ongoing climate change until the end of the twenty-first century are presented. Attention is also given to the change of health-relevant thresholds of these variables. The federal state of Bavaria, Southern Germany, builds the regional focus.

Data and data preparation

Since health-relevant higher air temperature and O3 concentrations occur in Bavaria primarily in spring and summer, all analyses comprise the months from April to September.

Station-based air quality data

Station-based air quality data were provided by the Bavarian Environment Agency (Bayerischen Landesamt für Umwelt 2019). Twenty-nine air quality stations could be retrieved which provide daily ozone data with a minimum of 5 years of data. Name, geographic coordinates, height and time series length of each station are given in Table S1 in the Supplementary Material. From the hourly data, the daily maximum 1-h ozone concentration was derived. Preliminary analyses using the daily maximum 8-h average O3 concentration or the daily mean O3 concentration provided a lower performance in the statistical models. Thus, the daily maximum 1-h ozone concentration (O3max) was chosen. Daily mean 1-h nitrogen oxide concentrations (NOmean, NO2mean) were also extracted to analyse the station-specific NOx-O3 relationships. Station characteristics were assumed from the classification of the Environment Agency, categorising into urban traffic (5 stations), urban background (6 stations), suburban background (10 stations), peri-urban background (3 stations) and rural regional background (4 stations) as well as rural remote background (1 station). The location of the stations as well as their station characteristic can be seen in Fig. 1.

Fig. 1
figure 1

Location of the air quality stations and their station characteristics (red: urban traffic, orange: urban background, yellow: suburban background, light blue: peri-urban background, turquoise: rural regional background, dark blue: rural remote background)

Station-based air temperature data

Daily 2-m maximum air temperature (Tmax) from meteorological stations of the German Weather Service (Deutscher Wetterdienst 2019) was also extracted. Temperature stations were selected to match the spatial location of the ozone stations as close as possible. Note that the meteorological stations are in general not located within the same station environment as the air quality stations. Meteorological measurements of the German Weather Service target at regional representativeness, where local influences should be minimised. Thus, temperature stations are located in urban proximity, but outside of high-density areas. Studies (e.g. Fenner et al. 2019) have shown that there are only minor effects of the urban heat island (UHI) on Tmax, since the UHI is predominately a nocturnal phenomenon and thus mainly appears in increased minimum and mean air temperatures. Thus, it is assumed that there is no substantial urban-rural difference with respect to Tmax and that the 29 stations can be regarded as representative for the particular city regions.

Predictor data from reanalysis

Larger scale predictor data, i.e. daily mean sea-level pressure data, as well as air temperature and humidity of the 850 hPa level, were taken in a 1° × 1° resolution from the European Centre for Medium-Range Weather Forecasts (ECMWF) ERA5 reanalysis (Hersbach and Dee 2016). The choice of predictor variables was governed by a literature review (e.g. Carro-Calvo et al. 2017; Otero et al. 2016), own analyses and the data availability in reanalysis and earth system models. Mean sea-level pressure data was derived for the domain 25°N–70°N, 25°W–40°E. For all predictors, the mean of the 9 grid boxes covering the area over and around the particular target city was used.

Air flow indices were calculated from the daily mean sea-level pressure fields based on an objective weather type approach developed by Jenkinson and Collison (1977). It has been advanced for reanalysis data, for instance by Jones et al. (2013). The approach makes use of three variables that define the circulation features over the region of interest: overall direction of air flow, the strength of flow and the total shear vorticity. A grid-point pattern, laid out over the study region, was used to calculate these wind-flow characteristics. Details on the calculation can be found in Jones et al. (2013). The indices were calculated for each of the 29 air quality stations using the geographic coordinates of the stations rounded to full degrees as central points for the definition of the grid-point pattern.

Earth system model (ESM) data

Historical and RCP8.5 scenario (Van Vuuren et al. 2011) runs performed for the Coupled Model Intercomparison Project round 5 (CMIP5) were downloaded from the CMIP5 archive (http://pcmdi9.llnl.gov/esgf-web-fe/). Model data were taken from a three-member MPI-ESM-LR ensemble, and one member each from CMCC-CMS, IPSL-CM5A-LR and ACCESS 1-0. The same variables as for the ERA5 reanalysis have been extracted and interpolated to a 1°×1° resolution using ordinary kriging. The period 1950–2005 of the historical runs and the period 2006–2100 of the scenario runs are used.

Methods

Statistical downscaling models

For the definition of the appropriate statistical models, goodness of fit of the O3max data and Tmax data was tested. Normal, Gamma, Weibull and lognormal distributions were considered for O3max and normal distribution for Tmax. Since O3max data contains some zero values, the O3max minimum value was set to 1 μg/m3 for hypothesis testing. Anderson-Darling test and Cramér-von Mises criterion are used for the selection of the statistical models.

Separate statistical models for O3max and Tmax as predictand were built. With the objective of consistent downscaling results, the same predictors were used for both target variables. Predictors-predictand relationships were assessed using generalised additive models (GAM; Wood 2017). GAM is a generalised linear model involving a sum of smooth functions of covariates for the linear predictor. Within GAM, non-linear as well as linear relationships can be modelled. As predictors, the mean of the 9 grid boxes over the target city of daily mean air temperature (x1) and daily mean relative humidity(x2), both at the 850 hPa level, as well as daily mean sea-level pressure(x3), was used. Furthermore, the three air flow indices (x4, x5, x6) were taken as predictors. Seasonality (defined by sin (2πd/365.25) and cos (2πd/365.25), x7, x8) was also taken into account. Preliminary tests with linear (x) as well as a non-linear (f(x)) characterisations of the relationships showed that non-linear characterisations are appropriate. Thus, the model can be written as

$$ g\left({\mu}_i\right)={A}_i\theta +{f}_1\left({x}_{1i}\right)+{f}_2\left({x}_{2i}\right)+{f}_3\left({x}_{3i}\right)+{f}_4\left({x}_{4i}\right)+{f}_5\left({x}_{5i}\right)+{f}_6\left({x}_{6i}\right)+{f}_{78}\left({x}_{7i,}{x}_{8i,}\right) $$
(1)

where μi ≡ E(Yi) and Yi EF(μi, ∅). Yi is a response variable, EF(μi, ∅) is an exponential family distribution with mean μi and scale parameter ∅, Ai is a row of the model matrix for any strictly parametric model components, θ is the corresponding parameter vector and the fi are smooth functions of the covariates xk. Note that one smooth function of both seasonality parameters x7, x8 was chosen. In the present work, penalised thin plate regression splines were used for estimating the smooth functions. Thin plate regression splines avoid the problem of knot placement, can be constructed for smooths of multiple predictor variables and are relatively computationally efficient (Wood 2017).

Model performance

Performance of the statistical models was evaluated using the p values of the model coefficients, the adjusted R2 of the model and the partial autocorrelation function of the residuals. Also, a bootstrap procedure was applied. Bootstrapping with random sampling of calibration (2/3 of data) and validation (1/3 of data) periods was performed using 100 random samples. Tests with sample sizes up to 5000 gave similar results; thus, 100 samples were selected to balance between stable results and high computer efficiency. Performance in calibration and validation was evaluated using the correlation coefficient between observed and modelled values as well as the mean squared error skill score (MSESS) with the long-term climatology as reference forecast. Both measures were evaluated as mean values over the 100 random samples.

Projections

Statistical projections were done by replacing reanalysis predictor data with the corresponding ESM data. Time slice differences under the RCP8.5 scenario conditions of the future periods 2041–2060 (mid- twenty-first century) and 2081–2100 (end twenty-first century), respectively, compared with the historical period 1986–2005 are used to illustrate the changes of O3max and Tmax under future climate change.

Linear scaling (e.g. Teutschbein et al. 2011; Casanueva et al. 2016) was applied to bias-correct O3max and Tmax statistically downscaled from ESM data. In this regard, monthly correction factors based on the difference of O3max and Tmax, assessed using historical ESM values and observed O3max and Tmax values, were applied so that ESM downscaled values matched the long-term monthly averages of the measured O3max and Tmax, respectively. Thus, biases between downscaled time series and observations were identified and then used to correct both historical and scenario runs, assuming stationarity of the relationships under historical and scenario conditions.

Uncertainties were assessed by using the 95% confidence interval of the GAM-modelled mean (based on the standard error of predictions within the GAM models), the range over the 100 statistical models and the range over the six ESM runs. Results are presented as ensemble mean change and its confidence range, using the maximum uncertainty range, i.e. the minimum and maximum of the change signals across the models obtained from the 95% confidence intervals of the 600 models (100 statistical models × 6 ESM runs).

Results

Characteristics, trends and health-related threshold exceedances of Tmax and O3max

The median value of Tmax was 20.4 °C and the 95% quantile amounted to 28.4 °C, calculated as means over the 29 stations in the months from April to September. Daily Tmax for each station as mean over the months from April to September can be seen in Fig. 2. As spatial pattern, a negative relationship of Tmax with the altitude of the stations becomes visible. A correlation analysis using Spearman’s rank correlation yielded a coefficient of − 0.79 between Tmax and altitude. Due to the consideration of Tmax (and not Tmean or Tmin) and since the temperature stations are located outside urban centres (“Station-based air temperature data”), no UHI signal is present in the temperature data.

Fig. 2
figure 2

Mean daily Tmax in the months from April to September. Mean for each station over the years as indicated in Table S1

O3max concentrations show distinct differences depending on the station characteristics. Thus, the median and the 95% quantile values from April to September, as means over the urban traffic stations, amounted to 77 μg/m3 and 124 μg/m3, respectively, but were already 92 μg/m3 and 138 μg/m3 for the urban, suburban and rural peri-urban background stations, and were even 98 μg/m3 and 141 μg/m3 for the stations of the rural regional background. Figure 3 provides an overview of daily mean O3max for each station from April to September.

Fig. 3
figure 3

Mean daily O3max concentrations in the months from April to September. Mean for each station over the years as indicated in Table S1

O3max concentrations depend strongly on NOx levels and (B) VOC ((biogenic) volatile organic compounds) availability. The median values were 25 μg/m3 for NOmean and 42 μg/m3 for NO2mean at the urban traffic stations from April to September. Concentrations decline fast towards the urban hinterland and amounted in the urban background to 7 μg/m3 for NOmean and 25 μg/m3 for NO2mean, and in the rural regional background only to 1 μg/m3 and 9 μg/m3, respectively. The relevance of spatial NOx variations for O3 formation can also be seen in correlation relationships. Thus, the correlation coefficient between O3max and the (NO2mean/NOmean) ratio amounted to 0.45 at the urban traffic sites and dropped to around zero in the rural background. Also, the negative correlation coefficient between O3max and NOmean deteriorates from − 0.39 to around zero. It highlights the contribution of NOx to ozone formation in the urban area as well as the process of ozone destruction at high NO concentrations. A similar analysis for (B) VOCs could not be done due to data non-availability.

There was a mean (non-significant) Tmax warming trend of + 0.3 K/decade in the months April to September in the time period 1990–2017. For O3max, there was no clear overall trend, with negative trend (e.g. rural regional background with − 2.4 μg/m3 per decade), no trend (urban background) or positive trend (suburban background with 2 μg/m3 per decade). In summary, a heterogeneous picture of O3max progression becomes apparent, with no dependence on specific station characteristics. Note however that the given numbers refer to station group means calculated from stations with different time series lengths and that, for individual stations, different trends may have occurred.

Table 1 gives an overview of the frequency of threshold exceedances (Tmax > 30 °C, O3max > 100 μg/m3/120 μg/m3/180 μg/m3) grouped according to station characteristics. Note that all O3 thresholds were calculated using the daily 1-h maximum, not the maximum 8-h mean. The station associated with rural remote background conditions was not included in this statistic, since it is located in 1770 m asl and shows in general higher O3 concentrations. O3 degradation by deposition and titration, which are dominant processes in the boundary layer, plays only a minor role at this site. As expected, there was no urban-rural differentiation of hot days (Tmax > 30 °C) and only small differences occurred across different station characteristics. With respect to health-relevant O3 concentrations, a high frequency became apparent aside urban traffic sites. Already in the urban background, about one-third of all days between April and September showed O3max concentrations above 100 μg/m3, increasing to over 43% in the rural regional background. The threshold of 120 μg/m3 was exceeded at about 15% of all days in background locations and very extreme O3 days with values above 180 μg/m3 occur in 0.3–0.4% of all days.

Table 1 Observed frequency of exceedance (% of days) of the thresholds O3max > 100 μg/m3/120 μg/m3/180 μg/m3 and Tmax > 30 °C in the months April to September. Numbers are the mean over stations with similar station characteristic

In summary, there was a considerable exposure to ground-level ozone in the summer half year from April to September, with the exception of urban traffic sites. The occurrence of thermal load was almost always connected with the concurrent occurrence of O3 pollution (Table 1). The combined thermal-O3 pollution events affect in general the urban areas as well as the rural regions, with the highest frequency of combined events at urban background locations.

Table 2 Projected frequency of exceedance (% of days) of the thresholds O3max > 100 μg/m3/120 μg/m3/180 μg/m3 and Tmax > 30 °C in the months April to September for the scenario period 2081–2100 (italicized numbers) and change in frequency compared with the historical period 1986–2005 (bold numbers in brackets). Numbers are the mean over stations with similar station characteristic

Relationships between O3max and Tmax

GAM models between O3max and Tmax revealed that there is in general a non-linear relationship between the two variables, with larger increases of O3max at higher Tmax. The explained variances from the GAM models amounted to about 50% as average across the 29 stations, highlighting the strong connection of O3max concentrations with Tmax. Figure 4 shows the relationship exemplarily for one station in Munich, located in the suburban background (München Johanneskirchen), but the same kind of relationship was found for all stations, independent of the station characteristics. As a consequence of the non-linear relationship, it can be expected that increases of Tmax under future climate change conditions have a large impact on O3max concentrations, particularly if the Tmax rise lies in the range of associated large O3max increases.

Fig. 4
figure 4

Observation-based relationship between daily O3max and daily Tmax April–September 1993–2017 for Munich (München Johanneskirchen)

Statistical model performance and predictors

Goodness of fit tests showed for O3max that the best fitting distribution was the Gamma distribution for 20 stations and the normal distribution for 9 stations. Thus, GAM based on Gamma distribution was applied to the 20 stations. As link function, the log link was used. For the remaining 9 stations, GAM based on normal distribution with identity link was applied. With respect to Tmax, the best fitting distribution was the normal distribution and thus, GAM based on normal distribution (identity link) was chosen for this variable.

With respect to statistical model performance, the correlation coefficient between observed and modelled Tmax as mean over the 100 random samples amounted between 0.95 and 0.97 in calibration and 0.94 and 0.97 in validation, depending on the station considered. The MSESS (with 0% = no model skill, 100% = perfect model) took values between 91 and 95% in calibration, and between 89 and 94% in validation.

For O3max, the correlation coefficient between observed and modelled values spanned from 0.71 to 0.85 in calibration and 0.70 to 0.84 in validation, depending on the particular station considered. MSESS amounted to 50 up to 72% in calibration and between 50 and 70% in validation.

Station-based 2 m Tmax showed a strong, almost linear relationship with mean temperature above the boundary layer in 850 hPa height. Also, Tmax generally increased with increasing sea-level pressure, pointing to higher air temperatures under high-pressure conditions in the summer half year. High relative humidity values at the 850 hPa level indicated decreased ground-level Tmax values, reflecting the general inverse relationship between relative humidity and temperature. Also, higher relative humidity values relate to more moist air, which is associated with development of clouds and precipitation and consequently with reduced 2 m Tmax. Seasonality also played an important role in the GAMs, whereas the air flow indices had, depending on the station considered, a varying importance in the statistical models.

For O3max as target variable, the significance of the larger scale predictors, as expressed by the p values of the model coefficients, varied across the stations, but for all stations, p values were significant (95% level) for the predictor variables 850 hPa air temperature, 850 hPa relative humidity and seasonality. In general, station-based ground-level O3max showed a non-linear relationship with daily mean air temperature of the 850 hPa level, with stronger increases of O3max at higher temperatures. This non-linear relationship already became apparent in the connection between O3max and station-based 2 m Tmax (“Relationships between O3max and Tmax”). In contrast, high daily mean relative humidity values of the 850 hPa level were associated with low O3max concentrations. High relative humidity may be seen as an indicator for enhanced cloudiness, reduced incoming solar radiation and atmospheric lability and thus implies reduced O3 concentrations. Furthermore, high atmospheric humidity plays a role by the formation of secondary organic aerosols and by reducing the chemical reactivity of the system, particularly under low NOx conditions (Hertig et al. 2019). With respect to the other predictor variables, a more diverse picture appeared, depending on the target station, but higher mean sea-level pressure was usually connected with increased O3max concentrations. It points to the connection of O3 built-up by photochemical reactions under high solar radiation in the presence of high-pressure systems. An increase of the total air flow often leaded to a decrease of O3max concentrations. Increased wind speeds tend to disperse pollutants and can weaken ozone concentrations, whereas stagnant or low air flow favours pollution concentrations to build up. It can also be seen as an indicator that ozone was produced at the vicinity of the measurement station, and that longer range transport did not play a major role in these cases. The total shear vorticity (as an indicator for cyclonic/anti-cyclonic flow) showed complex relationships with O3max. The relationship with the overall direction of flow also varied, due to the dependence of O3max on sources and sinks in the upstream area.

Projections for the twenty-first century

Bias correction

There was a very low bias of Tmax and O3max between observed and with reanalysis-modelled values, indicating a good performance of the downscaling models per se. However, a comparison of the values statistically modelled using the historical ESM predictor data with reanalysis-modelled values revealed partly larger biases. While MPI-ESM-LR and CMCC-CMS showed a comparatively good agreement between reanalysis-modelled and ESM-modelled values, historically modelled values from IPSL-CM5A-LR were about 1.5 K warmer and values from ACCESS 1-0 even about 3 K warmer than reanalysis-modelled values. Thus, linear scaling was applied to the statistically downscaled values in order to minimise ESM-related biases. After linear scaling, the mean bias over all stations for Tmax amounted to 0.27 K (with a bias range from 0.1 to 0.4 K, depending on the station considered), indicating that the Tmax values modelled with historical ESM predictors were still somewhat warmer than observation-based Tmax values after bias correction. Concerning the days with threshold exceedance of Tmax > 30 °C, there was an about 1% overestimation of the frequency when using bias-corrected ESM historical predictors compared with observational data. For O3max, the mean bias after linear scaling was 0.8 μg/m3 (range from 0.1 to 2.2 μg/m3), pointing to overall slightly higher O3max in the modelled values using historical ESM data. With respect to threshold exceedances, the relative frequency of days with O3max > 100 μg/m3 was on average 4% overestimated in the ESM-modelled values compared with observational values, whereas the frequencies for the thresholds O3max > 120 μg/m3 and O3max > 180 μg/m3 were underestimated with − 0.8% and − 0.2%, respectively. Since in the observation-based/historical periods the frequency of co-occurring Tmax > 30 °C and O3max > 100 μg/m3 events was mainly governed by the Tmax frequency (i.e. a hot day was usually characterised by elevated O3 concentrations), there was, as for the Tmax frequency, an about 1% overestimation in the ESM historical assessments compared with observations.

Future changes of Tmax and O3 max

Figure 5 shows the mean change of daily Tmax between the scenario period 2081–2100 and the historical period 1986–2005, statistically downscaled under the RCP8.5 scenario assumptions. For each station, the value refers to the mean over 600 models (100 statistical models × 6 ESM runs). In addition, Table S2 (Supplementary Material) gives the uncertainty range defined by the minimum and maximum change values from the GAM 95% confidence intervals across the 100 statistical models and six ESMs. Thus, uncertainties related to statistical prediction errors, to the choice of calibration/validation periods and to the application of different ESMs were addressed. A north-south gradient of Tmax warming until the end of the twenty-first century became apparent with stronger Tmax increases in the southern parts of Bavaria compared with the northern parts (Fig. 5). The average Tmax increase under the RCP8.5 scenario amounted to 2.3 K until the mid-twenty-first century (not shown) and to 5 K (to 4.4 K up to 6.3 K, depending on the station considered) until the end of the twenty-first century. This implies a mean rise of Tmax from 20.7 °C in the historical period 1986–2005 to 25.7 °C in the scenario period 2081–2100. Even within the very conservative lower bounds of the confidence ranges, the average increase was 2.8 K (range from 0.7 to 4 K, depending on the station considered, see Table S2), but in the worst-case (upper bounds of the confidence ranges) to 8.5 K (range from 7.3 to 10.5 K). Thus, the downscaling results indicate substantial increases of daily Tmax from April to September and consequently of human thermal load in the future.

Fig. 5
figure 5

Change of daily Tmax from April to September between the periods 2081 to 2100 compared with 1986 to 2005 under RCP8.5 scenario

Concerning O3max, Fig. 6 and Table S3 (Supplementary Material) give the mean changes and the uncertainty ranges for the scenario period 2081–2100 compared with the historical period 1986–2005, statistically downscaled under the RCP8.5 scenario assumptions. For all stations, an increase of daily O3max in the months April to September emerged, ranging between about 13 and 22 μg/m3 depending on the station considered. However, no dependence between the rate of increase and specific station characteristics became evident with the exception of the high-altitude rural remote background station Garmisch-Partenkirchen Wankgipfel, which exhibited the lowest increase. The average increase across all stations amounted to 17 μg/m3 for the months April to September in the time period 2081–2100 (to 7 μg/m3 for the time period 2041–2060, not shown). Thus, on average, an O3max increase was projected from 93 μg/m3 in the historical period 1986–2005 to 110 μg/m3 until the end of the century. The lower and upper bounds of the confidence ranges gave a mean change between − 1 μg/m3 (− 18 to 7 μg/m3, depending on the station considered) and 52 μg/m3 (34 to 86 μg/m3). In summary, the lower bounds of the confidence ranges indicated no change of O3max or slight decreases at some locations until the end of the century. However, the multi-model mean points towards an increase of O3max at all stations and the upper bounds of the confidence ranges denote quite substantial increases of O3max under ongoing climate change. Due to the strong relationship between ozone and temperature, a large part of the projected O3max increases can directly be related to the temperature rise under future climate change.

Fig. 6
figure 6

Change of daily O3max from April to September between the periods 2081 to 2100 compared with 1986 to 2005 under RCP8.5 scenario

With respect to the attribution of uncertainties, as expressed by the confidence ranges of the projection results, it became evident that uncertainties from the statistical models as well as the uncertainties related to the ESMs have about an equal share in total uncertainty.

In order to gain further insight into the impact of the projected Tmax and O3max changes on human health, Table 2 gives the assessed changes of health-relevant threshold exceedances (Tmax > 30 °C, O3max > 100 μg/m3/120 μg/m3/180 μg/m3). A strong increase of hot days with Tmax > 30 °C can be seen, leading to projected frequencies of about 30% of all days in the months from April to September at the end of the twenty-first century. With respect to the frequencies of days when O3max > 100 μg/m3 or O3max > 120 μg/m3, very sharp rises were assessed as well, leading to exceedances of O3max > 100 μg/m3 at about 40% of all days at urban traffic sites and up to about 70% of all days in the rural regional background. Days with O3max > 120 μg/m3 were projected to occur still at about 20% of all days at urban traffic sites and about 45% in rural regional background locations. Besides, there was an overall increase of the frequency of very high O3max concentrations (O3max > 180 μg/m3) to about 1.5–2.5% of all days assessed. Regarding the frequencies of combined Tmax and O3max events, there were also strong increases projected, with the increases being strongly connected to the advance of Tmax > 30 °C frequencies. Only at urban traffic sites lower O3max concentrations can limit the number of co-occurring events, whereas at hot days, O3max concentrations always exceeded the threshold of 100 μg/m3 (also of 120 μg/m3, not shown) at all other station locations. Hence, the frequencies of Tmax > 30 °C and O3max > 100 μg/m3 days in the future time slice 2081–2100 amounted to 27.5% at urban traffic sites to over 29% at urban background locations of all days in the months from April to September, indicating a substantial burden of health-relevant thermal load and ozone pollution in the future.

Discussion

Daily Tmax and O3max were assessed for 29 Bavarian locations with different air quality settings. Statistical downscaling models based on GAM were developed to assess the relationships between larger scale meteorological predictors and local-scale Tmax and O3max in the observational period. Subsequently, ESM data were integrated into the statistical models, and Tmax and O3max were projected until the end of the twenty-first century under the RCP8.5 scenario conditions. Uncertainty estimates of the downscaling results considered uncertainties related to statistical prediction errors within the GAMs, uncertainties covering the choice of different calibration/validation periods and uncertainties due to the application of different ESMs. Uncertainties from the statistical models as well as the uncertainties related to the ESMs contributed about equally to total uncertainty.

In the observational period, daily Tmax in the months from April to September amounted to 20.4 °C and hot days with Tmax > 30 °C occurred on average at about 4.4% of all days, pointing to the location of Bavaria in a rather temperate climate. Daily O3max amounted at the background locations on average to over 90 μg/m3. Furthermore, urban background locations exhibited about one-third of all days between April and September with O3max concentrations above 100 μg/m3, increasing to over 43% in the rural regional background. Still, about 15% of all days exceeded the threshold of 120 μg/m3 at background locations. The frequency of co-occurring Tmax > 30 °C and O3max > 100 μg/m3 days amounted on average to approx. 4.1% and was mainly governed by the frequency of hot days, since high Tmax was usually accompanied by elevated O3max concentrations.

Under RCP8.5 scenario conditions, a substantial increase of Tmax with a mean value of 5 K (uncertainty range from 2.8 to 8.5 K) in the period 2081–2100 compared with the period 1986–2005 was assessed. These results are largely consistent with other studies over the European area (e.g. Jacob et al. 2014). The frequency of days with Tmax > 30 °C showed also a significant rise. Thus, under a strong climate change forcing, hot days may occur at about 30% of all days in the months from April to September.

Projections pointed also to increases of O3max, with a mean rise of 17 μg/m3 (uncertainty range spanning from − 1 to 52 μg/m3). The frequency of threshold exceedances showed also large changes, with mostly over 20% increases of days with O3max > 100 μg/m3 and O3max > 120 μg/m3 at all station locations. Consequently, a very frequent exposure to ozone pollution occurs at background locations in the future and even at urban traffic sites, the frequency of O3max > 100 μg/m3 will rise to about 40% of all days. In recent years, increases of O3 particularly at urban or suburban background sites were observed (Sicard et al. 2018; this study). Sicard et al. (2018) attributed the increasing levels in urban surroundings to a lower O3 degradation by NO, due to the reduction in local NOx emissions. Under climate change conditions, Schnell et al. (2016) found increases of mean and extreme O3 in summer mainly in more polluted regions of Europe, North America and Asia. The authors concluded that higher temperatures increase the efficiency of precursors to produce O3 in polluted regions, whereas precursor availability in neighbouring, downwind locations is reduced. In the present study, no obvious dependence of the O3 increases from the station characteristics could be seen. In fact, regional O3 pollution does not only depend on the availability of O3 precursors, but changes of dynamical and photochemical processes are also important as well as there can be an impact from tropospheric background changes.

Conclusions

In general, a strong relationship between air temperature and O3 formation was found. As a consequence, temperature can be regarded as a powerful predictor to assess O3 concentrations. Due to the non-linear nature of the relationship, higher temperatures usually lead to substantially enhanced O3 concentrations and temperature extremes were usually accompanied by high O3 concentrations. Furthermore, temperature increases in the scope of climate change considerably affected future O3 concentrations.

With respect to changes of Tmax, O3max and their specific thresholds as well as of combined Tmax > 30 °C and O3max > 100 μg/m3 events in the future, a sharp increase was projected, leading to an occurrence of such combined events at about 27–29% of all days in the summer months from April to September. The increases are mainly associated with the strong temperature rise until the end of the century. In summary, the projected Tmax and O3max changes point to a considerable increased health burden in Bavaria until the end of the century, resulting from strong changes of both variables and their associated individual and combined impact on human health.