Introduction

Excess heat common for cities (i.e., Urban Heat Island, UHI, Oke 1982), combined with other urban conditions, affects tree physiological functioning with outcomes ranging from enhanced growth to early senescence, branch die-back, and even mortality (Au 2018; Gillner et al. 2014; e.g., Hilbert et al. 2019). Thus, assessing the effect of increased temperatures on trees, as part of urban green infrastructure, is instrumental for understanding as well as adapting to current and expected conditions in this century (Ward and Johnson 2007), especially considering ever more urbanized societies and the potential for UHI effects to compound with more frequent atmospheric drought (Brune 2016; Norton et al. 2015; Roloff et al. 2009).

The UHI effect, i.e., the difference between urban and adjacent rural (air) temperatures, has been intensively studied for several decades (cf. Oke 1982; Stewart 2011). It is typically related to the structure and density of urban land-use (Kuttler et al. 2015), which can be characterized through local climate zones, and modulated by physiographic and urban characteristics, such as vicinity to water bodies, predominant wind and street direction, etc. (Stewart and Oke 2012); yet, the physical basis for the excess heat in cities is to a large extent found in the altered surface energy balance as the proportional cover of vegetation decreases compared to rural (or reference) systems (Hertel and Schlink 2019; Oke 1992). In temperate climates, this results in strongest UHI magnitudes at night (cf. Fenner et al. 2014). For example, Berlin features the most intense UHI in Germany due to its large extent and development intensity with an average air temperature increase of around 5 K at night-times (2001–2010) with maxima of up to 11 K (Fenner et al. 2014) in urban \(vs.\) rural areas.

Increased air temperatures due to UHIs can affect tree growth through altering several physiological processes across plant organs directly or indirectly (Dusenge et al. 2019). Generally, reaction times at cellular level increase with temperature up to a maximum, after which a drop in enzymatic activity results in a species-dependent optimum curve (Arcus et al. 2016; Parent et al. 2010). In leaves this optimum response is reflected in the net assimilation rate of carbohydrates, as a balance of photosynthesis and respiration, with losses exceeding gains more rapidly with increasing temperatures (Long 1991). These responses vary between species (Tjoelker et al. 2001) as well as intra-specifically due to local acclimation, i.e., a shift of optimum temperature responses after prolonged exposure (Yamori et al. 2014), and threshold temperatures before tissue damage occurs (for review see Geange et al. 2021). High temperatures in temperate areas are often coincident with low relative air humidity (i.e., large vapor pressure deficit), which in turn can decrease stomatal conductance governing the majority of gas exchange in leaves (Grossiord et al. 2020), and thus the capacity for photosynthesis. Under prolonged stomatal closure (or decreased conductance) with high temperatures, trees may thus face decreased growth (in subsequent years) or even starvation as their carbohydrate reserves are depleted yet not replenished at sufficient rates (McDowell et al. 2008). Furthermore, air (and soil temperatures) affect the initiation, speed and cessation of cambial activity, and thus radial growth throughout a growing season (e.g., see Begum et al. 2013; Rathgeber et al. 2016). Radial growth is increasingly considered to be limited by wood formation dynamics and their relation with environmental drivers, rather than solely by photosynthetic activity (Körner 2015). In particular, the availability of soil water is critical for cell expansion (e.g., Peters et al. 2021) and most likely limits radial growth before photosynthesis (Fatichi et al. 2014); however, this water availability is again linked to local climate as higher temperatures drive evaporation and thus may contribute to the depletion of soil water storage, impeding growth.

Several urban tree species have been shown to have tendencies of enhanced growth rates and/or productivity compared to rural conspecifics (e.g., Briber et al. 2015; O’Brien et al. 2012), which is typically attributed to increased temperatures (Jia et al. 2018; Pretzsch et al. 2017), however, a broad range of effect sizes and signs (i.e., reduced growth) specific to species and location are also commonly reported alongside (e.g., Berland 2020). For instance, Zhao et al. (2016) showed that productivity rates, as a proxy for growth, increased within urban clusters as urbanization intensifies using remotely sensed vegetation indices. Furthermore, Moser-Reischl et al. (2019) identified positive associations between air temperature and radial growth for two species commonly selected by urban planners (Tilia cordata Mill., Rubinia pseudoacacia) in Munich. Similarly, for Berlin, Dahlhausen et al. (2018), identified enhanced growth in highly urbanized environments for T. cordata, the most abundant tree of the city, which they attributed to the UHI effect, while intermediate development intensity was adverse for tree growth. By contrast, Gillner et al. (2014) highlight decreased growth for Acer species (A. platanoides and pseudoplatanus), Platanus x hispanica and Quercus rubra with higher summer temperatures of the preceding year, especially when compounded with drought, in another major German city (Dresden). These differences in growth trends may result from contrasting species-specific responses to increased temperatures, but are indeed affected by other (time-varying) factors and stochastic processes, such as water availability, pollution and road-salt loading, structural impedance by infrastructure, or management, etc. (Pauleit et al. 2002; Quigley 2004; Randrup et al. 2001; Rhoades and Stipes 1999).

Multiple of the aforementioned studies applied space-for-time substitutions or time series comparisons to quantify temperature and excess heat on growth for a given region. However, they typically compared trees grouped using qualitative or summary descriptors of sampling sites, disregarding the spatial and/or temporal variability in location-specific factors noted above that may affect growth of individual trees. This can hinder the extrapolation from individual sampling sites toward predicting effects across entire urban areas and tree stocks, especially when studies rely on labor-intensive methods, reducing sample sizes and coverage of species and space. A lack of co-located environmental variables (i.e., measured in situ or modelled) at pertinent spatial scales can further exacerbate these limitations, as noted, for instance, by Wohlfahrt et al. (2019) for air temperature and urban tree leaf phenology, which may lead to incorrect inferences and interpretations for the role of climate change on growth/productivity. Indeed, even neighboring trees’ exposure to excess heat may differ substantially as the urban fabric that controls their microclimate varies at short distances and across time (e.g., shading at low and high sun angles through days and seasons, extent and characteristics of sealed surfaces, etc.). It is thus likely that the varying and even contrasting growth responses observed for urban trees across and within studies are at least modulated by some confounding factors and exacerbated by lack of co-located environmental variables, making the attribution to a single driver, such as excess heat, more difficult and possibly less accurate.

These limitations could be overcome by developing extensive dendroecological surveys (i.e., incremental growth) and/or inventories (single or repeat) combined with environmental data with adequate spatio-temporal coverage and resolution. Environmental data such as surface and air temperatures from remote sensing products and process-based models have been used as a basis for assessing phenological and growth differences between and within tree species. For example, Dale and Frank (2014) and Locosselli et al. (2019) extracted remotely-sensed landsurface temperatures in the vicinity of manually sampled trees as a proxy for microclimatic conditions that influence growth. Similarly, Vo and Hu (2021) use remote sensing imagery with a downscaling modelling scheme to develop canopy-level temperature data for further analyses. Tree inventories are logistically and financially more feasible compared to extensive dendroecological surveys, and - together with environmental data - are increasingly more available (e.g. Ossola et al. 2020) due to open data policies and their value being recognized across domains for urban greenspace planning and adaptation (Hansen et al. 2019; Monteiro et al. 2020). Berlin, as one of the greenest cities in Europe, provides an openly accessible tree inventory, with spatio-temporal environmental data sets relevant to tree growth. It features nearly 700000 individuals covering 94 genera and some 600 species and/or cultivars, listing information on location, stem diameter (at breast height; \(DBH\)), and stem height, among other variables, for the majority of street and park trees. For this study, our objective was to assess the impact of excess urban heat, i.e. the UHI effect, on street tree growth (\(DBH\)) using this openly available inventory data set, complemented by additional open data sources with high resolution and good spatial coverage, as well as incremental growth data from tree cores. The assessment relied on flexible statistical models that could capture species and location-specific responses to heat and other urban factors. Specifically, we aimed to (1) assess heat exposure of the most abundant species; (2) determine the impact of (excess) heat on stem growth across tree age classes with a space-for-time substitution; (3) highlight the role of location-specific environmental factors in mediating temperature responses.

Methods

Study area

Berlin is one of the largest metropolitan areas in Central Europe (892\(\hspace{0.25em}k{m}^{2}\)) with a population of approximately 3.6 million, and a maximum extent of 38\(\hspace{0.25em}km\) in North-South and 45\(\hspace{0.25em}km\) in East-West directions. It is located in North-Eastern Germany, and lies in the temperate zone with warm-humid climate (Dfb) according to the updated Köppen-Geiger classification (Beck et al. 2018), with mean annual temperature of approximately 10 °C and precipitation of 575\(\hspace{0.25em}mm\) (Tempelhof weather station, DWD). Berlin features low relief (approximately 30\(\hspace{0.25em}m\) to 60\(\hspace{0.25em}m\) with 120\(\hspace{0.25em}m\) at solitary peaks), and is centered around a glacial outwash valley (sands, gravel), bordered by two plateaus consisting of glacial till and clay in the North-East and South, as well as sands in the South-West. The city provides extensive public green space covering around 30\(\hspace{0.25em}{\%}\) of its area (SUVK and Berlin 2019), with an extensive urban forest of nearly 700000 publicly-managed trees along streets, in parks and in riparian areas (Fig. 1).

Fig. 1
figure 1

Berlin’s generalized land-use derived from SUVK and Berlin (2019) and location within the European context (inset)

Data sources

An overview of data used for models, including sources, types, and application, is provided in Table 1, with detailed descriptions in the following subsections.

Table 1 Data description used for maps, visualizations and analyses. Resolution and radius are in \(m\), the latter is the buffer in which data was averaged around each tree. A zero-radius refers to a point extraction from categorical and location specific data. Polygons with radius data were rasterized to a resolution of \(5\hspace{0.25em}m\)

Street trees

Berlin’s open data provided tree inventories including species, age, location, and circumference which was transformed into diameter. Note that only street trees in urban, not rural areas or within green spaces, were considered here, but individual trees may grow along streets adjacent to green spaces and parks of varying sizes. Implausible observations, possibly from erroneous data entry, were removed. Additional manual data processing for quality control was done with a bespoke software datacleanr by Hurley et al. (2022), where obvious outliers or clearly interpolated data were removed. The latter was deemed necessary, as several observations in multiple city districts were derived by linear relationships (i.e., straight-line), which do not capture the ontogenetic growth dynamics of trees, and leave no variation related to variables other than age. All of these operations were recorded, and can be viewed and reproduced via the code linked in the Online Resource 1. Lastly, observations with unlikely diameter-age combinations were identified via the residuals of a generalized linear model between diameter and age with a Gamma log-link distribution: if individual residuals exceeded seven times the median absolute deviation of all residuals, they were removed. The median absolute deviation (MAD) is comparable to the inter-quartile range, yet more robust to outliers:

$$MAD=median(\parallel {X}_{i}-median(X)\parallel )$$
(1)

Similar cut-off approaches are frequently applied in allometric and trait databases. We considered this approach (i.e., \(MAD\cdot 7\)) conservative, and necessary given the presence of clearly erroneous data. The approach and impact thereof are visualized in the Online Resource 1 (Section Data quality control). Table 2 shows the binned distribution of genera across age classes. Final samples applied in models were smaller, following the availability of ancillary data for a given observation, and limited to a maximum age of 125 years to increase confidence in reported values, and ultimately model estimates.

Table 2 Binned age-distribution for street tree genera in Berlin data set, and entries missing age information

Temperature/UHI data

Temperature and UHI data were summarized temporally either by the provider or manually to provide a characteristic representation of (excess) heat during summer at different times (morning, afternoon/day, night), from which tree averages (radius of 150 m) were calculated (Fig. 2). Two urban air temperature (Berlin Environmental Atlas, UrbClim) and one surface UHI data set (MODIS) were tested as explanatory variables in generalized additive models (see Section GAMs). The air temperatures from the Berlin Environmental Atlas (EnvAt) are processed model outputs (FITNAH-3D; cf. Gross 1994) that are representations of typical summer conditions at 0400, 1400 and 2200 h at 2 m height; these data are provided at city block basis (spatial polygons), from which weighted averages were extracted after rasterizing (5 m resolution). The model was assessed by comparing microclimatic patterns from model outputs against two measurement campaigns for a representative location (greenspace – building transition), which show good agreement (see https://www.berlin.de/umweltatlas/klima/klimaanalyse/2014/methode/; only available in German). We decided to use the raw air temperatures from this data set, rather than calculating excess heat, i.e., an UHI measure, as the model outputs are limited to Berlin’s extent and thus do not cover extensive rural areas (see Eq. 2). UrbClim air temperatures are hourly model outputs (100 m resolution, 2 m height, De Ridder et al. 2015) based on ERA5 re-analyses data (ECMWF) for which observations from the hottest month available (June, 2011) were averaged to hours equivalent to Berlin Environmental Atlas (referred to as Berlin EnvAt) data by using a window of \(\pm \hspace{0.25em}1\hspace{0.25em}\)hour (i.e., 0300 to 0500, etc.). The model domain extends beyond Berlin’s city limits into adjacent rural areas. A land-use and land-cover mask (CORINE; European Union, Copernicus Land Monitoring Service 2018, European Environment Agency) was applied to define urban and rural/forested areas within and outside the city. Using this mask was deemed reasonable as Berlin’s built-up area has not changed markedly over the past 50 years, i.e., about 52 to 61% (Mohamed 2017). The excess heat was then calculated as

$$UH{I}_{x,y}={T}_{Ai{r}_{2 m}\hspace{0.25em}x,y}-\overline{{T}_{Ai{r}_{2 m}\hspace{0.25em}Rural}},$$
(2)

where \(T\) is temperature (\({}^{\circ }C\)) \(x\) and \(y\) define an urban grid cell. The model outputs have been extensively applied and validated for several cities (De Ridder et al. 2015; Lauwaet et al. 2016; Sarkar and De Ridder 2011; Zhou et al. 2016). The MODIS-derived surface UHI data set by Chakraborty and Lee (2019) (referred to as MODIS) estimates its measure in a similar fashion and the reader is referred to the detailed description therein. In brief, the underlying algorithm calculates the difference between urban and rural land surface temperatures for so-called urban clusters. The extent of urban areas is based on MODIS land-use data from 2002 (also considered unproblematic, see above). A filtering step in the algorithm ensures that UHI data are only generated for urban clusters where biased estimates are least likely (e.g., clusters excluded where no rural pixels were present, or where elevation differences between urban and rural pixels is greater than 50 m). Note this data set provides day and night-time averaged UHI estimates at 500\(\hspace{0.25em}m\) resolution, which were extracted for the hottest summer in this record (2007).

Fig. 2
figure 2

Overview of data sets used for assessing the relationship between temperature and tree growth, with data sources in rows and averaged time intervals in columns. Note the varying spatial coverage and resolution as noted in Table 1

Ancillary environmental data

Following the general approach described above, four ancillary covariates next to a temperature measure were employed in models; these were chosen due to their availability at high spatial resolution and coverage, and/or because their influence on growth was previously identified in literature or their likely impact could be deduced using ecophysiological principles. We included planting bed area and the sum of exchangeable basic cation as a proxy for soil nutrient availability (point extractions), adjacent building height (rasterized to \(5\hspace{0.25em}m\) resolution), as well as the proportional coverage of local climate zone 6 (LCZ6; open arrangement of low-rise buildings of 1 to 3 stories, pervious land cover, low plants and scattered trees, see Demuzere et al. (2019) and Stewart and Oke (2012) for details). Local climate zones integrate multiple characteristics of the urban fabric into individual categories and are widely available in products such as the WUDAPT dataset (Demuzere et al. 2019), easily accessible for researchers and practitioners. We selected this class for two reasons: (1) abundance – LCZ6 shows the greatest spatial coverage when considering the averaging radius we applied around trees and thus we were able to retain more observations for subsequent analyses; (2) previous research indicated that street tree growth for T. cordata within medium housing density in Berlin varied most, which was related to strongest sensitivity for growing conditions or exposure to most extreme conditions (Dahlhausen et al. 2018), and thus some degree of relationship was to be expected with the LCZ6 class.

General approach: space-for-time analyses

We modeled the stem diameter (\(DBH\)) of Berlin’s ten most abundant species (contingent to ancillary data availability) in relationship to their location, age, a measure of (excess) heat (UrbClim by De Ridder et al. 2015; Berlin Environmental Atlas models; MODIS-derived surface Urban Heat Island by Chakraborty and Lee 2019), and additional environmental covariates with generalized additive models (GAMs, see Section GAMs for details). Covariates were extracted at 150 and 300\(\hspace{0.25em}m\) to infer the impact of reference scale of the urban fabric on tree growth. From all tested models the most suitable (i.e., parsimonious with highest explanatory power) was employed for further analyses.

Dendrochronological sampling

To contextualize tree growth patterns between age groups derived from Berlin’s inventory data, we drew upon a recently established data set from Schneider et al. (2021), who sampled several common tree species across a rural-urban gradient. For our purposes, we grouped trees sampled in parks, green spaces and along streets into a single urban category, and focused analyses on these (Table 3). Note that Schneider et al. (2021) refer to these sampling locations as sub-urban and urban. Their analyses showed strong relationships between ring-width indices, climate parameters and a measure of urbanity derived from the proportion of sealed surfaces and building fraction. Thus it can be derived that the Berlin city tree-growth data applied here had a sufficient “urban imprint” relevant for our analyses. In brief, the data was derived as follows. Two to three cores were extracted at breast height from each tree. These were then prepared using standard dendroecological methods (i.e., mounting, sanding, measuring), and cross-dated with TSAP-Win and COFECHA (Holmes et al. 1986), producing mean tree series of incremental growth. Additionally, cambial age of each increment was established by counting years from the inner most ring at the pith (\(a=0\)) outward; on tangentially bored cores, missing rings to the pith were estimated.

Table 3 Overview of urban sampling locations and species sample sizes (n). Individual trees were sampled two or three times to obtain a mean-tree ring width series

Statistical analyses

GAMs

We applied hierarchical generalized additive models (GAM) to estimate the relationship of several covariates with stem diameter growth (\(DBH\)). GAMs, as an extension of generalized linear models (Wood 2017), allow modeling response variables as parametric and non-parametric combinations of smoothed explanatory covariates, and can assume non-normal response distributions. These smooths are constructed by summation of base functions of varying complexity and form, analogous to scatterplot smoothing (Hastie and Tibshirani 2017), which provides a high degree of flexibility, ideal for fitting ecosystem dynamics which are rarely linear (Pedersen et al. 2019), or correctly represented with deterministic functional forms (e.g. quadratic equations). In general, a GAM can be written as:

$$E\left(Y\right)\hspace{0.25em}=\hspace{0.25em}{g}^{-1}\left({\beta }_{0}+\sum \limits_{i=1}^{n}{f}_{i}\left({x}_{i}\right)\right),$$
(3)

and

$$y\hspace{0.25em}=\hspace{0.25em}E\left(Y\right)+\varepsilon,$$
(4)

where \(Y\) is taken from an appropriate distribution and corresponding link function \(g\), \({\beta }_{0}\) is the intercept and \({f}_{i}\) represents a smooth function of a predictor (Pedersen et al. 2019), and \(\varepsilon\sim\mathcal N(0,\sigma^2)\). Nested data structures (e.g., city districts) can be accounted for by introducing random effects, while spatial dependence between observations can be accounted for by constructing smoothing functions with, for instance, northings and eastings (cf. Wood 2017). All models were implemented in R (Core Team R 2021) using functions available in the package mgcv (Wood 2017).

Dendrochronological analyses

We assessed trends in annual growth dynamics of urban trees across the 20\({}^{th}\) century for 1920–1960 and 1961–2001 (similar temporal grouping as Dahlhausen et al. 2018 and Pretzsch et al. 2017). Given the small sample sizes across species and potential for site-specific differences, we grouped all individuals and focused on inferring age-dependent growth patterns for these trees in general. We used a hierarchical GAM implemented in mgcv::gamm() to leverage auto-correlation structures made available through the package nlme (Pinheiro et al. 2021) to account for individual variation across sites and species. Annual growth was modeled as:

$$g\left(\varDelta {r}_{i}\right)=f\left(yea{r}_{i}\right)+{f}_{j}(cambial\hspace{0.25em}ag{e}_{i})+{c}_{i,j}+{\tau }_{i,k}+{e}_{i}$$
(5)

where \(g\left(\right)\) is a log-link for \(\varDelta r\sim Gamma\), \(\varDelta r\) is the annual radial increment for observation \(i\). A global temporal (by year) and time-dependent (\(j\), \(\le 1960\) or \(>1960\)) trend in cambial age were implemented with thin plate regression splines (default smoothing function in mgcv); \({c}_{j}\) is a time-group dependent intercept, while \(\tau\) represents a matrix of random effect coefficients for \(k\) species to account for differences in growth patterns, and \(e_i=\varphi e_{i-1}+\varepsilon_i\). A \({3}^{rd}\)-order autocorrelation-moving average (ARMA) correlation structure was applied (i.e., \(\varphi (3,1)\)) to account for the dependency of \(\varDelta r\) across years for each tree, as is frequently the case for tree growth (e.g., see Fritts and Swetnam 1989). Similar model structures, although not based on GAMs, have been applied by, for example, Pretzsch et al. (2017). The detailed implementation for this model is given in the code linked to in the Online Resource 1. \(\varDelta r\) was then derived for a range of cambial ages, and averaged for both time groups, allowing a comparison of recent to earlier growth. We acknowledge that tree cores obtained at breast height do not represent absolute tree age. However, here they serve as a proxy for growth between young (⪆ 1960) and older individuals to contextualize growth patterns inferred from the larger-scale tree inventory.

Stem diameter model development and selection

We focused the analyses on the ten most abundant species after data quality control with a minimum of 10000 observations to ensure greatest possible spatial coverage and to increase confidence in estimates, totaling 218329 trees, with Table 4 showing species-specific sample sizes.

Table 4 Ten most abundant species and total sample size (n) for which GAM models were developed. Note that ancillary data availability across variables determined final sample sizes in individual models

The diameter (\(DBH\)) of these species was modeled using GAMs as follows:

$$g\left(DB{H}_{i}\right)=f({x}_{i},{y}_{i})+{f}_{j}\left(ag{e}_{i}\right)+{f}_{j}(tem{p}_{t,i},ag{e}_{i})+f\left(covariat{e}_{i}\right)+{c}_{i,j}+{\tau }_{i,k}+{\varepsilon}_{i}$$
(6)
$$g\left(DB{H}_{i}\right)=f({x}_{i},{y}_{i})+{f}_{j}\left(ag{e}_{i}\right)+{f}_{j}(tem{p}_{t,i},ag{e}_{i})+{c}_{i,j}+{\tau }_{i,k}+{\varepsilon}_{i}$$
(7)
$$g\left(DB{H}_{i}\right)=f({x}_{i},{y}_{i})+{f}_{j}\left(ag{e}_{i}\right)+f\left(covariat{e}_{i}\right)+{c}_{i,j}+{\tau }_{i,k}+{\varepsilon}_{i}$$
(8)
$$g\left(DBH_i\right)=f(x_i,y_i)+f_j\left(age_i\right)+c_{i,j}+\tau_{i,k}+\varepsilon_i$$
(9)

where \(g\left(\right)\) is a log-link for \(DBH\sim Gamma\), and \(i\), \(j\) are indices for observations and species, respectively, and \(t\) refers to an (excess) heat measure from UrbClim, Berlin EnvAt or MODIS at different times (morning, afternoon/day, night; see Section Temperature/UHI data); \(c\) is a species-dependent intercept, while \(\tau\) represents a matrix of random effect coefficients for \(k\) districts to account for differing management regimes across the city. A global spatial smooth \(f({x}_{i},{y}_{i})\) (representing projected coordinates in UTM) via a Gaussian process (cf. p. 242 in Wood 2017) was included to account for the spatial structure of observations, which reduced auto-correlation of model residuals considerably (see Online Resource 1). These were compared to a sub-set of models without a spatial component (cf. Eq. (7), and Fig. 4 but not further discussed there). We also tested a suite of models without the spatial smooth for comparison.

Furthermore, we implemented the interaction between temperature and age (i.e., \({f}_{j}(temp,age)\)) as tensor smooths (Wood 2006) to account for the different variable scales (i.e., units); all models were also tested without this interaction using a thin plate regression spline smooth for temperature (not shown in equations above). The functions \({f}_{j}\) are for species-specific smooths (i.e., with individual smoothness penalties and functional shapes as detailed by Pedersen et al. 2019). The covariates for planting bed area and soil nutrient availability were log-transformed to account for their skewed distribution, improving the estimation of coefficients for their respective basis functions. Note, that Eq. (9) was considered as the appropriate null model for interpretations, and a model including all covariates and the temperature-age interaction was tested as a saturated model. Models were implemented with mgcv::bam() (Li and Wood 2020; Wood et al. 2017) and readers are referred to the detailed implementation in the code linked to in the Online Resource 1. Considering all combinations of (excess) heat measures and covariates (with point, as well as 150/300\(\hspace{0.25em}m\) extractions), a total of 158 models were applied. With these models, we derived age and species dependent \(DBH\) averages across a temperature measure from predicted values in 5-year age groups starting at 30, 45, 60, 75, 90

Model selection was based on explained deviance (similar to explained variance) and the Aikake’s information criteria (AIC). The best model, considered having the highest explanatory power (highest explained deviance) and simplest structure (lowest AIC), was chosen for final analyses. Explained deviance assesses model fit in generalized linear modelling similar to explained variance in ordinary least squares modelling, but relies on a generalized form of residuals, i.e., deviance residuals (see Wood 2017). Deviance for the Gamma distribution is calculated as

$$D(y,\widehat{\mu })=2\cdot \left\{\frac{y-\widehat{\mu }}{\widehat{\mu }}-log\left(\frac{y}{\widehat{\mu }}\right)\right\}$$
(10)

where \(y\) is the observed and \(\widehat{\mu }\) the expected value.

As the spatial extent and coverage varied between temperature and ancillary data, more complex models (and specifically those including planting bed area and MODIS temperatures) typically also had fewer total observations. While this prevented a full comparison with information-based model selection criteria like AIC, the appropriateness of models that differed only in their implementation of the temperature-age interaction (i.e., \({f}_{j}\left(tem{p}_{i}\right)\)\(vs.\)\({f}_{j}(tem{p}_{i},ag{e}_{i})\)) could be assessed. For this reason, the suite of developed models presented above were limited to comparatively simple structures (i.e., few terms, interactions and restricted number of basis functions), reducing the potential for choosing over-fitted models without formal comparison. AIC is calculated as:

$$AIC=-2log\left(L\right)\cdot 2k$$
(11)

where \(L\) is the maximum likelihood estimate, and \(k\) is the number of parameters. Where any two comparable models had a \(\left|\varDelta AIC\right|>10\), we considered the model with the lower score more suitable. We chose to carry out the analysis in its current form rather than on considerably smaller but comparable sample sizes across models, to identify the strongest relationships in the existing data. This allowed us to highlight the utility of the approach per se and for Berlin in particular. Lastly, we also report the root mean squared error (\(RMSE=\sqrt{\frac{1}{n}\sum _{i=1}^{n}({y}_{i}-{\widehat{y}}_{i}{)}^{2}}\)) and mean absolute error (\(MAE=\frac{1}{n}|{y}_{i}-{\widehat{y}}_{i}|\)) for the final model.

Results

Growth trend dynamics

Recently established annual, incremental growth is on average greater as compared to earlier times (i.e., prior 1960; Fig. 3). The contrast is strongest in the first 30 years of cambial development, with clear indication that averages are statistically different up to approximately 22 years (cf. overlap of 95\(\hspace{0.25em}{\%}\) confidence intervals). The predicted trajectories for both periods follow typical ontogenetic patterns, yet differ in shape. This may be due to inaccurately estimated cambial ages, explaining the largely monotonic decrease for recent growth due to missing the typical (but not always present) initial rise and fall in pith-near stages. This would inadvertently create a left-shift in Fig. 3 for recent growth. Assuming that to be the case, the actual difference in average rates would be greater than presented here.

Fig. 3
figure 3

Rates for annual incremental growth differ between recent and earlier times, as predicted by a hierarchical GAM (see Section Dendrochronological analyses). Thick lines and bands are for mean and 95\(\hspace{0.25em}\%\) confidence interval of all annual predictions across a time group (fine lines). On average, early stages of recent growth (up to approximately 22 years) exceed that of earlier periods discernibly

Model selection

The null models containing only age to explain diameter captured a large portion of deviance. Including additional covariates thus resulted only in comparatively small increases of predictive skill. The inclusion of temperature (additive structure) and temperature-age interactions improved the predictive skill, with interactions reaching higher explained deviance (i.e., skill). Models including the MODIS-derived UHI measures performed best on average, where models containing \(LCZ{6}_{\frac{}{150\hspace{0.25em}m}}\) or all covariates (saturated model) did best (Fig. 4), which also holds true for other temperature/UHI measures. Note that non-temperature models containing planting bed area and \(LCZ{6}_{\frac{}{150\hspace{0.25em}m}}\) alone outperformed several of those including other temperature measures and covariate combinations. This indicated the importance of both covariates, with \(LCZ{6}_{\frac{}{150\hspace{0.25em}m}}\) also represented in the best-performing model, corroborating its impact on stem diameter development. As \(LCZ{6}_{\frac{}{300\hspace{0.25em}m}}\) only marginally increased performance compared to the null model (age only), the reference scale for the impact of the urban fabric (i.e., LCZ6) likely has a limit at \(<300\hspace{0.25em}m\). Nutrient availability, similar to \(LCZ{6}_{\frac{}{300\hspace{0.25em}m}}\), also only improved model performance marginally. Building height, regardless of its reference scale (i.e., 150 vs. 300 m), as another proxy for urban fabric, showed minimal to no improvement compared to the age-only null model.

Differences in sample sizes between model structures due to covariate and/or temperature data coverage were considerable (e.g., planting bed area \(vs.\) soil nutrient models for UrbClim UHI). Thus, we considered the model with the overall greatest predictive skill and largest sample for further investigations, namely \(LCZ{6}_{\frac{}{150\hspace{0.25em}m}}\) for MODIS derived UHI. For this variable combination, the model structure with age and temperature interaction also had a lower AIC (\(AIC\hspace{0.25em}=\hspace{0.25em}8.95\cdot {10}^{5}\)) compared to the additive one (\(AIC\hspace{0.25em}=\hspace{0.25em}8.98\cdot {10}^{5}\)).

Fig. 4
figure 4

Overview of model fit, expressed as explained deviance, for all tested models. The Panels distinguish between models without (top) and with (bottom) spatial smooths to account for autocorrelated residuals. Colored areas reflect (inclusion of) temperature and (excess) heat measures. Symbols identify models with an interaction between temperature and age (diamond), and without (circle), while symbol colors highlight covariates and their reference radius (i.e. 150/300\(\hspace{0.25em}m\), but also note x-axis labels). Symbol size indicates the model’s included observations (i.e., sample size). Generally, including temperature (interactions) improves model fits above the null model (age only) and other non-temperature models. However, note the importance of planting bed area and LCZ6 cover is apparent even without including temperature. The MODIS-derived UHI measures provide the best fit on average, with the model including a temperature-age interaction and LCZ6 cover scoring highest overall

The final model, comprising a temperature-age interaction, the covariate \(LCZ{6}_{\frac{}{150\hspace{0.25em}m}}\) and MODIS-derived UHI following Eq. (6), showed good agreement between predicted and observed \(DBH\) for the majority of observations (Fig. 5; \({R}^{2}\hspace{0.25em}=\hspace{0.25em}0.79\)) resulting in a fairly low MAE of \(6.12\hspace{0.25em}cm\), and a RMSE of \(1.2\cdot {10}^{-9}\hspace{0.25em}cm\). While some prediction errors are large (spread in Fig. 5), the predictions are unbiased (see correspondence of 1:1 and least-squares fit in Fig. 5). Thus, we consider using species and age-group averages as appropriate for identifying and discussing relationships between \(DBH\) and other covariates.

Fig. 5
figure 5

Predicted \(vs.\) observed data for the best model according to explained deviance and AIC comparison, based on the MODIS-derived UHI measure (Chakraborty and Lee 2019) for average summer conditions (2007) and LCZ6 (open to mid-rise, Stewart and Oke 2012). Hexagons and colors represent x-y bins (i.e., in the observed-predicted space) and their counts (N), and the red line is a least-squares fit. The model captures the mean response well, as indicated by lighter colors along the dashed 1:1 line, and its close approximation by the least squares fit, as well as model metrics (see text label in figure)

Urban heat and urban fabric effects on stem growth

Relationships derived via the GAM in Eq. (6) with LCZ6 as covariate show fairly consistent responses across age groups considering trajectories, yet they differ somewhat in shape (Fig. 6) and considerably in absolute magnitude (i.e., slope in Fig. 7). Generally, younger age groups (30–35) showed increased growth (strongest for A. hippocastanum, T. intermedia, T. intermedia ‘Pallida’), or little to no impact (Acer platanoides, Acer pseudoplatanus), with stronger UHI magnitude. At older ages, this trajectory gradually shifted toward decreasing stem diameter at higher temperatures, with T. euchlora, A. pseudoplatanus and P. acerifolia having the most negative relationships. Notably, Platanus acerifolia showed an earlier reversal of sensitivity at age group 45–50 as compared to other species. This may be due to the stronger upward curvature towards the estimates’ lower temperature limit (see respective panel in Fig. 6). Although the uncertainty around this edge is comparatively well-constrained, the low number of observations at the end of the continuum does call for caution for further interpretation.

The distribution of individual street trees along the UHI continuum, evident in the density estimates in Fig. 6, show that the majority of species investigated here are subjected to intermediate to high UHI levels. Trees in younger age groups are typically further toward the upper end of this continuum. Several species - including A. platanoides, A. pseudoplatanus, Acer hippocastanum, Quercus robur, and Tilia euchlora - have good coverage across the entire UHI continuum (although skewed toward positive values), which increases the confidence in the observed trajectory shift from young to old. Contrastingly, the cultivar Tilia intermedia ‘Palllida’ is predominantly distributed along the upper end of the continuum, resulting in greater uncertainty in the estimated relationship across age groups and UHI magnitudes.

Fig. 6
figure 6

Age-group averaged relationships between the MODIS-derived UHI and stem diameter for the ten most abundant species from the best model (i.e., Eq. (6) with LCZ6 as covariate). Note that estimates were derived with \(LCZ{6}_{\frac{}{150\hspace{0.25em}m}}=0.5\), and that y-axis scales differ between panel rows. The colored lines and bands are the averaged GAM fit, and gray lines are least-squares estimates fitted to these averages. Estimates were only generated within temperature ranges that a given species experienced. Density plots (scaled to match 100\(\hspace{0.25em}cm\) on the y-axis) show the distribution of individual trees along the urban heat continuum, aiding both with interpreting uncertainties (band width, “wiggliness”) and assessing species’ exposure to urban heat. A general shift from increased or stable growth with temperature at younger ages to a decrease for older trees is apparent

Fig. 7
figure 7

Temperature sensitivity of diameter growth centered on species-specific means for comparability and to aid interpretation of inter-specific differences. Lines and bands are from ordinary least squares fits on the GAM estimates in Fig. 6 (A) and the corresponding coefficients (slopes, B)

The chosen model also highlighted that, on average, trees growing in less urbanized environments (i.e., more open to mid-rise urban land cover; \(LCZ{6}_{\frac{}{150\hspace{0.25em}m}}\)) achieve greater diameters (Fig. 8). Note, that planting bed area, similar to \(LCZ{6}_{\frac{}{150\hspace{0.25em}m}}\), was identified as a covariate increasing explained deviance, both in models with and without a temperature measure, and represents an opportunity for further investigation, especially following additional data collection.

Fig. 8
figure 8

Age-group averaged relationships between the LCZ6 and stem diameter for two \(Tilia\) genera from the best model (i.e., Eq. (6) with MODIS-derived UHI as excess heat measure). LCZ6 cover was derived as a weighted average within a radius of 150\(\hspace{0.25em}m\) around each tree. Note the identical shape, but difference in intercept as a result of the model specification with a global smooth for the non-temperature covariate for all species (see Section Stem diameter model development and selection); the UHI magnitude was fixed at \({3\,} ^{\circ }C\) where most trees are present across age groups. The model identified a steady increase of average stem diameter with open to mid-rise urban land cover

Discussion

Heat, environmental and urban controls on tree growth

Demographic patterns

We showed that initial growth rates (⪅30a) are greater in recently established trees, as compared to older ones. This pattern was found in annually-resolved incremental growth observations, and corresponds with prior work on urban-rural (e.g., Pretzsch et al. 2017; Zhao et al. 2016) and intra-urban comparisons (e.g., Dahlhausen et al. 2018), where it was attributed to increased temperatures - namely, the UHI effect. Furthermore, the space-for-time substitution highlighted that recently established, i.e. younger, trees have greater (positive) sensitivity to heat than intermediate age groups, while the oldest trees showed lower average absolute growth (diameters) with increasing heat. We are not aware of other studies reporting such an age-dependent shift in sensitivity. However, the consistency of this pattern across species provides a satisfactory level of confidence in it, and indeed, Dahlhausen et al. (2018) also report a time - and by implication an age-dependent - shift of temperature sensitivity on growth depending on urban development intensity, but do not further distinguish between age groups. The age-dependent shift may be explained by a combination of physiological and environmental changes over time, and/or may be related to management practices, but we acknowledge that additional analyses is required to disentangle these processes. For instance, higher temperatures may have benefited physiological processes close to their respective optima in both young and old individuals. But, as older trees must support greater leaf area through rooting in similarly small (restricted) soil volumes, water availability and nutrients may become increasingly limiting. Higher rates of transpiration and photosynthesis, driven through temperature increases, could thus turn detrimental as the ratios between soil volume, root and leaf area change throughout a trees life span and in time. In addition to this, younger trees are typically more rigorously managed and irrigated (e.g., Koeser et al. 2014), which may (further) alter age-dependent temperature sensitivity. We lack data on demographic differences in management to confirm this, and acknowledge that younger trees here also surpassed their establishment phase, which entails high mortality during the first five years after planting (Harris and Day 2017; Hilbert et al. 2019) during which the above would not apply.

Intra-specific differences

Generally, the ten investigated species and cultivars showed consistent responses in sign, but differed substantially in magnitude, modulated by age. Aesculum hippocastanum, Tilia intermedia, T. intermedia ‘pallida’ and showed greatest absolute growth at high temperatures at younger (30–35) as well as older ages (60–65). The drought tolerance and performance under high temperatures is well documented for the latter two and features in tree selection frameworks, such as species climate matrices in Roloff et al. (2009) and in Brune (2016), where several studies investigating (hydro)climatic sensitivity were summarized. Contrastingly, Acer pseudoplatanus showed decreased absolute growth with higher temperatures; this corresponds with their classification as not very suitable according to Roloff et al. (2009). Platanus acerifolia and Acer platanoides are considered suitable and very suitable (Roloff et al. 2009), yet show the greatest decline with temperatures here. Indeed, several studies reviewed in Brune (2016) highlight A. platanoides as drought-tolerant. Such contrasting indications are also given for A. pseudoplatanus, and even report A. hippocastanum as only moderately tolerant, despite performing best here. These differences may be due to idiosyncrasies of planting sites, such as low soil volume and increased potential for water stress, (long-term) interactions with pathogens and herbivores (e.g., Meineke and Frank 2018), or perhaps related to data quality. Most important, these differences highlight the need for assessments of performance (e.g., as growth) under location-specific conditions, and this study provides a first attempt and insights into temperature sensitivity at the scale of an entire metropolis.

Utility, transferability and limitations of approach

The major strength of this work is the estimation of growth sensitivity of urban trees to heat and other urban conditions at city-scale with little data and processing power requirements. In particular, differences within and between species across several dimensions (depending on model specification) are readily explored where only tree locations, age, diameter (or any other target) and selected variables are needed to implement this approach. Indeed, due to the high temporal and spatial variability of controls on growth, and thus variability in responses, such assessments should be developed specifically for a given setting, as well-understood tree characteristics (e.g., Roloff et al. 2009), could be strongly mediated at a given location or time. For instance, if drought hardiness is related to extensive root networks, the limited soil volumes available to street trees could render a species vulnerable to water stress. In addition, studies on climatic suitability and physiological responses are difficult to compare due to local acclimation of species to controls on growth and performance, as well as different methodological protocols (e.g., see Brune 2016). Thus, the approach followed here, with its comparatively simple data requirements, can be implemented in a variety of settings to gain initial and/or guiding insights on tree performance in a specific urban region, and also serve as a benchmark to compare other empirical estimates or mechanistic models with. It may even allow estimating tree performance under future conditions to some extent (e.g., considering increase in temperature alone for this study), but is of course contingent on data coverage (e.g., demographics, environmental gradients). Furthermore, the implementation of GAM models here with mgcv::bam() is computationally efficient (Wood et al. 2017), making model testing, selection and final application a rapid process even for abundant observations (here \(n>150000\) and processing times of \(t<60\hspace{0.25em}s\) on a desktop computer for the selected model). Future research may focus on collecting additional data (e.g., increasing the coverage on planting bed area) and subsequently deriving species-specific smooths for ancillary environmental covariates.

Two noteworthy caveats of the approach followed here lie in its empirical, data-driven nature, and its dependence on data quality. The presented results here are correlative. While their interpretation follows first principles and may also allow anticipating the performance of a given tree under a range of increased temperatures, insights derived from mechanistic understanding are indispensable. This is because a change in one environmental condition, e.g., temperature or pest abundance, may also entail alterations to other drivers, and ultimately, affect physiological processes in ways not yet captured in existing data. Furthermore, the data provided in Berlin’s tree inventory required extensive quality controlling, and it is likely that inaccurate (diameter, age) or interpolated (diameter) observations were including in the analyses. However, the minimum requirement of 10000 observations per species, and the focus on average diameter responses across age groups decreased potential impacts of such observations to a degree that was considered satisfactory, while uncertainty is communicated effectively and transparently (i.e., larger confidence bands for T. intermedia ‘Pallida’). Further, the utility and transferability of the approach per se were considered of value to the community and practitioners.

Implications

Tree species selection for urban forests and urban greening depends on several often competing factors - such as aesthetic appeal and ecosystem service provision (e.g., Roman et al. 2020) - and has been focused on identifying trees suitable for current and future conditions for the last decades with increasingly strong evidence basis (e.g., Hirons and Sjöman 2019). Here, biogeographical approaches aim to match natural (i.e., non-urban) species distributions within current bioclimatic envelopes with future conditions at a planting site (e.g., Broadmeadow et al. 2005). Watkins et al. (2020) used such “climate matching” to identify the space along environmental dimensions (heat, precipitation) which urban regions inhabit, falling within or extending the natural envelope. Our work extends this approach by identifying the heat space (i.e., UHI range) occupied by several key species within a city. This not only allows identifying species-specific and intra-specific (age-dependent) sensitivities to urban conditions, but can also inform how age cohorts may respond as temperatures at a given location shift along the present continuum through time, while accounting for the impacts of additional urban conditions (e.g., management, disturbance, urban fabric). Notably, increased growth rates at higher temperatures, as reported here, also entail greater demographic turnover through mortality (Smith et al. 2019). Thus, as the majority of individuals were situated at the upper end of the temperature continuum, high temperature sensitivity may entail additional management due to faster turn-over and potentially subsequent replacement by more appropriate species. In addition, our approach can allow managers and planners to identify potential scale-limits of the urban fabric (e.g., \(LCZ{6}_{\frac{}{150\hspace{0.25em}m}}\hspace{0.25em}vs.\hspace{0.25em}LCZ{6}_{\frac{}{300\hspace{0.25em}m}}\)) on tree performance when altering surroundings or planting in a given area; efforts should be directed toward identifying the most pertinent environmental covariates and how their influence scales in space. The insights derived from this study, and potentially others following this approach in the future, can therefore add another dimension to tree selection approaches implemented down to individual trees and locations. It thus complements quantitative and qualitative tools, such as the climate matching tool, ecological site classification or species-climate matrix (see Hirons and Sjöman (2019); Roloff et al. (2009); Brune (2016); and references therein).

Finally, open data was key for this study. We urge authorities to implement FAIR (findable, accessible, inter-operable and reusable) data principles to enable work similar to ours, as well as the creation of larger-scale, multi-temporal databases and/or execution of synoptic studies (e.g., Ossola et al. 2020). This also includes the collection and provision of standardized (meta) data for quality control and to improve statistical inference.

Conclusion

We showcased the application of a GAM-driven analyses to identify the sensitivity of urban tree growth to environmental and urban drivers in a space-for-time framework. The extensive data coverage (tree inventory and covariates) allowed us to account for location-specific influences and identify the factors with strongest impacts on growth potentials (i.e., maximum diameter) for individual species. The identified age-dependent shift of temperature sensitivity, which was fairly consistent across species, as well as inter-specific differences thereof, will inform and guide further research. Notably, increasing the confidence in and coverage of data collection on less abundant species is key to maximize the potential of the approach developed here. Our results are a contribution toward Berlin’s current and future management of its tree stock and may help drive adaptation to climate change. Despite being a case study for a single city, we believe our work may provide a flexible approach for other cities with available or growing inventories, as well as ancillary environmental data, and may also inform the use of other planning tools, such as species-climate matrices (Roloff et al. 2009) regarding temperature sensitivity.