1 Introduction

The phenomenon of an urban heat island (UHI), consisting of a significant increase in urban area’s temperature with respect to the surrounding suburban and rural neighborhoods, has been known and studied since the nineteenth century (Landsberg 1981; Fortuniak and Kłysik 2008). In the early phase of those UHI studies, researchers concentrated on ground-based meteorological data, with focus on a comparative analysis of those data (Arnfield 2003; Stewart 2011). Oke (1979) determined that UHI (or atmospheric UHI) can also be defined for the urban canopy layer (UCL) and urban boundary layer (UBL). The UCL observations contain in situ measurements or data from sensors mounted on mobile platforms (using cars and bicycles). The UCL consists of air volume which is contained between urban buildings and other rough elements. In the case of UBL (i.e., a layer above UCL), the data comes from instruments mounted on special platforms, balloons, or aircraft (Voogt and Oke 2003). The air in the UCL is affected by the surroundings, mainly artificial surfaces, materials, and their geometry (in the microscale). In considering UBL, the local or mesoscale concept can be discussed. The UBL becomes part of the planetary boundary layer whose characteristics are modified by the actual presence of an urban area as its lower boundary (Oke 1976).

So far, it has been found that the effect of UHI is best marked in the summer and warmer part of the year (Oke 1982; Kłysik and Fortuniak 1999; Philandras et al. 1999; Morris et al. 2001), while in a daily cycle, the highest intensity of UHI occurs at night (Ripley et al. 1996; Jauregui 1997; Magee et al. 1999; Montavez et al. 2000; Tereshchenko and Filonov 2001; Dousset and Gourmelon 2003). The intensity of UHI decreases as cloudiness and wind speed increase (Oke 1982; Voogt 2004) and is greatest during anticyclonic conditions (Oke 1988; Morris and Simmonds 2000; Arnfield 2003).

There is also a clear relationship between city size and the magnitude of UHI—the latter is linearly correlated with the population logarithm. In large American cities during favorable weather conditions, the intensity of UHI can exceed 12 °C while in European cities it often reaches 10 °C (Oke 1973). In Rotterdam, the highest maximum UHI is found in late spring and summer with 95 percentile values from 4.3 °C to more than 8 °C (Steeneveld et al. 2011; van Hove et al. 2015).

In Poland as well, the atmospheric UHI is a common occurrence, and this phenomenon has been thoroughly investigated with in situ measurements among most of the biggest Polish agglomerations, providing some fairly specific conditions in each of the analyzed locations. For instance, in Wrocław, the average UHI intensity is ca. 1.0 °C (Szymanowski 2004). The intensity of UHI in Kraków has decreased since 1990 while remaining higher in the cold half of the year (0.5–1.0 °C, Bokwa 2010). The maximal intensity of UHI in Kraków exceeds 8 °C. Kaszewski and Siwek (1998) found that the maximal intensity of UHI in Lublin is around 3 °C which occurs in summer during night hours. Stopa-Boryczka et al. (2001) show that the maximal intensity of UHI in Warsaw occurs in summer and may reach as much as 10 to 12 °C.

A traditional point-wise character of investigating UHI does not allow for recognizing a spatial structure of this phenomenon in every detail. Therefore, nowadays, more and more studies try to characterize the UHI phenomenon using remote sensing data. The first publication based on satellite data was by Rao (1972). Since then, more and more analyses have been based on remote sensing data in urban climate studies. Kawashima et al. (2000) used Landsat 5 TM data to investigate relations between surface temperature and air temperature in the central part of Japan during three winter nights. Voogt and Oke (2003) pointed out that the UHI observed from thermal remote sensing data is none other than the surface urban heat island (SUHI). In the North China Plane, an attempt at air temperature retrieval from Moderate Resolution Imaging Spectroradiometer (MODIS) data was made by Sun et al. (2005). The coefficient of determination of the linear model in predicting the air temperature based on MODIS data is about 0.80. In Madrid, based on SEVIRI and MODIS data, the maximal intensity of SUHI in the summer months can reach 5 °C (Fabrizi et al. 2011), while in Rome, it is 3 to 4 °C during nighttime and negative or almost zero SUHI intensity during daytime (Fabrizi et al. 2010).

Specific local issues may arise while using remote sensing techniques to recognize the local features of SUHI. Dobrovolny (2013) studied the SUHI in Brno (Czech Republic) in terms of land cover categories, vegetation cover, and building density. Schwarz et al. (2011) presented several SUHI indicators obtained from various remote sensing data: difference between urban and agricultural areas, urban and water surfaces, hot island area (area with land surface temperature (LST) higher than the mean plus one standard deviation), or micro-SUHI (as a percentage of the area with LST that is higher than the warmest LST associated with tree canopies). Bechtel et al. (2012) indicated that phenological conditions may affect correlation calculations, e.g., between LST and normalized difference vegetation index (NDVI). They suggested that the fraction of active vegetation is a better predictor of LST than NDVI. Schwarz et al. (2012) calculated the mean LST values for selected land classes, and their results showed that the highest mean difference between evening and morning LST occurred in industrial areas (industrial units, roads, railways, and associated lands).

Keeping the aforementioned in mind, the main objective of this paper is to characterize the spatial variability of the SUHI and UHI in Poznań based on Landsat 5 TM images and in situ measurements. The next objective is to build a statistical model for estimating air temperature at a height of 2 m a.g.l. directly from the Landsat 5 TM data.

2 Study area and data

2.1 City of Poznań

Poznań (52° 18′–52° 30′ N, 16° 48′–17° 04′ E) is the fifth biggest city in Poland with a population of over 554,000 and an area of 261.91 km2 (Fig. 1). The city of Poznań is located in the Wielkopolskie Lakeland (Kondracki 2002) on an elevation from 50 to 154 m amsl. More than 58 % of its territory is in the plateau areas (higher than 80 m amsl), about 7% is in the Warta River valley, and less than 35 % in the upper terraces of the river. The Warta River flows through the city from south to north for a distance of 22.8 km. Within the administrative boundaries of Poznań, there are also natural (glacial origin) and artificial lakes, covering altogether 1.9 % of the city area.

Fig. 1
figure 1

Location of the study area within Poland (a), Landsat 5TM scene (band 4 near-infrared) with administrative border of Poznań (red line) (b), and location of measurement points and types of land cover according to Corine Land Cover 2006 (c)

Urbanized areas comprise 43 % of the territory of Poznań including residential (25 %), transport (13 %), and industrial (5 %) areas. The urban structure is dominated by fragmented and diverse forms of development that are distinguished by a concentration of large areas of modernist housing estates (20 % of the city’s invested area) and inner-city neighborhoods and areas with single-family housing (45 % of the city’s invested areas). The green urban areas were created as a wedge-ring system that consists of two major components: four green wedges that radiate out from the center to the borders of the city and a second one that consists of three concentric rings. In Poznań, forest cover amounts to 13.7 % (Central Statistical 2012).

Poznań is located in the transition zone between continental and oceanic temperate climate zones with mainly oceanic influences. The average annual temperature in Poznań is 8.3 °C, with the highest in July (18.1 °C) and the lowest in January (−1.6 °C) (Woś 2010). The average annual precipitation is 517 mm (as measured in the period 1951–2000), with the highest precipitation in July (75 mm) and the lowest in February (26 mm). During the year, the study area is reached for the most part by polar air masses coming from the west at a frequency of about 70 %.

2.2 Remote sensing and in situ dataset

In order to characterize the spatial variability of SUHI, thermal remote sensing data were used on the basis of cloud-free Landsat 5 TM imageries (path/row 190/24 or 191/23) acquired from the NASA Earth Explorer website (http://earthexplorer.usgs.gov). Only 15 imageries were usable due to frequent cloud cover over Poznań. The first scene used in this research was acquired on 14 June 2009 and the last one on 3 October 2011. All of the abovementioned data concern the warm half of the year, and the scene center scan time differs from 09:30 UTC to 09:40 UTC. In situ measurement network consists of nine meteorological stations located in different types of land cover and parts of urban space (Fig. 1). The measurements of air temperature were carried out in 30-min intervals during the period from June 2008 to May 2013 by data loggers HOBO U23-001. The accuracy of HOBO loggers is 0.2 °C which is equal to the accuracy of the Vaisala HMP45 sensor at the synoptic airport station, Ławica. The loggers are located at a height of 2 m a.g.l. in anti-radiation shields, with the exception of data obtained from the synoptic station Ławica (airport) run by the IMGW-PIB (Institute of Meteorology and Water Management - National Research Institute).

The basic characteristics of measurement points such as longitude, latitude, distance from the city center, sky view factor (SVF), and type of land cover (according to Corine Land 2006) are shown in Table 1.

Table 1 Characteristics of measurement points

Five data points represent the various types of urban development. Two measurement points—Piekary and Collegium Minus—are located in the city center, an area characterized by continuous and historic buildings. In this area, the predominant height of buildings is five stories. Measurement point Piekary is located at a car park on Piekary Street. This location is characterized by four- to five-story buildings and the Marshal Office of the Wielkopolska Region (11 floors). The Collegium Minus is located next to academic buildings and railway. This area is characterized by a more open urban space than previously described which is clearly visible in the SVF values (Table 1). The SVF is calculated by means of “fish-eye” images in the BMSky-view software.

Another data point was located in a residential area south of the city center (Rusa). The area includes buildings of different heights, while the surrounding area includes housing estates founded in the 80s. Predominant are buildings with a height of six to nine stories. In the nearest neighborhood to the measurement point, there are apartment blocks of four, ten, and even 16 stories.

Two further points were located in a discontinuous urban fabric. The type of detached houses scattered throughout the north-western part of Poznań represents point Strzeszyn (number 8), while compact detached houses are located in the southern part of Poznań (Świerczewo station). In both positions, prevailing buildings are as high as two stories (up to 7 m). The next measurement points are represented by the area near to the surface water (Dębina—sport and leisure facilities), a peripheral area of low buildings located in the northern part of Poznań (Collegium Geographicum—pastures), and green urban areas (Słoneczna). The Słoneczna point is set up within the Astronomical Observatory of Adam Mickiewicz University.

3 Methods

The intensity of UHI was defined as the difference in air temperature between particular measurement points and the synoptic station Ławica (airport), which is considered the reference station. Despite the fact that Ławica (airport) lies within the administrative boundaries of Poznań, it may be regarded partly as a rural station due to its peripheral location (Półrolniczak et al. 2015). According to Koczorowska and Farat (2006), the average annual differences in air temperature between Ławica (airport) and meteorological stations located around Poznań do not exceed 0.1 °C, while 50–80 % of daily means differ only by about 0.0–0.2 °C. A similar approach regarding UHI research was established in other Polish cities, e.g., Warsaw, Wrocław, and Kraków (Stopa-Boryczka et al. 2001; Szymanowski 2004; Bokwa 2010).

In order to obtain the UHI maps based on the meteorological station data, the ordinary kriging method (OK) was used. This method assumes that the distance between sample points is spatially correlated, as often observed in terms of spatial intensity of UHI, and there is no constant data trend over the area of investigation (Childs 2004). Moreover, OK provides usually acceptable spatial variability of the UHI phenomenon in comparison to highly sophisticated regression techniques (Szymanowski and Kryza 2009).

In order to calculate the LST derived from Landsat 5 thermal infrared images, a few additional pre-processing procedures need to be applied. First of all, the raw data (with a spatial resolution of 30 × 30 m) have to be re-calculated to an energy radiation value by means of linear calibration. It requires applying this method to every single spectral band, including the day of image registration in the formula proposed by Walawender (2009):

$$ \mathrm{L}\mathrm{s}=\left(\frac{\mathrm{Lmax}-\mathrm{Lmin}}{255}\right)\mathrm{D}\mathrm{N}+\mathrm{Lmin} $$
(1)

4 where:

Ls:

Energy radiation [W m−2 sr−1 μm−1]

Lmax:

Maximum value of energy radiation for each scene [W m−2 sr−1 μm−1]

Lmin:

Minimal value of energy radiation for each scene [W m−2 sr−1 μm−1]

DN:

Digital number (value of each pixel, band 6 thermal infrared)

To calculate the radiation temperature, a modified Planck’s formula must be used as follows (Chander and Markham 2003):

$$ {T}_{\mathrm{R}} = \frac{K_2}{ \ln \left(\frac{K_1}{\mathrm{Ls}}+1\right)} $$
(2)

where:

T R :

Radiation temperature (brightness temperature) [K]

Ls:

Energy radiation [W m−2 sr−1 μm−1]

K 1 :

Landsat 5 calibration constant = 607.76 [W m−2 sr−1 μm−1]

K 2 :

Landsat 5 calibration constant = 1260.56 [K]

Before calculating the emissivity in urban areas, it would be very useful to apply a plant coverage to the area of interest. One of the most common vegetation indices used in this approach (Sobrino and Raissouni 2000; Walawender 2009) is NDVI, which can be calculated using the formula proposed by Lillesand and Kiefer (2004):

$$ \mathrm{NDVI}=\frac{\mathrm{NIR}-\mathrm{VIS}}{\mathrm{NIR}+\mathrm{VIS}} $$
(3)

where:

NIR:

Near-infrared spectral band (wavelength for Landsat 5 TM 0.76–0.90 μm)

VIS:

Visible spectral band (wavelength for Landsat 5 TM 0.63–0.69 μm)

On the basis of NDVI:

The next step according to Sobrino and Raissouni (2000) is to calculate the fractional vegetation cover—FVC (Pv):

$$ \mathrm{P}\mathrm{v} = {\left(\frac{\mathrm{NDVI} - {\mathrm{NDVI}}_{\min }}{{\mathrm{NDVI}}_{\max } - {\mathrm{NDVI}}_{\min }}\right)}^2 $$
(4)

where NDVImin and NDVImax are the NDVI values for bare soil and full vegetation, respectively.

Further, emissivity for a full-vegetated land is counted on the basis of FVC. The most common way to calculate emissivity is a formula suggested by Sobrino and Raissouni (2000):

$$ \upvarepsilon ={\varepsilon}_{\mathrm{v}}{P}_{\mathrm{v}}+{\varepsilon}_{\mathrm{s}}\left(1-{P}_{\mathrm{v}}\right)+C $$
(5)

where:

P v :

Fractional vegetation cover

ε :

Emissivity

ε v :

Emissivity for full-vegetated land

ε s :

Emissivity for bare ground

C :

Cavity effect (for urban areas, this equals 0.005, according to Walawender 2009)

Radiation temperature, calculated from the thermal infrared remote sensing data, is a blackbody temperature whose emissivity (ε) is 1 and its radiation temperature is equal to the surface temperature—LST. In the environmental sciences, the conception of “gray body” is defined because blackbody does not exist in the environment. Its emissivity is less than 1, making it necessary to use emissivity to calculate LST from the satellite data. By comparing the Stefan-Boltzmann law for a blackbody and gray body, it is possible to find a theoretical relationship between radiation temperature (T R) and LST. The temperature of the surface is usually different from the LST counted on the basis of satellite data. The LST accuracy was validated for Landsat TM (sixth band) by Jiménez‐Muñoz and Sobrino (2003). They showed that the root mean square deviation (RMSD) is lower than 1.5 K for different land cover types. On the basis of previous calculations, the LST can be counted using the following equation (Lillesand and Kiefer 2004):

$$ \mathrm{L}\mathrm{S}\mathrm{T}=\frac{T_{\mathrm{R}}}{\varepsilon^{0.25}} $$
(6)

where:

LST:

Land surface temperature [K]

T R :

Radiation temperature [K]

ε :

Emissivity

Afterwards, the land surface temperature values can be converted from Kelvin to Celsius degrees. To do so, the following expression must be used:

$$ {\mathrm{LST}}_{\mathrm{C}}=\mathrm{L}\mathrm{S}\mathrm{T}\hbox{--}\ 273.15 $$
(7)

where:

LSTC :

Land surface temperature [°C]

LST:

Land surface temperature [K]

A prediction of the air temperature (T agl) directly from Landsat 5 TM data (LST) was realized via the standard statistical idea of constructing the best (fitted) regression model

$$ {T}_{\mathrm{agl}} = f\left(\mathrm{L}\mathrm{S}\mathrm{T},\ X,\ Y\right) + \xi, $$

using the properly chosen function f of independent variables LST and X, Y describing the geographical location of measurement points, while ξ denotes random error (with an expected value of 0). In order to select the best function f, a statistical analysis was performed for a combination of different (very wide) classes of functions (linear, non-linear, exponential, logarithmic, and hyperbolic including trigonometrical terms) in order to take into account the seasonal effect. Due to the relatively small number of observations (n = 120, i.e., observations made in eight locations for 15 Landsat scenes—limiting numbers of parameters included in a model), the analyzed functions were classified into five groups (sets):

$$ {f}_1\left(\mathrm{L}\mathrm{S}\mathrm{T},\ X,Y\right) = {P}_1\left(\mathrm{L}\mathrm{S}\mathrm{T},X,Y\right)+A\times \sin \left[{P}_2\left(\mathrm{L}\mathrm{S}\mathrm{T},X,Y\right)\right] + B\times \cos \left[{P}_3\left(\mathrm{L}\mathrm{S}\mathrm{T},X,Y\right)\right], $$
(8)
$$ {f}_2\left(\mathrm{L}\mathrm{S}\mathrm{T},\ X,Y\right)=\frac{1}{P_1\left(\mathrm{L}\mathrm{S}\mathrm{T},X,Y\right)} + A\times \sin \left[{P}_2\left(\mathrm{L}\mathrm{S}\mathrm{T},\ X,Y\right)\right] + B\times \cos \left[{P}_3\left(\mathrm{L}\mathrm{S}\mathrm{T},\ X,\ Y\right)\right] $$
(9)
$$ {f}_3\left(\mathrm{L}\mathrm{S}\mathrm{T},\ X,Y\right)= \ln \left[{P}_1\left(\mathrm{L}\mathrm{S}\mathrm{T},X,Y\right)\right]+A\times \sin \left[{P}_2\left(\mathrm{L}\mathrm{S}\mathrm{T},X,Y\right)\right]+B\times \cos \left[{P}_3\left(\mathrm{L}\mathrm{S}\mathrm{T},X,Y\right)\right], $$
(10)
$$ {f}_4\left(\mathrm{L}\mathrm{S}\mathrm{T},\ X,Y\right)={e}^{P_1\left(\mathrm{L}\mathrm{S}\mathrm{T},\ X,Y\right)}+A\times \sin \left[{P}_2\left(\mathrm{L}\mathrm{S}\mathrm{T},\ X,\ Y\right)\right] + B\times \cos \left[{P}_3\left(\mathrm{L}\mathrm{S}\mathrm{T},\ X,\ Y\right)\right], $$
(11)
$$ {f}_5\left(\mathrm{L}\mathrm{S}\mathrm{T},\ X,Y\right)={e}^{P_1\left(\tilde{\mathrm{LST}},\kern0.5em X,Y\right)}+A\times \sin \left[{P}_2\left(\mathrm{L}\mathrm{S}\mathrm{T},\ X,\ Y\right)\right] + B\times \cos \left[{P}_3\left(\mathrm{L}\mathrm{S}\mathrm{T},\ X,\ Y\right)\right], $$
(12)

where:

$$ {P}_h\left(\mathrm{L}\mathrm{S}\mathrm{T},\ X,Y\right)={\displaystyle \sum_{i=0,\ j=0,k=0}^4}{b}_{hijk}{\mathrm{LST}}^i{X}^j\ {Y}^k,\kern0.5em h=1,2,3 $$
(13)

are independent selected polynomials of variables LST, X, Y, and \( \tilde{\mathrm{LST}}= \ln \left(\mathrm{L}\mathrm{S}\mathrm{T}\right) \) [logarithm of LST] that have (highest) degree 4, where LSTi, X jY j mean the ith power of LST, jth power of X, and kth power of Y, respectively.

Examples clarifying types of above polynomials are:

$$ \begin{array}{l}\mathrm{f}\mathrm{o}\mathrm{r}\ h=1:\kern0.5em {P}_1\left(\mathrm{L}\mathrm{S}\mathrm{T},\ X,Y\right) = {b}_{1000} + {b}_{1100}\mathrm{L}\mathrm{S}\mathrm{T}++{b}_{1200}{\mathrm{LST}}^2\kern0.5em \\ {}\mathrm{f}\mathrm{o}\mathrm{r}\ h=2:\kern0.5em {P}_2\left(\mathrm{L}\mathrm{S}\mathrm{T},\ X,Y\right) = {b}_{2000} + {b}_{2100}\mathrm{L}\mathrm{S}\mathrm{T} + {b}_{2010}X + {b}_{2001}Y+{b}_{2011}X\times Y\ \\ {}\mathrm{f}\mathrm{o}\mathrm{r}\ h=3:\ {P}_3\left(\mathrm{L}\mathrm{S}\mathrm{T},\ X,Y\right) = {b}_{3000} + {b}_{3110}\mathrm{L}\mathrm{S}\mathrm{T}\times X + {b}_{3101}\mathrm{L}\mathrm{S}\mathrm{T}\times Y+{b}_{3111}\mathrm{L}\mathrm{S}\mathrm{T}\times X\times Y, \end{array} $$

where LST2 means second power of LST, and coefficients b hijk for other indices i, j, k described in Eq. 13 and not used in above polynomials are equal to 0.

Thus, the investigated models were of the form proposed by formula (8) using specific types of functions described in formulas (9–13):

$$ {T}_{\mathrm{agl}} = {f}_i\left(\mathrm{L}\mathrm{S}\mathrm{T},\ X,Y\right) + \xi, \kern0.5em i=1, \dots, 5 $$
(14)

with the estimated coefficients b hijk of the variables in polynomials defined by (14), as well as parameters A and B, and under the assumption of normality in errors ξ. The estimation was performed using the standard least square method. It is important to underline that the geographical coordinates of measurement points (described by variables X and Y) were also included in the model in order to determine their significance. In the procedure, a stepwise (backward) regression was used to determine:

  1. 1.

    If the prediction is dependent on the geographical location of measurement points at a statistically significant level

  2. 2.

    If the prediction is dependent on seasonal effects (measured by trigonometric terms with coefficients A and B) at a statistically significant level

  3. 3.

    The possibility of reduction in the degree of polynomials P 1, P 2, and P 3 (according to Ockham’s razor principle: Numquam ponenda est pluralitas sine necessitate—plurality must never be posited without necessity)

Models were constructed and estimated with the use of statistical package Statistica 10 (stepwise procedure was done partly manually due to complexity of the analyzed models and taking some of parameters b hijk as equal to 0 in order to overcome the problem of overparametrization). The methodology described above is a generalization of methods (linear prediction) proposed by Kawashima et al. (2000) and Colombi et al. (2007). The idea was to construct a more advanced model that is valid for different times of the day and year while taking into account the location of measurement points. It should be noted that the above regression models are only an attempt to quantify the analyzed dependencies according to the limited possibilities of their verification due to difficulties with obtaining subsequent observations.

5 Results

5.1 UHI on the basis of “in situ” data

The city center is warmer than the surrounding area during most of the year (Półrolniczak et al. 2015). The average annual UHI intensity in the downtown area is 1.0 °C and decreases towards the periphery of the city along with a declining development intensity (Fig. 2). The intensity of the UHI was in general higher in the spring and summer seasons, even though it must be stated that extreme values may occur throughout the year. For instance, the maximum intensity of the UHI (Piekary 7.0 °C) occurred on 26 January 2010, with the second highest on 4 April 2009 (Collegium Minus 6.2 °C). As shown in Fig. 2, the spatial variability of mean UHI in Poznań for the period 2008–2013 takes usually the cellular form which is characterized by distinct foci on heat areas separated by cooler air. The UHI spatial differentiation in cold (X–III) and warm (IV–IX) seasons is similar to the warmest areas in the city center and the coldest in the outskirts, particularly in the eastern, northern, and northeastern regions. Spatial differentiation of the UHI is clearly correlated with the urban structures since the densely built-up downtown and areas with high buildings are much warmer than the urban green areas, water bodies, and their surroundings. Despite the similar spatial differentiation, the intensity of UHI is higher in the warm half of the year (Fig. 2).

Fig. 2
figure 2

The mean intensity of UHI in Poznań for a cold (X–III) and warm (IV–IX) part of the year. Data from the period June 2008–May 2013

Figure 3 shows the diurnal and annual UHI intensity in all the city’s studied points. The predominant nocturnal nature of the UHI is evident especially in the city center (Fig. 3a, b) and in areas with high buildings (Fig. 3c). Higher values (more than 2 °C) appear in 20–22 UTC from April to July in the city center (Fig. 3a), while smaller values (about 1.5 °C) may occur during the period from March to September in the afternoon (ca. 19 UTC). From March to August, the UHI intensity during daytime is about 1 °C, and in April, it can reach 2 °C between 13 and 15 UTC (Fig. 3b). Areas with low buildings surrounded by urban green (Fig. 3g) have the highest values of UHI from the afternoon to the next morning mainly from March to May, while in the morning during the summer season, there is a so-called cold lake.

Fig. 3
figure 3

Monthly and hourly mean intensity of UHI (°C) in measurement points: Piekary (a), Collegium Minus (b), Rusa (c), Świerczewo (d), Strzeszyn (e), Collegium Geographicum (f), Słoneczna (g), Dębina (h). Data from the period June 2008–May 2013

Generally, a smaller intensity as well as two maxima during the day are the characteristic features of the UHI in a compact single-family housing area (Fig. 3d). These maximum values reach above 0.5 °C between 8 and 15 UTC from February to September and from about 18 to 4 UTC from March to June.

In areas with a substantial share of green (Fig. 3e, f, h), the UHI occurs during the day with a peak in March and April (Fig. 3e, h), and at night, these areas get significantly cooler. The strong intensity of the “cold lake” appears in the evening and night for nearly most of the year, with a maximum in the summer between 19 and 23 UTC (Fig. 3e).

During daytime, the values of standard deviation for the mean UHI intensity vary from 0.51 °C (at Strzeszyn measurement point) to 0.73 °C (at Collegium Geographicum). At night, the standard deviation is higher and varies from 0.72 °C in Strzeszyn to 1.13 °C at Collegium Geographicum. The measurement point located in the city center—Piekary—is characterized by a standard deviation of 0.63 °C during the day and 0.82 °C at night.

5.2 UHI on the basis of Landsat 5 TM data

To characterize the spatial extent of the UHI in Poznań, a map of the mean land surface temperature anomalies was calculated (Fig. 4). This map was drawn up from the maximal number of available Landsat 5 TM data. In the case of Poznań, in a selected time period, there were 15 scenes from days with clear-sky conditions. The LST anomalies’ map (LSTanom) was calculated as the difference between the mean map of LST derived from all 15 Landsat scenes (LSTM15) and the mean constant value of the LSTM15 raster (LSTac), as shown in Eq. 15:

Fig. 4
figure 4

Spatial variability of LSTanom (°C) in Poznań and profile lines used in Fig. 5 (A–A') and Fig. 6 (B–B')

$$ {\mathrm{LST}}_{\mathrm{anom}}={{\mathrm{LST}}_{\mathrm{M}}}_{15}\hbox{--} {\mathrm{LST}}_{\mathrm{ac}.} $$
(15)

The highest value of LSTanom is about 12.6 °C, and the lowest value reaches −8.6 °C. This points to the negative SUHI phenomenon. As can be seen from the map (Fig. 4), there is a noticeable difference between the urban fabric and green urban areas (parks, forests, arable land, etc.).

Selected land cover types were used to calculate the mean values of LSTanom. According to Corine Land Cover 2006, there are 12 different land cover types in Poznań (Table 2). The parameters presented in Table 2 were calculated with the ArcGIS 10.1 software using the Spatial Analyst Tools toolbox and function Zonal Statistics as Table. The highest values of mean LST anomalies have the following: continuous urban fabric (3.4 °C), industrial or commercial units (2.8 °C), and airports (2.2 °C). The lowest values have different types of forests such as broad-leaved forest (−3.1 °C), coniferous forest (−2.9 °C), and mixed forest (−2.9 °C). Between urbanized areas and forests, the average values of mean LST anomalies were reached by green urban areas, arable land, pastures, sport facilities, and discontinuous urban fabric. The mean maximum LST anomalies (12.0 °C) are placed within industrial or commercial units, while the minimum for mixed forest is −0.7 °C.

Table 2 Mean, minimum, maximum, and standard deviation of LSTanom and T anom (°C) for individual land cover types

The outcome from Table 2, in terms of green urban and various types of forest areas, should be discussed. The T anom values presented below are differences of air temperature obtained from the statistical model based on Landsat and in situ measurement data (as shown in Eq. 14) and constant territorial average air temperature for entire city in its administrative boundaries. In contrast to other land cover types, the T anom values are equal to or higher than that of the LST. The overestimation of model temperature values in relation to the LSTM15 for green areas is probably caused by a differentiation of the structure of energy balance for different types of surface (Oke 1988; Paszyński et al. 1999; Arnfield 2003). Moreover, researchers have decided to use a univariate regression model in terms of different surface types, which may incorporate additional bias due to the uneven weight provided by specific land use categories.

The mean difference between the measured air temperature at 2 m a.g.l. and the mean temperature of every pixel value (30 × 30 m) (LSTM15) is significantly higher for urbanized areas (as shown later in Fig. 7). In urbanized areas, a particular pixel is characterized by a more irregular type of surface. The surface configuration (roughness, geometry, albedo, etc.) has an impact on the temperature inside every pixel, which strongly modifies the obtained results. The results shown in Table 2 indicate an overestimation by the model T agl for green urban areas and perhaps an underestimation in others. The obtained results show that the construction of a univariate model for calculating T agl on the basis of LST in urban areas is very difficult.

To characterize in detail the spatial extent of the surface urban heat island in Poznań, two profile graphs were made (Fig. 4). Profile A–A' crosses the city from the north-west to south-east (Fig. 5). The mean LSTanom for the entire profile is 0.8 °C. The maximum reaches 5.8 °C and the minimum −2.7 °C. The range of LSTanom in profile A–A' is 8.5 °C. To create this graph, 814 raster cells were used, which gives the distance between the start and end point at about 24 km. The LSTanom values vary, depending on the type of land use. Profile A–A' starts at the northwestern suburbs of Poznań, where the dominant land cover types are arable land, small housing estates, and forests. Then, the profile crosses through a green urban area known as the western wedge. In this area, the minimum LSTanom is reached. Then, the profile A–A' crosses the warmer part of Poznań—the city center. Afterwards, the profile cuts through the Warta River valley, housing estate and retail park, where the maximum LSTanom values are reached before heading to the south-eastern borders of Poznań where LSTanom sharply decreases. As can be seen from profile A–A', the south-eastern suburbs are warmer than the north-western’s and is caused by the industrial character of the area.

Fig. 5
figure 5

(A–A' Profile graph of LSTanom (°C) on the background of Corine Land Cover 2006

Profile graph B–B' crosses Poznań from west to east (Fig. 6). The mean LSTanom for the whole profile is 0.19 °C, with a maximum of 6.6 °C and a minimum of −4.3 °C. The B–B' profile consists of 661 raster cells (about 20 km), and begins in the western suburbs of Poznań (mainly arable lands and discontinuous urban fabric). As can be seen in the profile graph, there are 5 rapid LSTanom drops. These drops are respectively caused by forests and allotment gardens in northern part of the eastern wedge of the green area. The warmest areas are found in the Poznań International Fair, main railway station, and continuous urban fabric in the city center. The increase in LSTanom in the eastern suburbs is caused by the industrial and residential character of the surrounding district.

Fig. 6
figure 6

B–B' Profile graph of LSTanom (°C) on the background of Corine Land Cover 2006 (used colors denoting land use types same as in Fig. 5)

5.3 UHI on the basis of regression models

One of the simplest solutions for predicting T agl on the basis of LST is the linear regression approach, using the least squares method for estimating the regressions’ coefficients:

$$ \hat{T_{\mathrm{agl}}}=a\times \mathrm{L}\mathrm{S}\mathrm{T}+b $$
(16)

where:

\( \hat{T_{\mathrm{agl}}} \) :

Predicted T agl on the basis of LST

a, b :

Linear regression coefficients (slope and intercept, respectively)

The results from the linear models in terms of predicting \( \hat{T_{\mathrm{agl}}} \) are shown in Fig. 7. The coefficient of determination (r 2) values varies from 0.78 (Ławica—airport) to 0.90 (Dębina) and for all measurement points (total) r 2 = 0.77. However, the residuals for all types of land uses seem to have a very high range (in some cases beyond 10 °C, Fig. 8).

Fig. 7
figure 7

Scatterplots of LST and corresponding air temperature at 2 m height. ΔT denotes the mean difference between LST and T agl. Gray area indicates 95 % confident interval for linear assumption.

Fig. 8
figure 8

Distribution of residuals for the estimated models: non-linear regression model (blue line and blue stripes) and linear model (red dashed line and red stripes)

In the next part of the study, an attempt to improve the obtained results was made by means of the non-linear regression model. The best function to forecast air temperature turned out to be the regression model based on the function f 5 with the polynomial P 1 ((\( \tilde{\mathrm{LST}} \), X, Y) of the degree 1 (being the linear function of the logarithm of LST), independent statistically in trigonomical terms. It could be presented in the form:

$$ \hat{T_{\mathrm{agl}}}={b}_0\times {\left(\mathrm{L}\mathrm{S}\mathrm{T}\right)}^{{\boldsymbol{b}}_1} $$
(17)

where:

LST:

Land surface temperature [°C]

b 0 , b 1 :

Coefficients of the regression model

\( \hat{T_{\mathrm{agl}}} \) :

Estimated air temperature 2 m above ground level [°C]

The prediction is independent of the location of the measurement point.

The results procured on the basis of the above type of function are sufficiently reliable. The reduced final model is built with the use of 120 observations which, for such a linearized one variable model, constitute a sufficient sample size (Kelley and Maxwell 2003; Soper 2015). It is confirmed by a relatively high value of coefficient of determination (r 2 = 0.84). The estimated parameters of the regression model are presented in Table 3.

Table 3 Estimated parameters of the model

Presented below (Fig. 8) is the distribution of residuals (\( \hat{\varDelta T = {T}_{\mathrm{agl}}\hbox{--} {T}_{\mathrm{agl}}} \)) of the estimated models which confirms their expected normality. It is worth noticing that the obtained residuals for the non-linear model are slightly better fitted in comparison to the linear assumption.

As a result of the presented statistical non-linear model, the mean map of air temperature anomalies was calculated (T anom, Fig. 9). This map is calculated from the LSTM15 map as the difference between the mean map of T agl and the city area mean value of T agl (constant value). The maximum of the mean air temperature anomaly (T anom) reaches almost 9.4 °C, while the minimum equals −6.3 °C. Within the measurement points, the highest values of received T anom can be found in Ławica (3.4 °C), Piekary (2.2 °C), and Świerczewo (2.0 °C). The lowest values are achieved in measurement points Dębina (−0.9 °C) and Słoneczna (−0.6 °C).

Fig. 9
figure 9

Spatial variability of T anom (°C) in Poznań. Values calculated using non-linear regression model

6 Discussion and conclusions

In situ measurements clearly show that the UHI phenomenon is a common occurrence in the city of Poznań (Półrolniczak et al. 2015). The mean annual intensity of UHI reaches 1.0 °C, while the intensity of UHI is greater in the warm half of the year than in the cold one. Analyzing the diurnal course, the highest intensity of UHI is reached between 20 and 22 UTC from April to July in the city center (above 2 °C in average). This is despite the fact that at night, the city center is clearly warmer than the peripherals. The coldest times during the day in the center of Poznań are between 5 and 7 UTC. The UHI intensity reaches its minimum values of 0.5 to 0.7 °C—in comparison to Ławica (airport) station. Cold lakes (i.e., situations with air temperature being higher at the airport) appear beyond the city center during night hours, mainly in areas with low urban development, with a maximum intensity below −1 °C in Strzeszyn station. Both the maximal intensity and cold lake situations are mostly observed during the warm season of the year (Półrolniczak et al. 2015). These features of UHI were also found by other researchers worldwide (e.g., Arnfield 2003; Zhang et al. 2014) and confirm the regularities of the annual and diurnal course of this phenomenon found in other Polish cities (Szymanowski 2004; Bokwa 2010).

The SUHI in Poznań based on LST derived from Landsat 5 TM imageries show differences for particular land cover categories. Continuous urban fabric and industrial and commercial units are the warmest categories and constitute the hot spots in the area of Poznań. A slightly lower LST characterizes areas with lower urban development, while the coolest are green urban areas, sport and leisure facilities, or pastures and forests. The biggest difference in LSTanom for selected land covers reaches 6.45 °C (difference between the mean LSTanom for continuous urban fabric and broad-leaved forests), even though the differences between the hottest and coolest pixels (LSTM15) could be considered greater and can reach as high as 21.2 °C.

To calculate the air temperature (T agl) from Landsat 5 TM data, both linear and non-linear regression models were applied. Slightly better results were obtained from the non-linear regression model, which are confirmed by a slightly higher value of coefficient of determination (r 2 = 0.84) and normally distributed residuals.

For the areas represented by the highest degree of urbanization (continuous urban fabric and industrial and commercial units), T anom is equal to 2.5 and 2.1 °C respectively. The same order of magnitude is recognized for the coolest areas (forests), where the minimum equals −2.2 °C. The biggest difference between the mean T anom values concerning various types of land cover is 4.7 °C and is lower than the LST one (the difference between the mean T anom for continuous urban fabric and broad-leaved forests). Similarly, the maximum difference concerning the hottest and coolest pixels on T agl is lower than that on the LST map, which is about 15.7 °C.

Even if applying a simple statistical model does not allow for treating the obtained air temperature equally with traditional in situ measurements, it is robust enough to indicate areas of hot spots and cold lakes with a resolution of raster cell size (for Landsat 5 TM, it is equal to 30 × 30 m). These small-scale details might often be diminished (or completely omitted) while using a traditional coarse in situ measurement network. However, simplistic statistical models are reported to perform not so well in the case of neural networks or even more expanded multiple regression analyses and may lead to considerable biases for heads and tails of distributions (Fig. 7). The source of the problems is related to possible LST estimation errors (Schwarz et al. 2011), in terms of high complexity of land use types in a city. Different types of surfaces may strongly vary in terms of thermal capacity (Wloczyk et al. 2011), which means that a single universal regression model applied to an entire city may have problems with extreme values. This is despite the fact that the obtained accuracies for linear and non-linear models are similar to other models applied in similar research. For instance, Utsunomiya (1992), using the in situ measurements and Landsat 5 TM data, constructed a linear model that allows for calculating air temperature on Kyushu Island. On the basis of 51 samples, the accuracy of the model as given by the coefficient of determination was equal to 0.80, while in the case of Poznań wherein 15 imageries were used, the accuracy was slightly higher (0.84). In a study conducted by Wloczyk et al. (2011) based on Landsat 7ETM+ for more homogenous green areas, the coefficients of determination were slightly higher (r 2 = 0.89), with an average root mean square error of about 3 K. The same order of accuracy was obtained in a research based on MODIS thermal products conducted by Sun et al. (2005), wherein over 80 % of the analyzed samples had an accuracy of 3 K. A strong relationship between LST (derived from Landsat TM) and air temperature was also found in a recent study by Martin et al. (2015) for Montreal and by Klok et al. (2012) for Rotterdam, which obtained coefficients of determination of up to 0.81.

In every aforementioned example, the coefficients of determination are relatively high and comparative with values obtained in this study, even though in all cases there is a possibility of elevated biases in connection with a very complex structure of land use in the city (e.g., urban fabric, land cover, topography, etc.). Nevertheless, based on coupling both in situ and thermal remote sensing data, the calculated fields allow for characterizing UHI and SUHI spatial variability to an extent greater than using only one source of information.