1 Introduction

The concept of continentality in climatology refers to all characteristics of climate influenced by continents and is regarded as a notion opposed to climate oceanity. The continental impact is most frequently considered in the context of its effect on air temperature (thermal continentality) and precipitation (pluvial or hygric continentality). The continental climate is characterised by higher diurnal and annual amplitude of air temperature, hot summers, and cold winters. As the distance from the ocean increases, the cloudiness and precipitation rates decline and the annual distribution of precipitation changes and exhibits a distinct summer maximum. The oceanic climate is characterised by high air humidity and high rates of precipitation distributed evenly throughout the year, with low annual air temperature amplitudes, cool summers, and mild winters.

In a macroscale, it can be assumed that climate features of any place on the globe comprise a signal coming from the continents and a supposedly opposite signal from the oceans. To quantify these interactions, a number of indices, primarily describing the “strength” of land impacts, have been developed. Given the disproportion between the land and ocean cover (ca. 30 %—lands and 70 %—oceans), they can be regarded as a specific modification of the prevailing oceanic climate on the planet. As a rule, indicators of continentality express the relation between continental and oceanic features of climate and, hence, underline the relative nature of continentality. The continentality has been assigned particular importance in characterisation of areas with the so-called transitional climate combining continental and oceanic impacts, which exhibit high year-to-year variability depending on macro-circulation features. For Poland, such analyses have been performed by Kożuchowski and Marciniak (1986, 1992), Kożuchowski and Wibig (1988), and Kożuchowski (2003). Recently, continentality indices have been applied in environmental analyses, particularly as a variable supporting the analysis of variability and spatial distribution of, e.g., air temperature (Hogewind and Bissolli 2011), evapotranspiration (Marti and Gasque 2010) and bioclimatic changes (Torregrosa et al. 2013). The analytical usefulness of continentality indices has also been corroborated in the investigations of glacier mass balance (Holmlund and Schneider 1997), plant species range and the treeline (Caccianiga et al. 2008), and plant pollen in northern Europe (Salonen et al. 2012).

Given the recent climate changes, it is expected that continentality indices bring important signs of these changes, particularly when research-based thereon will address the dynamic year-to-year changes, multi-year trends, and predictions supplemented with detailed analysis of extreme values and circulation relationships. On the one hand, maps of continentality indices can provide data concerning a number of environmental elements, thereby being an important variable in multi-dimensional spatial modelling of these elements, as mentioned above. Methodology for accurate spatialization of continentality indices based on discrete point observations is a prerequisite for generation of such maps. This paper is focused on spatial interpolation of the thermal continentality index for Poland. It has a methodological character allowing indication of a preferable spatialization algorithm in accordance with the environmental characteristics of the study area and the nature of the input data set.

The annual temperature amplitude, i.e., the difference between the mean temperature of the warmest and coldest month, is the simplest absolute measure of thermal continentality. The drawback of this indicator lies in its dependence on seasonal changes in the quantity of incoming solar energy, which results in an increase in the annual amplitude with latitude. The effect is compensated for in the thermal continentality indices by dividing the temperature amplitude by the sine of latitude with concurrent introduction of empirical parameters facilitating rescaling of the index into the assumed interval in accordance with theoretical assumptions (Conrad 1946).

Various continentality indices used in climatology are based on the average annual temperature amplitude scaled with the sine of latitude. However, Driscoll and Yee Fong (1992) suggest that there is no conclusive evidence for the validity of the use of this divisor with respect to amplitude changes determined by the incoming solar energy. This group comprises indicators specified by Gorczyński (1920), Johansson (1926), Raunio (1948), Conrad (Conrad and Pollak 1950), Hela (1953), Ivanov (1953), Khromov (1957), Ewert (1963), and Hogewind and Bissolli (2011). Formulas extending this type of thermal continentality indices with other environmental parameters include indicators developed by Spitaler (1922), Ringleb and Johansson (after Szreffel 1961), and Ivanov (1959). Berg (1944), Bailey (1968), and Oliver (1970) proposed formulas based on other assumptions than annual temperature amplitude. These ideas have been put forward by Driscoll and Yee Fong (1992) and Mikolášková (2009).

Classic approach to present continentality indices uses hand-drawn isolines (Swoboda 1922; Kożuchowski and Marciniak 1992), with generalisation typical of this technique, especially in small scales. In recent studies, index maps are generated in the GIS environment, taking advantage of raster maps and showing more details (Mikolášková 2009; Torregrosa et al. 2013). The spatialization approaches applied are relatively simple, one-dimensional, and disregard selection of an optimal interpolation method and accuracy of the results.

Currently, environmental research, including climatology, employs various interpolation techniques (Hengl 2007; Li and Heap 2008; Szymanowski et al. 2012), which often leads to difficulties in choosing a method that is appropriate in a given case. This problem has been addressed, e.g., in the COST 719 research project “The Use of GIS in Climatology and Meteorology” (Dobesch et al. 2007; COST Action 719 Final Report 2008), but the investigations did not yield a conclusive solution. The spectrum of methods employed for spatial interpolation of climate elements is very wide and comprises deterministic and geostatistical techniques and their combinations. The best results are usually obtained with multivariate methods considering the role of environmental factors, in particular the elevation and coordinates (Szymanowski et al. 2012). This group includes, among others, the multiple linear regression (MLR) and residual kriging (MLRK, regression kriging) methods. Analyses of the spatial properties of climate elements, particularly non-stationarity, indicate that the local geographically weighted regression (GWR) model with residual kriging (geographically weighted regression kriging, GWRK) is more suitable for modelling the spatial variability of these elements (Szymanowski and Kryza 2011, 2012). Methodological research aiming at interpolation of the temperature field for Poland resulted in construction of a decision tree for selection of an optimal multivariate interpolation method considering the potential non-stationarity of the spatial process (Szymanowski et al. 2012, 2013).

The main objective of this study is to perform spatial interpolation and generate maps of Ewert’s index of continentality (1972) for selected cases in Poland representing long-term annual mean values and years with extreme values or characteristics of the spatial distribution of this indicator. This methodological paper emphasises the choice of an optimal interpolation method and indication of guidelines for automated or semi-automated interpolation of the indicator for a large data set (multi-year series) to identify the trend in climate change. Second aim is to validate the usefulness of the scheme of selection of the optimal interpolation method developed by Szymanowski et al. (2012, 2013) for interpolation of the continentality indices.

2 Study Area

Poland (312,679 km2) is situated in the central part of Europe. The terrain elevation in the country ranges from −1.8 m in the north to 2499 m above sea level in the highest, mountainous area in the south of the country. The north-western border of Poland is delimited by the Baltic Sea, and in the south, the border extends along the main ridge of the Sudetes and Carpathians (Fig. 1).

Fig. 1
figure 1

Study area and location of meteorological stations used in the study

Poland is located in the temperate transitional climate zone with clear continental and oceanic impacts. Westerly winds and polar air masses are predominant. The spatial distribution of the lowest values of the mean annual air temperature is determined by elevation, with the top parts of the mountains (−0.7 °C), and continentality, with the north-eastern part of the country being the second coolest region (<7 °C). The mean annual temperature increases from the north-east to the south-west, where it exceeds 8.5 °C (Woś 2010). July, with temperatures ranging from 17.3 to 18.8 °C (except for the mountains), is the warmest month, and January, with temperatures in the range between −3.4 and −1.3 °C, is the coldest (Kożuchowski 2011). In January, longitudinal distribution of isotherms dominates, with temperatures decreasing eastwards. In July, the course of isotherms exhibits latitudinal distribution with temperatures decreasing from the central regions northwards, towards the Baltic Sea, and southwards along the increasing elevation in the mountains. The mean annual temperature amplitude varies between ca. 15 °C in high mountain areas and 22 °C in the eastern part of the country. The longitudinal course of isoamplitudes typical for the east of the country is deformed in the west. The longitudinal distribution is only observed there in the central part, whereas in the north and south, the arrangement has a latitudinal direction similar to the course of the coast and mountain ranges (Woś 2010). A similar distribution is characteristic for the multi-year continentality indices. The areal average value of Ewert’s index of continentality is 44.3 % (Kożuchowski 2011). In decades dominated by oceanic influences, it ranges from below 38 % at the Baltic Sea to over 48 % in the east of the country. In decades dominated by continental influences, the values of Ewert’s index range from below 44 % and over 56 %, respectively (Kożuchowski and Marciniak 1992).

3 Data and Methods

3.1 Ewert’s Index of Continentality

One of the thermal continentality indices presented in the Introduction section, i.e., Ewert’s index (1972), was selected for the analysis. Compared with other indicators involving compensation of the influence of latitude, Ewert’s index includes additional parameterization considering the impact of land cover on the value of the annual air temperature amplitude in the individual latitude zones. These relationships were estimated particularly carefully for Central Europe, where the best results are achieved (Ewert 1972). Additional important assumptions for the development of the index include the possibility of applying it to all latitudes and any—annual or multi-year—period. Ewert’s index of continentality K [%] is expressed by the formula:

$$K = \frac{A - (3.81\sin \varphi + 0.1)}{38.39\sin \varphi + 7.47} \times 100,$$
(1)

where A is the annual temperature amplitude and φ is latitude. The index is not limited to the range of 0–100 %, but it has the values from −1.5 % for Turk Island to 141.5 % for Yakutsk. Areas with negative index values are characterised by “hyperoceanity” and those with the values >100 % by “hypercontentality” (Ewert 1972). Minimal “hyperoceanity” has been identified at several Pacific stations and “hypercontentality” in a large part of eastern Siberia (Ewert 1972).

To achieve one of the objectives of this study, i.e., development of a method and general guidelines for spatial interpolation of Ewert’s index of continentality for Poland, the analyses were based on five data sets representing mean and extreme values of the index from 1981 to 2010 (Table 1):

Table 1 Summary of statistics of Ewert’s index of continentality (K) in Poland for the four selected years and the 1981–2010 average
  • the mean value of the index (K 1981–2010);

  • the years 1989 and 2006 with the lowest and highest areal means for Poland, respectively (K 1989, K 2006); and

  • the years 1990 and 2002 with the smallest and largest index ranges, respectively (K 1990, K 2002).

The study was based on measurement data provided by 111 synoptic meteorological stations, including 53 Polish stations and 58 stations located outside Polish borders (Fig. 1). The localization of each station was carefully verified by comparing station metadata with ortophotomaps and digital elevation model. The values of the annual amplitude of air temperature were calculated based on information contained in global summary of the day (GSOD) database and provided by German weather service—Deutscher Wetterdienst (Klimadaten für Deutschland—online—frei; http://www.dwd.de).

3.2 Environmental Variables

The findings of the COST 719 project (COST Action 719 Final Report 2008) and the previous experience related to interpolation of meteorological elements for Poland (Ustrnul and Czekierda 2005, 2009; Szymanowski et al. 2012) indicate a significant value of multivariate interpolation algorithms constituted for large range of climatological applications. The methods consider the deterministic relationships between the modelled element and environmental variables—predictors. The spatial distribution of the predictors is represented in the interpolation procedure by raster layers with a fixed spatial resolution. Variables prepared previously for interpolation of temperature (Szymanowski et al. 2012, 2013), which could have a significant impact on the distribution of the continentality index, were used in the study. Three groups of variables were included in the set of potential predictors:

  • variables of the overall spatial trend comprising layers of raster cell coordinates (X, Y) and the sea distance index (SDI);

  • terrain elevation represented by the digital elevation model (DEM) and derivative layers: the concavity/convexity index (CCI), the foehn index (FI), and insolation (IT); and

  • land cover and its derivatives: the percentage of natural surfaces (NS) and artificial surfaces (AS) in the neighbourhood of the data point.

Variables X and Y denote coordinate values in the local coordinate system PUWG-92. The influence of the Baltic Sea is illustrated by the SDI index. To consider the decreasing impact of the Baltic Sea along with distance, the index was constructed as a square root of the smallest Euclidean distance (expressed by the number of cells) of a raster cell from the coastline. SRTM-3 elevation data (http://www2.jpl.nasa.gov/srtm) were used as a digital elevation model (DEM). To achieve the goal of this study, the data were transformed into the PUWG-92 system and resampled into 250-m resolution. This resolution is a compromise between the computational costs and details of information introduced to the models and expressed on the maps. High resolution is of special importance when the interpolated variable is potentially strongly dependant on local factors (including changes in altitude). The choice of spatial resolution applied is supported by earlier studies of Szymanowski et al. (2012, 2013) and Szymanowski and Kryza (2015). Based on the 250-m DEM, the concavity/convexity index (CCI) expressing the cool air accumulation effects of concave terrain forms and the foehn index (FI) illustrating the thermal impact of the foehn wind were calculated (Szymanowski et al. 2007). Insolation (IT), expressed by sums of energy of potential total radiation incoming to the terrain surface, is a variable illustrating the role of energy factors. The calculations were performed with the use of the r.sun program implemented in the GIS—GRASS software (GRASS Development Team 2011). The r.sun program is a functionally best-developed module for calculation of radiation in GIS, which can be successfully applied to large areas (Šuri and Hofierka 2004) and works with high-resolution terrain models (Kryza et al. 2010). Variables describing the percentage of the surface area arbitrarily called “artificial” (AS) and “natural” (NS) in the surroundings with a radius of 2500 m around each point were prepared on the basis of CLC2000—CORINE Land Cover 2000 database (2004) for European Union countries and the USGS Land Cover database (2011) for Ukraine, Belarus, and Russia. The values of all environmental variables were extracted from raster layers at the coordinates of localization of each meteorological station and were used to specify regression models as described below.

3.3 Statistical and Spatial Analysis Methods

In this study, a scheme of selection of an optimal interpolation method developed with an example of air temperature is employed (Szymanowski et al. 2012, 2013). The scheme is based on Hengl’s decision tree (2007), and was extended with methods dedicated to non-stationary spatial processes. It is generally assumed that the analysed continentality index is characterised by a statistically significant correlation with at least one environmental variable; therefore, the deterministic component can be modelled with the regression method. In the absence of the correlation, spatial interpolation would have to be performed using one of the one-dimensional methods, e.g., ordinary kriging (OK) or inverse distance weighting (IDW), depending on the degree of spatial autocorrelation of the analysed variable (Hengl 2007). In the scheme discussed, the modelled indicator is regarded as a regionalised variable (Matheron 1971), i.e., a random variable comprising structural, local, and random components. This cumulative model was termed a universal model of spatial variability by Matheron. The structural—deterministic (\(\hat{m}(s_{0} )\)) and local—stochastic (deterministic model residuals—\(\hat{e}(s_{0} )\)) components of variability can be modelled separately and the estimated value of variable z in position \(s_{0}\)\(\hat{z}(s_{0} )\) is a sum of these components (Eq. 2):

$$\hat{z}(s_{0} ) = \hat{m}(s_{0} ) + \hat{e}(s_{0} ).$$
(2)

If the deterministic component is modelled with the regression method, and kriging is used for modelling regression residuals, such a model is called residual kriging (Hengl 2007). The decision scheme is based on Matheron’s model and offers a possibility of application of one of the four interpolation methods at a general assumption of existence of a correlation between the modelled variable and environmental variables. The choice of an optimal method follows decisions that are taken in two steps. Initially, based on the goodness-of-fit of the model to the observations, the stationarity of the spatial process is assessed. The choice is made between the global, multiple linear regression (MLR) model dedicated to stationary processes, and the local, geographically weighted regression (GWR) model employed for non-stationary processes.

The global multiple linear regression MLR model can be expressed as the following equation:

$$\hat{m}\;(s_{0} ) = \mathop \sum \limits_{k = 0}^{p} \hat{\beta }_{k} q_{k} (s_{0} ),$$
(3)

where \(\hat{\beta }_{k}\) are estimated regression coefficients (\(\hat{\beta }_{0}\)—estimated intercept and \(q_{k} (s_{0} )\)—explanatory variables). The explanatory variables were selected for the model with the stepwise regression method. The prerequisite for inclusion of a variable was its statistically significant (p < 0.05) correlation with the continentality index and the absence of collinearity with other explanatory variables determined by the variance inflation factor, VIF (VIF <10).

The strongest reservations concerning the MLR model in the context of spatial interpolation are that the possible local variations and non-constant spatial relation between the predictor and the estimated variable are ignored both during model calibration and in the prediction stage. These factors are considered in the geographically weighted regression model (Fotheringham et al. 2002). The validity of the application of this model for air temperature in Poland has been demonstrated by Szymanowski et al. (2012). When the GWR model is employed, the deterministic component of the spatial variability model will be expressed as the following equation:

$$\hat{m}(s_{0} ) = \mathop \sum \limits_{k = 0}^{p} \hat{\beta }_{k} (s_{0} ) q_{k} (s_{0} ),$$
(4)

where \(\hat{\beta }_{k} (s_{0} )\) are estimated regression coefficients in location \(s_{0}\) and \(q_{k} (s_{0} )\)—explanatory variables. Estimation of regression parameters is performed locally using the weighted least-squares method. The schemes of weighing in GWR moving windows are developed in such a way that the weight would decrease with the distance between the estimation point and the data point. In this study, we used the bi-square function in adaptive kernels, whose value was selected on the basis of minimisation of the Akaike information criterion (AIC; Fotheringham et al. 2002). Exactly, the same auxiliary variables as in the MLR were used in the GWR model.

In practice, the decision concerning the choice of the regression model for approximation of the deterministic component of the universal model can be taken only on the basis of the degree of the goodness-of-fit of the model to the observation data. It was based on such measures of the fit as the adjusted coefficient of determination (R 2adj. ), standard error of estimation (STE), corrected Akaike information criterion (AICc), and analysis of variance (ANOVA) for regression model residuals. ANOVA was used to check whether the improvement of the model fit, expressed as a decrease in the sum of squared residuals of the model, was statistically significant (Szymanowski and Kryza 2012).

The justification for the use of the GWR model instead of MLR was tested using two tests of the spatial variability of local geographically weighted regression coefficients, i.e., Monte Carlo implemented in the GWR3.0 software (Fotheringham et al. 2002) and the geographical variability (GV) test from the GWR4.0 software (Nakaya 2016). In the case of stationarity of any of the explanatory variables, it would be justified to apply a mixed GWR (MGWR) model instead of a fully local approach (Nakaya et al. 2005). The mixed models allow to mix in one model the explanatory variables which are spatially non-stationary (like for ordinary GWR) and the stationary predictors (like used in MLR). Calibration of the semiparametric GWR model was performed for comparison using the GWR4.0 software.

The other decision in the presented scheme of selection of an optimal interpolation method concerns the possibility of extending the model with a stochastic component, i.e., interpolation of regression model residuals with ordinary kriging. This procedure is employed when there is a significant spatial autocorrelation of regression residuals, which is a basis for modelling a variogram that is different from the pure nugget effect. If the pure nugget effect was the only variogram possible to fit to the experimental variogram of regression residuals, the deterministic model would be corrected at each studied point by an average of the residuals, which equals zero, in accordance with the assumptions of MLR as a best linear unbiased predictor (BLUP) model. In the GWR model, which does not meet the criterion of unbiasedness, the average of the residuals varies, although it is sufficiently close to zero (Fotheringham et al. 2002) to assume that the modification of prediction by residual kriging is negligible in the absence of autocorrelation (Fotheringham et al. 2002). Therefore, the absence of autocorrelation of the residuals excludes the extension of the MLR or GWR models into MLRK or GWRK, respectively (Szymanowski et al. 2012).

Spatial autocorrelation is a mathematical expression of spatial relationships described by Tobler’s first law of geography (Tobler 1970), i.e., decreasing similarity of features of geographical objects along the increasing distance between them. It describes the degree of correlation of the variable value in one location with the value of the same variable in a different location, which implies that the values of the analysed variable determine and, concurrently, are determined by realisation of the variable in different locations. These relationships result in spatial clustering of similar values, which is referred to as positive autocorrelation. The value of the autocorrelation and its statistical significance was determined by calculating Moran’s I statistics (Moran 1950).

The deterministic component residuals were modelled with ordinary kriging (Eq. 5):

$$\hat{e}(s_{0} ) = \mathop \sum \limits_{i = 1}^{n} \lambda_{i} e(s_{i} ),$$
(5)

where \(\lambda_{i}\) are kriging weights determined from the spatial dependence of the deterministic component residuals and \(e(s_{i} )\)—residual in position \(s_{i}\). Hence, the full model of residual kriging based on the global regression MLRK model (Eqs. 2, 3, 5) can be expressed as the following equation:

$$\hat{z}(s_{0} ) = \hat{m}\left( {s_{0} } \right) + \hat{e}(s_{0} ) = \mathop \sum \limits_{k = 0}^{p} \hat{\beta }_{k} q_{k} (s_{0} ) + \mathop \sum \limits_{i = 1}^{n} \lambda_{i} e(s_{i} ),$$
(6)

whereas residual kriging based on the geographically weighted regression (GWR) model (Eqs. 2, 4, 5) can be expressed as the following equation:

$$\hat{z}(s_{0} ) = \hat{m}\left( {s_{0} } \right) + \hat{e}(s_{0} ) = \mathop \sum \limits_{k = 0}^{p} \hat{\beta }_{k} (s_{0} ) q_{k} (s_{0} ) + \mathop \sum \limits_{i = 1}^{n} \lambda_{i} e(s_{i} ).$$
(7)

Modelling of variograms of the regression residuals was carried out using automated fitting procedure in ArcGIS Geostatistical Analyst. This approach is based on the Levenberg–Marquardt method (Press et al. 1988) of non-linear least-squares approximation. The choice between candidate variogram models (spherical, circular, and exponential) was done basing on the fit quality, using the criterion of the lowest root-mean-squared error (RMSE). To avoid inconsistencies on the edges of local neighbours’ search area and subsequent artificial tearing of continuity of interpolated variable, global settings were used to specify kriging models, i.e., all data points were considered in determination of the weights at any interpolation point.

Due to the limited 111-element input data set, the quantification of the modelling results was performed using the leave-one-out cross-validation (CV) technique. It yielded a 111-element set of CV errors, which were used in the validation of the model in two ways. The values of the summary diagnostic measures were calculated and analysis of the spatial distribution of the CV errors was carried out, particularly in terms of systematic local and regional trends.

Three synthetic measures, i.e., mean error (ME), mean absolute error (MAE), and RMSE, were used in the analysis of the CV errors. The relationship between the sizes of the aforementioned errors can be defined as ME ≤ MAE ≤ RMSE, with the two latter measures having only non-negative values with the predicted zero value. The use of the square of the CV errors in the RMSE index makes the measure substantially biased even by an inconsiderable number of large errors, although the other errors may be small and acceptable. According to some researchers, MAE is regarded as the most natural diagnostic measure (Willmott and Matsuura 1995).

Since the differences in the MAE of CV errors may be negligible in the compared methods, the decision as to whether any of the methods yielded significant improvement, i.e., has a significantly lower MAE, was made by comparison of MAE ± \(\hat{\sigma }_{\text{MAE}}\) intervals, where \(\hat{\sigma }_{\text{MAE}}\) was a MAE calculation error. Assuming that the mean µ of CV errors is \(\mu_{\text{CV}} \cong 0\), and \(\hat{\sigma }_{\text{MAE}}\) for an n element set with standard deviation \(\sigma_{\text{CV}}\) can be calculated with the formula (Kalarus et al. 2010; Eq. 8):

$$\hat{\sigma }_{\text{MAE}} = \frac{{\sigma_{\text{CV}} }}{{\sqrt {\text{nc}} }},\quad c = \frac{\pi }{\pi - 2}.$$
(8)

Therefore, if the MAE ± \(\hat{\sigma }_{\text{MAE}}\) intervals for the two methods are disjoint, the method with a smaller MAE can be considered better. If, however, the analysed intervals have a certain common range, this statement is not justified.

The quantitative analysis of the model was also accompanied by visual evaluation, which by definition serves identification of features of the model that cannot be shown by methods based on the actual values of the interpolated variable. In particular, it facilitates detection of such little realistic effects as spatial discontinuity, unusually large or small values of the modelled variable, strong directional or regional trends, and various artefacts illustrating the characteristics of the interpolation algorithm rather than those of the interpolated variable. Although it is based on the expert knowledge of the modelled element, such validation is subjective and only complementary to quantitative evaluation.

A number of various computer programs were used in the study, both commercial packages and free software. The spatial analyses primarily concerning the structure of the layers of the explanatory variables and spatial interpolation as well as the final maps were generated in the ArcGIS and GIS GRASS software. The analysis of stepwise regression and the global model was carried out in the STATISTICA program, and the complementary analyses of geographically weighted regression (ANOVA, non-stationarity tests, calibration o semiparametric GWR model) were performed using the GWR3.0 (Fotheringham et al. 2002) and GWR4.0 (Nakaya 2016) packages. For cross validation, the R scripts were developed, using gstat and spgwr packages.

4 Results and Discussion

4.1 Ewert’s Index Spatial Predictors

The specification of the multiple linear regression (MLR) model using the stepwise method showed that only four environmental variables in the five analysed cases significantly determined the value of Ewert’s index: coordinates (X and Y), elevation (DEM), and the distance from the sea (SDI; Table 2). These variables indicate the global nature of the determinants of the continentality index distribution. It should be stressed that variables combining global and local factors, such as insolation (IT), regional factors, such as the foehn index (FI), or local factors reflecting the effect of the terrain relief (CCI) or land cover (AS, NS), were not introduced into the model in any of the analysed cases. These variables were, therefore, not correlated significantly with the continentality index based on air temperature amplitude, although they often proved to be significantly correlated with air temperature alone (Szymanowski et al. 2012, 2013). This may have been caused by the fact that the input data for the analysis presented in this paper were only obtained from synoptic stations that, in accordance with the WMO guidelines, had been located in a way to minimise local impacts; no data from climatological stations were available in the study.

Table 2 Explanatory variables and standardized regression coefficients in multiple linear regression (MLR) models of Ewert’s index of continentality (K) for the four selected years and the 1981–2010 average (descriptions of variables in the text)

Each time, a maximum of three variables was introduced into one model, and only elevation was included in all the five models. This is related to the expected decline in the temperature amplitude together with elevation. Variable X describing the rate of change in the east–west orientation was introduced four times, except for the index calculated for 1990 (Table 2). Similarly, one of the variables characterising the zonal distribution of Ewert’s index, Y, or SDI was introduced in four cases. Importantly, in none of the models, do these variables appear simultaneously. In the geographical conditions of Poland with the Baltic Sea in the north of the country, and given arrangement of meteorological stations, variable Y (northing) and the sea distance index SDI are correlated, and hence, only one of them was introduced into the regression model even if both were significantly correlated with the continentality index (SDI—3 times, Y—1 time; Table 2). There was no significant correlation with one of the zonal variables Y or SDI only in 2006.

The values of standardized regression coefficients indicate a varying impact of the environmental variables on the individual cases, which concurrently imply significant differences in the distribution of the continentality index. The sign of the coefficients identifies the direction of the relationships: in general, Ewert’s index of continentality decreases together with the terrain elevation and northwards and increases along the distance from the sea and eastwards (Table 2).

4.2 Regression Models

Global regression model (MLR) explains over 66 % of the Ewert’s index variability in four analysed cases (Table 3). The coefficient of determination was as low as 0.37 only for 2006. The model for the 1981–2010 average was the best fitted one (R 2adj.  = 0.90) with a 1.63 % standard error of estimation. In the poorest fitted model for 2006, STE was found to be 5.20 %.

Table 3 Selected statistics of multiple linear regression (MLR), geographically weighted regression (GWR), and mixed geographically weighted regression (MGWR) models of Ewert’s index of continentality (K) for the four selected years and the 1981–2010 average

Local geographically weighted regression models (GWR) for all the cases were characterised by a better fit to the observations. The highest increase in the coefficient of determination was found for 2006: from 0.37 for MLR to 0.75 for GWR. In the other cases, there was an increase in the explanation of the variance of Ewert’s index from +2 % for the multi-year average to +11 % for 1989. Increase in explained variance was accompanied by a simultaneous decrease in the STE value and a significant (>3) reduction in the Akaike information criterion (AICc, Table 3). The ANOVA analysis indicated a statistically significant decrease in the residual sum of squares of the GWR models, compared with MLR (Table 4).

Table 4 ANOVA of multiple linear regression (MLR), geographically weighted regression (GWR), and mixed geographically weighted regression (MGWR) models of Ewert’s index of continentality (K) for the four selected years and the 1981–2010 average

The Monte Carlo and geographical variability (GV) tests for spatial non-stationarity indicated possible spatial stationarity of the DEM variable GWR coefficients in two cases: for 1989 (GV test only) and 2002 (both tests, but the result of the GV test was not significant at 0.05; Table 5).

Table 5 Local parameter variability tests for geographically weighted regression (GWR) models of Ewert’s index of continentality (K) for the four selected years and the 1981–2010 average

The mixed MGWR models for the two cases considered were characterised by a slightly lower residual sum of squares than the GWR models (Table 4). However, the coefficients of determination and estimation errors were similar to the GWR model, and AICc did not indicate a significant improvement in the fit to the observations (Table 3).

The analysis presented in this section shows a better fit of the local GWR regression model than that of the global MLR model. This suggests the non-stationarity of the spatial process in each analysed case of the spatial distribution of Ewert’s index of continentality. No significant difference between the GWR and mixed MGWR models was found either. Thus, accordingly to Occam’s razor principle, where simpler models are preferable to more complex ones, because they are better testable and falsifiable, simpler GWR model was used for further analysis.

4.3 Residual Kriging Models

Extension of the regression models to the form of residual kriging was preceded by an analysis of the spatial autocorrelation of the regression residuals. Moran’s I statistics showed a statistically significant positive autocorrelation, i.e., clustering of similar values of residuals, for all the MLR models (Table 6). For the GWR residuals, a tendency towards clustering of similar values was observed in four cases, but the autocorrelation was not significant at the 0.05 level. In the case of year 2002, a random distribution of the GWR residuals, i.e., the absence of autocorrelation, was observed (Table 6).

Table 6 Spatial autocorrelation of multiple linear regression (MLR) and geographically weighted regression (GWR) residuals of Ewert’s index of continentality (K) for the four selected years and the 1981–2010 average

Here, as in the previous studies (Szymanowski et al. 2012, 2013), an exclusion criterion was used, i.e., the absence of a significant (<0.05) positive autocorrelation prevented correct fitting of the theoretical variogram different from the pure nugget effect. Thus, according to none significant autocorrelation of all GWR residuals (Table 6), variogram modelling was performed only for MLR residuals (Table 7). In all the cases, the spherical model was the best fitted (lowest or at least the same RMSE) while comparing to exponential and circular ones (Table 7).

Table 7 Variogram fitting evaluation and parameters of spherical variograms of multiple linear regression (MLR) residuals of Ewert’s index of continentality (K) for the four selected years and the 1981–2010 average

4.4 Cross-Validation Results

The diagnostic measures of CV errors of the continentality index for Polish synoptic stations are presented in Table 8. Summary statistics were calculated for the two regression models (MLR and GWR) and one residual kriging model (MLRK). The choice of the model that would be optimal considering cross-validation results was made based on ME, MAE, and RMSE measures and extreme errors (Table 8). MAE was regarded as the main measure in this case, since it facilitated assessment of the statistically significant difference between the error sizes and, consequently, identification of significant improvement provided by the model with lower MAE values. This is particularly important, as the differences in the diagnostic measures, including MAE, are often inconsiderable, particularly when similar categories of models, e.g., regression or residual kriging, are compared (Table 8).

Table 8 Summary statistics of cross-validation (CV) errors for multiple linear regression (MLR), geographically weighted regression (GWR), and multiple linear regression—kriging (MLRK) models of Ewert’s index of continentality (K) for the four selected years and the 1981–2010 average

The better fit of the GWR models, comparing with MLR, was also confirmed by cross validation. In general, the GWR model was characterised by better CV summary statistics in each analysed case. This did not imply that the model was better in terms of all the five measures employed. Sometimes, the ME or one of the extreme errors was closer to zero in MLR than in GWR. However, the MAE and RMSE for GWR were always lower than for MLR. Importantly, GWR was characterised by a significantly lower MAE only in 2006 (Table 8).

The absence of autocorrelation of GWR residuals limits the model of spatial variation only to explanation of the deterministic component. Thus, even if the fit of the GWR model is significantly better than that of MLR (as in the case of 2006; Table 3), GWRK is not effective when regression residuals are randomly distributed. Paradoxically, this may imply that a better fitted deterministic model does not necessarily guarantee better spatial interpolation. Consequently, the scheme of the optimal interpolator selection applied so far (Szymanowski et al. 2012, 2013) requires revision, so that it will consider the above-mentioned possibility of the absence of autocorrelation of GWR residuals accompanied by a significant positive autocorrelation of MLR residuals (Fig. 2). Therefore, a question arises whether a reverse situation, i.e., the absence of autocorrelation of MLR residuals accompanied by a significant positive autocorrelation of GWR residuals, should be considered in such scheme. Such situation is highly unlikely, since it would have to result from a better fit of the MLR than the GWR model. In fact, MLR can be regarded as a special case of GWR with the constant weight along the distance and all observation data points used for model calibration. This implies that the GWR model may work better or be comparable to MLR, but the reverse situation should be rather excluded. Thus, such option is not included in the proposed selection tree (Fig. 2).

Fig. 2
figure 2

Decision tree for selecting a suitable spatial prediction model under the assumption of existing environmental correlations (after Szymanowski et al. 2012, 2013, extended)

In all the analysed cases, the MLRK models have smaller errors than both the regression models. Moreover, due to MAE criterion, MLRK was significantly different from MLR and GWR, so it can be assumed the optimal spatial interpolation algorithm. To confirm the choice, an additional visual assessment of maps of the continentality index was carried out.

4.5 Maps of Ewert’s Index of Continentality

To present the characteristics and differences between the maps generated with the three spatial models analysed, two cases were selected, i.e., the years 2002 and 2006. Due to the large range and considerable variability of the index (Table 1), these were the most demanding cases in terms of spatial interpolation of the analysed data set. This was also reflected in the largest cross-validation errors (Table 8).

The largest differences between the fit of the MLR and GWR models were found for 2006 (Table 3). This was clearly reflected on maps generated with these two methods (Fig. 3a, b). The DEM and X variables included in the MLR model are responsible for the overall decrease in the index with elevation and an increase eastwards. The process, however, is highly non-stationary (Table 5) and the model significantly overestimates the values obtained on the Baltic Sea coast and underestimates the values from the rest of the country (Fig. 3a). The considerably better fitted GWR model substantially increases the value of the index in the central part of the country and slightly decreases it on the coast, contributing to the reduction of the CV errors size. The general tendency towards overestimation is still visible in the northern part of Poland (Fig. 3b). The map generated using the GWR method clearly emphasises the role of terrain relief, particularly in the northern part of the country. This is related to the small kernel bandwidth (32 points), i.e., the smallest of all the analysed cases (Table 3), which can locally emphasise the effect of the explanatory variable. The correction introduced by the geostatistical component is significant in the MLRK model (Fig. 3c) which can be regarded as optimal in this case.

Fig. 3
figure 3

Ewert’s index of continentality (K) in Poland in 2006 spatialized using selected interpolation methods

In the case of 2002, although the improvement of the fit introduced by the GWR model compared with MLR is statistically significant (Table 4), the maps of Ewert’s index do not show significant differences, the spatial distribution and size of the CV errors are similar (Fig. 4a, b), and the MAE criterion does not indicate a significant difference between the methods. All three models for 2002 exhibit a poor local fit to the observations in the central-northern part of the country (Fig. 4). Despite this limitation, MLRK, which is statistically significantly better than MLR and GWR in terms of the MAE criterion, can be regarded as an optimal model for 2002 (Table 8).

Fig. 4
figure 4

Ewert’s index of continentality (K) in Poland in 2002 spatialized using selected interpolation methods

In the other analysed cases, MLRK were also regarded as optimal models. This supported the conclusions drawn from the analysis of the cross-validation errors also confirming the necessity of introduction of a modified scheme of selection of an optimal predictor (Fig. 2).

The differences in the size, range, and variation of Ewert’s index indicated in the observations (Table 1) were also reflected on the maps (Fig. 5). The spatial distribution of the 1981–2010 multi-year average is characterised by a distinct upward trend from NW to SE with a clearly decreased value of the index in the top parts of the highest mountains. This is particularly visible in the western part of the Sudetes, SW Poland (Fig. 5a). Although discussion of the reasons for this situation lies beyond the scope of this paper, it should be noted that the differences in the distribution between the individual years in relation to the averaged field are significant. In 1990, there was a pronounced zonal trend in the distribution of the continentality index; in 2002, the distribution was longitudinal, and in 2006, high values of the continentality index covered large areas of the central Poland (Fig. 5). Such strong variability of the index distribution may indicate its analytical usefulness as a complementary indicator of macro- and mesoscale atmospheric processes.

Fig. 5
figure 5

Maps of Ewert’s index of continentality in Poland for the four selected years and the 1981–2010 average

5 Summary and Conclusions

The primary aim of the paper was to perform spatial interpolation and to generate maps of Ewert’s index of continentality in Poland for the selected years and an average from the period 1981–2010. The main emphasis was placed on the methodological side of spatial interpolation to develop guidelines for automation of the interpolation process.

Additional objective was to test, on the example of Ewert’s index, the validity of the scheme of selection of an optimal interpolation method developed by Szymanowski et al. (2012, 2013) for spatialization of air temperature. An optimal method was yielding the smallest errors and simultaneously allowing generation of an acceptable map on the basis of expert knowledge about the spatial characteristics of the interpolated variable.

The continentality index for the period 1981–2010 was calculated based on the daily data from 111 meteorological stations. Four years (1989, 1990, 2002, and 2006) characterised by extreme values of the areal average and index range were selected for the analysis. The set of analysed cases was complemented with the mean value of Ewert’s index for the multi-year period 1981–2010.

The set of potential predictors of the continentality index comprised nine environmental variables previously used for spatial interpolation of air temperature in Poland (Szymanowski et al. 2012, 2013). However, the stepwise regression analysis demonstrated that only four variables in the analysed cases exhibited a significant correlation with Ewert’s index, i.e., coordinates X and Y, elevation, and the distance from the sea. Regional or local variables were not significantly correlated with the index, which implies that the features of the continentality index field are mainly determined by macroscale and regional factors (distance from the Baltic Sea) with modification depending on terrain elevation. Noteworthy, the analysis was performed on the basis of the index calculated for the synoptic stations; the location of which, by definition, minimises the local impacts on climate elements. Data from lower-rank, climatological stations, where the impact may be more significant, were not included.

The correlation analysis performed to eliminate collinearity in the regression model additionally showed that variables Y (northing) and SDI were correlated in the geographical conditions of Poland and considering given set of meteorological station. Therefore, only one of them was included in the multiple linear regression MLR model (SDI—3 times, Y—1 time). The elevation variable was included in the models of each of the five cases, although this does not mean that its role was dominant. In fact, the role of the explanatory variables in the regression models varied between the cases, which indicates significant changes in the determinants and, hence, variable features of the Ewert’s index distribution. The general regularities of the distribution allow a conclusion that the continentality index for Poland decreases with elevation and northwards and increases along the distance from the sea and eastwards, which is also supported by the previous studies (Kożuchowski and Marciniak 1992).

The environmental correlation for each of the analysed cases was statistically significant, although for 2006, the MLR model explained solely 37 % of the Ewert’s index variation. In each case, the local GWR model was better fitted to the observations and it significantly improved the MLR regression results. The tests of the spatial variability of the GWR regression coefficients indicated their stationarity. However, compared with the GWR, the use of mixed local–global regression models (MGWR) did not significantly improve the fit.

The residuals of the MLR models were characterised by a positive spatial autocorrelation, which clearly justified extension of the model to the form of residual kriging, MLRK. In contrast, no statistically significant autocorrelation was found in any case of GWR, so the GWRK model was excluded from further considerations.

The cross validation was carried out for three models of each case of Ewert’s index. The local GWR model was characterised by smaller errors. In terms of the MAE criterion, MLR was found to be statistically significantly worse than GWR only for 2006. CV carried out for residual kriging models clearly indicated that the MLRK models produced smaller errors than both regression models. However, the specification of GWRK was not justified and gave grounds for modification of the scheme of optimal interpolator selection in the presence of the environmental correlation. The modified scheme of Szymanowski et al. (2013) now allows a situation where the absence of an autocorrelation of GWR residuals is accompanied by an autocorrelation of MLR residuals, which indicates that MLRK is an optimal method, even if GWR is better fitted to observations than MLR.

The visual assessment of the maps confirmed the results of the cross validation and the MAE criterion. The MLRK model is recommended to spatialize the Ewert’s index for Poland.

Based on the results obtained, it can be postulated that the following elements should be considered in the procedure of automation of the interpolation of the continentality index for Poland:

  • Regression models should be specified using four potential predictors: coordinates (X, Y), elevation (DEM), and distance from the sea (SDI).

  • The decision concerning selection of the interpolator should in each case be based on the proposed scheme (Fig. 2), including comparison of the fit of the regression models to observations by, e.g., comparing the coefficient of determination and autocorrelation regression residuals using Moran’s I statistics.

  • For autocorrelated residuals, the automatic fitting of the variogram can be done using the spherical model.

In the previous papers (Szymanowski and Kryza 2012; Szymanowski et al. 2012, 2013), it was assumed that the choice of an optimal interpolation method was determined by the stationarity or non-stationarity of the spatial process, which could be inferred from the better fit of one of the regression models, i.e., global or local. The results obtained in this study demonstrated greater importance of the geostatistical component in the universal model of spatial variation (Hengl 2007). Residual kriging is highly efficient if regression residues exhibit a strong positive autocorrelation contributing to the well fit of the theoretical variogram to the experimental one. This does not imply, however, a possibility of exclusion of the deterministic component if the environmental correlation is significant. Although this would lead to maintenance of the good fit of the model to observation in measurement points, the spatial distribution of the index would simultaneously “diverge” from environmental features beyond these points.