1 Introduction

Severe weather is an extreme meteorological event or phenomenon, which represents a real threat to human life and property (World Meteorological Organization 2004). Weather hazards may occur suddenly and not leave much time for reaction. Furthermore, these occurrences are spatially varied and should be identified by an area. Hence, knowledge about the spatial distribution and strength of dangerous atmospheric phenomena is crucial for reliable local risk assessment as well as effectiveness in preventing and mitigating weather disasters. Some of the most common dangerous weather phenomena that occur over Poland are fog, thunderstorms (lightning and hail) and icing (rime and glaze).

Understanding the spatial variability and intensity of severe weather phenomena makes it possible to determine regional sensitivity to extreme atmospheric hazards at different risk levels. Spatial prediction methods for several weather elements are relatively well recognized and widely described (e.g. Hartkamp et al. 1999; Chapman and Thornes 2003; Dobesch et al. 2007; Li and Heap 2008; Tveito et al. 2008; Sluiter 2009). The most frequently studied were: air temperature (e.g. Ninyerola et al. 2000, 2007b; Brown and Comrie 2002; Vicente-Serrano et al. 2003; Ustrnul and Czekierda 2005, Benavides et al. 2007; Szymanowski and Kryza 2013), solar radiation (e.g. Heuvelink and Griffith 2010; Kryza et al. 2010; Ruiz-Arias et al. 2011; Wang et al. 2014), precipitation (e.g. Ninyerola et al. 2000, 2007a; Martínez-Cob 1996; Brown and Comrie 2002; Vicente-Serrano et al. 2003; Wagner et al. 2012; Di Piazza et al. 2011), evapotranspiration (e.g. Martínez-Cob 1996; Vicente-Serrano et al. 2007; Shirin Manesh et al. 2013) and snow cover (e.g. López-Moreno and Nogués-Bravo 2005, 2006; López Moreno and Vicente-Serrano 2007; Blanchet and Lehning 2010). Thus far, not much attention has been paid so far to techniques for mapping some visually observed atmospheric phenomena (e.g. fog, dew, hoarfrost, icing, rime, glaze, thunderstorms), which are usually small-scale, often spatially discontinuous, inherently complicated and difficult to predict and/or measure. The volume of source data is also usually too small for interpolation because phenomena are visually observed only at synoptic and research weather stations where a human observer records observations. For spatially continuous mapping of atmospheric phenomena characteristics, estimation based on selected physical relationships is recommended (World Meteorological Organization 2011). This often requires application of predictive modelling and mapping techniques also known as geospatial data mining (Yuan et al. 2005; Shekhar et al. 2005) instead of traditional spatial interpolation methods (Berry 2005; Diem and Comrie 2002). In this case, geospatial predictive modelling and mapping techniques were used for estimating the spatial distribution of selected atmospheric phenomena over the area of Poland on the basis of rasterised environmental variables (Franklin 1995).

The main purpose of the study is to present an improved climate mapping method for evaluating the spatial distribution of selected small-scale atmospheric phenomena based on limited input data. Geospatial predictive models of thunderstorms, fog, glaze, and rime were developed to obtain spatially continuous maps. Cartographic versions of similar maps have already been published in the Meteorological hazard atlas of Poland (Ustrnul et al. 2014) where the spatial and temporal variability of each given phenomenon is discussed. In this study, the methodological approach is presented.

2 Study Area and Data

2.1 Weather Observation Data (in Situ Data)

The study area is Poland, a central European country (313,000 km2) characterized by a transitional climate with both oceanic and continental influences. Elevation varies from almost 2500 m above sea level in the Tatra Mountains in the south to 2 m below sea level in the north. Changing oceanic and continental influences together with complex physiography make weather phenomena in Poland difficult to predict.

Even though advanced measurement methods of weather parameters do exist, there is still an evident shortage of objective and quantitative observational data on atmospheric phenomena. Taking into account weather phenomena such as fog, thunderstorms, glaze, and rime, visual observations performed at weather stations serve as the only available data source for Poland.

However, there are several limitations of in situ observations, which have to be considered:

  • Subjectivity of the observations, which depend on the knowledge and experience of the observer,

  • The way observations are coded in the SYNOP dispatch; current weather code makes it impossible to obtain information on the intensity of each given phenomenon (no quantifiers available),

  • The local nature of atmospheric phenomena strongly depends on the variability of key local environmental conditions, such as topography and land cover.

All such limitations seriously complicate the use of interpolations method for the purpose of creating spatially continuous maps of hazardous atmospheric phenomena. Nevertheless, visual observation data were presented directly in form of graduated symbol maps (Fig. 1) to gain general view on variability of a given phenomenon over the Poland territory and gather an input data to modelling results validation.

Fig. 1
figure 1

Average annual number of days with thunderstorms (a), glaze (b), rime (c), and fog (d) observed at weather stations

Daily observational data for a period of 45 years (1966–2010) were used. Careful verification and homogenisation of the time series was done in accordance with WMO recommendations (World Meteorological Organization 2011) in order to complete the missing values. Finally, daily data on observed thunderstorms, fog, glaze, and rime obtained from 61 weather stations spread across Poland were included in the analysis (Fig. 2a).

Fig. 2
figure 2

Location of the weather stations used in the study (a) in comparison to NCEP/NCAR reanalysis and RegCM model gridded data (b)

For favourable meteorological conditions algorithms construction (see Sect. 3.1) and RegCM modelling validation, data from every 3 h observations were also used. Additionally, aerological data (soundings from 00 UTC and 12 UTC) from three Polish upper-air stations (Łeba, Wrocław, and Legionowo) were derived and used in glaze and thunderstorms algorithm construction and modelling results validation.

All observation data were derived from meteorological stations which work within the national network and were provided by the Institute of Meteorology and Water Management—National Research Institute.

2.2 Reanalysis and Regional Climate Model Data

Due to the lack of objective quantitative data derived from an adequate number of observations, additional environmental data were tested for statistical dependence (predictor testing). To obtain regularly-gridded coverage, a 45-year study period (1966–2010) was used with NCEP/NCAR reanalysis data (Kalnay et al. 1996; Kistler et al. 2001) with 6-h temporal resolution ([1] NCEP 2015). A 2.5 × 2.5 decimal degree gridded dataset contained information at the surface level (2 m above the ground) and 17 different pressure levels in the atmosphere. It was used as an input for dynamical downscaling (Wilby and Wigley 1997) carried out with the use of the RegCM model, version 4.1.1. (Elguindi et al. 2011). Thanks to its improved parameters associated with atmospheric physics and land cover patterns, the RegCM regional climate model is suitable for modelling weather conditions on a regional scale (Giorgi and Anyah 2012). As a result, homogenous gridded datasets for the surface layer (2 m) and 23 upper atmospheric levels were obtained with a spatial resolution of 20 km and a temporal resolution of 3 h (Fig. 2b).

2.3 Environmental Variables

Some atmospheric phenomena vary considerably even over a relatively small area. Because of this, valuable data on local meteorological conditions as well as information on several key environmental variables need to be included for proper climate mapping. In this study, fog was the variable most dependent on local environmental characteristics. According to COST Action 722 (Short range forecasting methods of fog, visibility and low clouds, Jacobs et al. 2007), topography and land use (connected also with soil moisture and vegetation) were concerned as the most important predictors. Several auxiliary parameters were extracted from an SRTM Digital Elevation Model v.4.1 (Jarvis et al. 2008) with spatial resolution at 90 m and from the CORINE Land Cover 2000 vector database (Bossard et al. 2000; [2] EEA Data and Maps 2015) with the smallest mapping unit at 25 hectares. Final maps were generated with a spatial resolution of 250 m, which seems to be adequate for climate mapping on the scale of Poland.

3 Methodology

Section 3 outlines the methods used for the map generation process, including a proposed approach to discontinuous phenomena mapping.

The whole workflow (Fig. 3) consisted of three steps which were finalised by three different types of climate maps:

Fig. 3
figure 3

General workflow scheme

  1. 1.

    Direct presentation of in situ observation (graduated symbols maps) (Sect. 2.1).

  2. 2.

    Conditional probability mapping based on favourable meteorological conditions algorithms (FMC maps) (Sects. 3.1 and 4.1).

  3. 3.

    Extending FMC map with environmental predictors employed in multiple linear regression kriging (FMEC map) (Sects. 3.2 and 4.2).

3.1 Predictive modelling based on favourable meteorological conditions (fmc) algorithms and gridded data

In order to obtain spatially continuous climate maps of the probability of occurrence of fog, thunderstorms, glaze, and rime an approach based on conditional probability algorithms and predictive modelling was employed. These methods are widely used in weather forecasting (Lorenc 1986; Keller and Kuchera 2004), but since they are based on the detailed analysis of real data, they can also be successfully applied in climate research (Thorne et al. 2012). The first step consisted of an extended analysis of long-term data from daily and 3 h weather observations. Favourable meteorological conditions for each phenomenon were determined through an analysis of all measured and observed weather parameters during the occurrence of each studied phenomenon. In the case of thunderstorms and glaze, upper-air soundings were additionally analysed for characteristic patterns. Finally, empirical formulas and thresholds were defined for all the selected phenomena. The entire process of algorithm definition was undertaken as a part of a task called “Maps of Meteorological Hazards” which was done within the framework of the EU co-financed project “IT system of the country’s protection against extreme hazards” (Polish acronym: ISOK) (Wypych et al. 2014; Ustrnul et al. 2015). The detailed procedure for the construction and validation of each algorithm is beyond the scope of this paper (an example for thunderstorms can be found in Walawender et al. 2015), which focuses mainly on mapping methods. Simplified versions of favourable weather conditions algorithm for each given phenomenon are shown in Table 1.

Table 1 Components of favourable weather conditions algorithms (simplified version)

Subsequently, algorithms were implemented with the use of RegCM modelled data (1966–2010) (see Sect. 2.2). Finally, the mean annual frequency of favourable conditions was calculated for each of the four studied phenomena. Gridded values were interpolated via an exact method (radial basis function—completely regularized spline) to keep the grid point values in the output geostatistical surface.

Validation of obtained RegCM modelling results was performed through implementation of FMC algorithms into available in situ data. Every 3 h data from weather stations without any missing values were used in the process. Finally, 1966–2010 data from 42 weather station were used for rime and fog FMC modelling validation. Validation of thunderstorms and glaze FMC map (RegCM results) was performed only for three weather-stations (Leba, Legionowo, Wroclaw—as FMC algorithms demanded data from upper-air soundings) using the data from 2003–2010 period because of the very poor upper weather data availability (described in Walawender et al. 2015).

3.2 Environmental Predictors and Regression Kriging: Fog Map Example

As the occurrence of fog is usually highly dependent on local environmental conditions, more predictors (apart from certain weather parameters) are needed to explain its spatial variability. The distribution of fog is first of all determined by relief and its interactions with the air. There exists a strong and complex relationship between fog occurrence and local topography (elevation and landform) and land cover patterns (e.g. amount of water bodies or green areas). These parameters are crucial for spatial prediction of fog (Jacobs et al. 2007). Predictive mapping makes it possible to integrate both meteorological and geographic data (Vicente-Serrano et al. 2010) to estimate the spatial distribution of a given variable.

In this study multiple linear regression (MLR, also known as environmental correlation) with additional residual interpolation (so called Multiple Linear Regression Kriging—MLRK, Hengl et al. 2004; Hengl 2007, 2009) was tested for mapping fog occurrences over the territory of Poland.

Exploratory analysis was carried out first to select an appropriate set of key environmental variables. The locations of all available weather stations were verified with the use of high resolution aerial photographs and orthophotomaps. All improperly assigned XY coordinates were adjusted by geocoding postal addresses. Finally, a location accuracy of 90 m (consistent with nominal SRTM pixel size) was achieved and made it possible to check relationships between predictors and input data with greater precision and reliability. Selected environmental predictors were converted into raster format. The diagram in Fig. 4 illustrates the process.

Fig. 4
figure 4

Fog environmental predictors (P1P5) and geoprocessing functions used (see Sects. 2.3 and 3.2 for explanations)

Topographic predictors were obtained from SRTM data. Elevation (meters above sea level) (P1) was extracted directly from SRTM data and further used as an input raster to calculate the Topographic Position Index (TPI) (Gallant and Wilson 2000). A dedicated ArcGIS extension developed by Jenness (2006) was used to calculate TPI on the basis of elevation data. TPI (P2) is an objective semi-automated landform classifier based on neighbourhood analyses. It shows the difference between the elevation in each pixel and the average elevation in its neighbourhood defined by a circle of arbitrary radius which depends on the level of detail assumed in the analysis. In this study, for the resampled 250 m DEM raster dataset, the radius of 200 cells was defined in order to extract large terrain features that differentiate key regional climate conditions. To obtain the final raster predictor (P2), TPI was recalculated into absolute values as both uplands (TPI ≫ 0) and valleys (TPI ≪ 0) favour the occurrence of fog.

Corine Land Cover (CLC), which was originally created as a vector dataset was transformed to a raster with 250 m resolution. Urban areas, green (vegetated) areas, as well as wetlands and water bodies were extracted from reclassified CLC data (Table 2).

Table 2 Reclassified and aggregated CLC Classes

Regarding conditions of fog occurrence, buffer and focal analyses were applied to obtain the percentage of urban/green/water areas within a 2.5 km buffer zone around each available weather station and similarly around each CLC raster cell. The analysis was repeated three times in order to compute three raster datasets (three predictors): percentage share of urban areas (P3), percentage share of green areas (P4), and percentage share of wetlands and water bodies (P5). The entire set of predictors (including P6—fog favourable meteorological conditions described in Sect. 2.2) are shown in Table 3.

Table 3 Fog environmental predictors chosen for testing

The selected predictors were examined with stepwise regression analysis to check the spatial relationships between the occurrence of key phenomena and corresponding predictors. The following statistical criteria were taken into account to identify the best fitted Multiple Linear Regression (MLR) model based on the ordinary least squares (OLS) method: adjusted R-squared (AdjR2), corrected Akaike Information Criterion (AICc), Koenker Statistic p value (K(BP)) and the Maximum Variance Inflation Factor (MaxVIF). Jarque–Bera (JB) test was performed on model residuals to indicate whether they are normally distributed. Global Moran’s I statistic p values (SA) were calculated to decide whether the regression residuals are spatially autocorrelated and further interpolation is required.

The probability of fog favourable meteorological and environmental conditions (FMEC) was calculated using map algebra on predictor raster datasets, applying selected MLR model—MLR output raster dataset. Regression residuals were then interpolated via simple kriging (recommended by Hengl et al. 2004; Hengl 2007, 2009)—SK output raster dataset. The final result (MLRK output raster dataset) was achieved by adding both the MLR raster dataset and the interpolated residuals raster dataset:

$$\begin{aligned}MLRK\;\;raster\; =MLR\;\;raster\;\;\nonumber \\ \quad+\;SK\;\;raster\;\end{aligned}$$

An appropriate FMEC map was then created. The prediction results of both the MLR and the MLRK procedures were cross-validated via a classic leave-one-out approach (Geisser 1975; Isaaks and Srivastava 1989). Error statistics including: mean error (ME), mean absolute error (MAE) and root mean square error (RMSE) were calculated to evaluate the obtained results (summarized in Table 4 after Li and Heap 2008). Their use in spatial interpolation and interpretation is widely known and well described (e.g. Willmott and Matsuura 2006; Szymanowski et al. 2012).

Table 4 Error metrics used in FMEC model validation

4 Results

4.1 Favourable meteorological conditions (fmc) maps

Predictive modelling based on FMC algorithms applied to RegCM gridded data was used to identify favourable conditions for selected weather phenomena (see Sect. 3.1 for details). FMC maps (Fig. 5) show spatially continuous probability of a day with favourable conditions for the occurrence of certain weather phenomena during the whole year. In other words, FMC maps show regional variability of weather conditions favouring the occurrence of phenomena such as thunderstorms, glaze, rime, and fog.

Fig. 5
figure 5

Average annual probability (%) of a day with thunderstorms (a), glaze (b), rime (c) and fog (d) favourable conditions occurrence. FMC maps

Validation of RegCM modelling itself was done through the calculation of FMC algorithms using available in situ data (Table 5). For thundersotrms, glaze and rime RegCM FMC calculation resulted as very similar to in situ FMC calculation with mean bias smaller than 1.5 %. In case of the fog FMC map RegCM results were strongly underestimated.

Table 5 Validation of RegCM FMC results against FMC calculated from available in situ data

FMC maps do not correspond exactly with observational data, as they present the probability of the occurrence of favourable conditions, which is not the same as the observed frequency of weather phenomena. Local variability of atmospheric phenomena makes FMC maps quite difficult to subject to any form of objective verification and validation of modelling results against actual observations. One weather station is often not representative of a particular climate region at mesoscale level (3–100 km). Standardized WMO instructions indicate typical location characteristics but it is not easy to confirm them, especially due to weather station exposure (World Meteorological Organization 2008). Of the created maps, only the thunderstorms frequency map can be interpreted as highly representative on a regional scale, which can be clearly seen from the results of correlation analysis between FMC maps and weather station data (Table 6). Thunderstorms FMC probability was found to be very strongly correlated (Pearson’s coefficient r > 0.8, at significance level p < 0.001) with the frequency of thunderstorm observations calculated via in situ data. Despite the occurrence of thunderstorms over a rather limited geographic area at a particular time of day, they usually are associated with convective processes arising within unstable air masses or frontal systems. In such situations the overall annual frequency of observations may be related to the whole region. In addition, the probability of glaze FMC is relatively strongly (r > 0.6, p < 0.001) correlated with observations. The occurrence of glaze is induced by freezing precipitation: hence it can be considered regionally dependent as well. Nevertheless, detailed regional variability remains beyond the spatial resolution of modelled maps.

Table 6 Correlation analysis between FMC results and phenomena observations (n = 61)

Statistical relationship between probabilities of rime and fog FMC and observations are low (r = 0.39 for rime) and weak (r = 0.15 for fog), correspondingly. Rime is in fact, a form of frozen fog caused mostly by the same environmental factors that affect the occurrence of fog. However, the meteorological conditions favouring the deposition of rime are slightly easier to recognize (Ahti and Makkonen 1982) and are also easier to predict. Rime occurrence is also limited to the cold season (in Poland from October to April), whereas fog may appear year-round in Poland and is strongly affected by local environmental conditions. This means some of the very local fog occurrences might not be identified due to the relatively coarse spatial resolution of RegCM data. The fog FMC map was found to be in agreement with the satellite-based map of fog and low stratus frequency created by Cermak et al. (2009) at comparable spatial resolution to that of the RegCM data but FMC frequency values were underestimated.

4.2 Favourable Meteorological and Environmental Conditions (FMEC) MAP: Fog Example

Statistical significance of additional fog predictors was examined through the stepwise regression analysis. The results were summarized in Table 7. Five out of six explanatory variables turned out to be statistically significant for most of the iterations and showed a consistent and stable relationship with the dependent variable. Predictors P1 (elevation), P2 (landform), P4 (% of green areas), and P6 (favourable meteorological conditions) were positively correlated with the dependent variable in 100 % of iterations and P3 (% of urban areas) was negatively correlated, which came into agreement with the initial assumptions made for this study. The predictor P5 (% of water/wetlands) resulted as insignificant and was excluded from the analysis—it seems that the water and wetlands identified within the buffer zones around weather stations seemed not to be sufficiently representative for the purpose of analysis.

Table 7 Summary of variables significance

The final MLR model was determined taking into account both the stepwise regression results (Table 8) and knowledge-based experience. The most complex model with five covariates (P1, P2, P3, P4 and P6) was selected. The model explained 89 % of variation in annual foggy day frequency (adj R 2 = 0.89). The corrected Akaike information criterion (AICc) was found to be the smallest for the selected model, which also show that it is relatively well-fitted. Maximum (from all VIF values checked for every environmental variable) VIF factor was also acceptable (<7.5); in effect, there is no redundancy (multicollinearity) among the model’s explanatory variables. The Koenker Statistic p-value test resulted in a not statistically significant value [K(BP) > 0.05], hence, the relationships modelled are consistent. Although covariates P3 and P4 in the selected regression model were not statistically significant at p < 0.1, it was decided to use them anyway because of their proven influence on fog occurrence (e.g. Sachweh and Koepke 1997; Jacobs et al. 2007), and overall high significance in the stepwise regression analysis (Table 8).

Table 8 Stepwise regression analysis results

While geographically weighted regression (GWR) (Brunsdon et al. 1996; Fotheringham et al. 1998, 2002) may seem to be more suitable for capturing local variability, it could not be used with reliable results in this study. First of all, the performed Koenker Test did not indicate a spatial nonstationarity which should characterised the GWR input data (Brunsdon et al. 1996; Gao and Li 2011). Moreover, number of observation points is regarded as insufficient for sensible GWR calibration. It is suggested that at least 150 data points should be employed in the analysis to build reliable local regressions models and capture local (not regional or global) trends (Fotheringham 2010). There are also a few weather stations in the East Poland which are spatially isolated by large areas over which no data are recorded. It could produce excessive bias in local estimation.

JB test indicated that regression residuals are not normally distributed and the model may need additional geostatistical modelling performed on its residals. Global Moran’s I test was performed (Table 9), as SA statistics (SA = 0.01) had shown spatial autocorrelation among regression residuals. Given a z-score of 2.02, there is a less than 5 % likelihood (p = 0.043) that the clustered pattern could be the result of random events and spatial autocorrelation is statistically significant.

Table 9 Global Moran’s I test summary

Due to the positive result of the spatial autocorrelation test performed on regression residuals, Simple Kriging with normal score transformation was employed for their interpolation. Figure 6 shows the output fog FMEC probability map generated through a combination of a deterministic regression part and residual interpolation (MLRK method).

Fig. 6
figure 6

Average annual probability (%) of day with fog occurrence. FMEC map

Cross-validation resulted in satisfying values of error statistics which were improved for the MLRK model (Table 10) compared to results achieved with the deterministic part (MLR) alone. Both the validation results and visual evaluation make it possible to regard the final FMEC map as valuable and representative on the scale of Poland. It reflects both the general trends presented in the Climatic atlas of Poland (Lorenc 2005) and detailed information on environmental variability. Moreover, the FMEC map was found to be in substantial accordance with some high resolution (1 km) remote sensing derivative products presenting the fog frequency (Musial et al. 2014).

Table 10 Cross-validation results

5 Conclusions

This study describes climate mapping techniques used for spatially discontinuous atmospheric phenomena. Thunderstorms, glaze, rime and fog were selected to test some of the existing methods and develop a predictive modelling approach for mapping location-dependent phenomena. Climatological maps based on a 45-year analysis (1966–2010) were created for Poland in order to identify distinct regions with a higher frequency and intensity of selected weather hazards. In case of hugely location dependent phenomena, observational data obtained from only 61 weather stations are definitely not sufficient for spatial interpolation over the entire territory of Poland. The number of data points is extremely important for spatial interpolation—the more data available, the better—but the complex nature of atmospheric phenomena makes interpolation even more difficult unless additional, environmental factors are also taken into account.

This study introduces quite an innovative method of using predictive modelling for climatological analysis of atmospheric phenomena over Poland. The presented approach remains valid for every relatively large area where orography and land cover are not homogeneous. The method proposed in this study uses a Regional Climate Model to yield regularly distributed gridded data and to assimilate them into algorithms describing favourable meteorological conditions for each given weather phenomenon. The algorithms have been previously validated on both observational and modelled weather data (Wypych et al. 2014; Ustrnul et al. 2015). The resulting FMC maps exhibit an adequate quality of prediction. Both quantitative validation and visual knowledge-based interpretation have confirmed a good estimation of the spatial distribution of selected phenomena over the entire territory of Poland. The maps show general patterns and relationships (Lorenc 2005) and allow to avoid wrong estimation which can occur during traditional interpolation of strongly local data from observations. As FMC maps were designed as regional, they are determined by the resolution of the input data (RegCM grid—20 km). For this reason, it is not possible to identify local differences in the maps, which should rather be considered as moderately generalized—but still valuable for regional analysis. The thunderstorm frequency FMC map is considered to be the most accurate map with very strong correlation with observational data (r = 0.84). The annual glaze and rime frequency FMC maps are also representative for the country area—considering the complexity of factors affecting these phenomena. Only the fog frequency FCM map proved to be strongly underestimated for southern Poland, as it does favour less heterogeneous areas in terms of orography and land cover.

Consequently, an additional fog frequency FMEC map was created with the use of multiple linear regression kriging (MLRK). Environmental correlation was based on several relevant predictors derived from elevation model and land-cover data—together with a previously obtained FMC map. The final map was found to be more accurate for analyzing the spatial patterns on a regional scale.

Although the generated maps cover just the area of Poland, the methodological background presented in this paper may be considered broadly applicable. An appropriate approach to geospatial predictive modelling should be preceded by detailed analysis of physical processes behind the analysed atmospheric phenomena. An extended exploration of key environmental factors affecting the modelled variable should be performed in order to recognise predictors, which may help to explain properly its spatial variability. The proper selection of predictors must avoid underestimation of the model, but also redundancy among proposed variables.

The modelling approach is characterised by some uncertainties, which one should be aware of in the analytical process. Some of these uncertainties can be at least partially eliminated. First, algorithms designed for favourable meteorological conditions may be improved by employing additional parameters or changing coefficients. Second, modelled data (RegCM in this study) are often biased because of certain model assumptions, so an improvement can be made in the model’s physical and topographical parameters’ configuration and downscaling techniques.

In this study, linear regression was used as a deterministic element of a spatial interpolation procedure which may cause some simplification in the final outcome. Although the results were found to be satisfactory, additional methods including nonlinear models (Vicente-Serrano et al. 2010) or artificial neural networks should be tested in the future to verify the possibility of further improvements. Among new and very powerful, spatially continuous environmental datasets which may be used as predictors for this type of analysis, satellite data seem to be invaluable and are increasingly being used for climatological analysis of fog (e.g. Bendix 2002; Cermak et al. 2009; Musial et al. 2014; Avotniece et al. 2015). The importance of climate maps based on remotely sensed data is even greater because of potential use for validation of modelling results in case of limited data from weather observations. Lightning detection and weather radar imagery complemented by conventional meteorological measurements offer opportunities for the validation of predictions of thunderstorms as well as freezing rain.

In addition to the limitations already mentioned, geospatial predictive modelling based on physical relationships between meteorological and environmental parameters favouring the occurrence of a particular atmospheric phenomenon possesses great potential for climate mapping and weather forecasting. Algorithms used to create maps in this study will be also incorporated into an operational numerical weather model.