Model configurations
The WRF model is an open source code conceived and developed since the mid 90’s by NCAR (National Center for Atmospheric Research), NOAA (National Oceanic and Atmospheric Administration), U.S. Air Force, Naval Research Laboratory, University of Oklahoma, and the Federal Aviation Administration. WRF is a mesoscale forecasting system that solves the non hydrostatic fully compressible Euler equations on an Arakawa-C grid with mass-based terrain following coordinates. It is designed for both research and operational applications, capable of operating at spatial resolutions from hundreds of meters to hundreds of kilometers (Powers et al. 2017).
This work focuses on an operational setup currently running at CIMA on behalf of the Italian Civil Protection Department, namely the WRF-OL configuration. It is the Open Loop (OL) configuration without data assimilation. It has three two-way nested domains with 13.5, 4.5, and 1.5 km grid-spacing and with 50 vertical levels (Fig. 1). The innermost domain grid is composed of 943 × 883 points. Initial and hourly boundary conditions are taken from the NCEP-GFS with 0.25∘ grid-spacing. The model runs a 48 hours forecast every day, starting from 00 UTC.
WRF-OL is configured with the following physical parameterizations. The Yonsei University scheme (Hong et al. 2006) is chosen for the planetary boundary layer turbulence closure; the RRTMG shortwave and longwave schemes (Iacono et al. 2008; Mlawer et al. 1997; Iacono et al. 2000) are used for radiation; and the Rapid Update Cycle (RUC) scheme is chosen as a multi-level soil model (6 levels) with higher resolution in the upper soil layer (0, 5, 20, 40, 160, 300 cm) (Smirnova et al. 1997; Smirnova et al. 2000). For consistency with the GFS initial and boundary conditions, the New Simplified Arakawa-Schubert scheme (Han and Pan 2011) is used with the following distinct approaches. No cumulus scheme is activated in the two innermost domains (4.5 and 1.5 km grid-spacing), because the grid-spacing enables to resolve the convection dynamics, while it is activated in the outermost domain (13.5 km grid-spacing).
In the present work, a comparison between WRF-OL and COSMO-2I (ARPAE 2020) is performed. The COSMO-2I domain covers the entire Italian territory, the boundaries conditions are provided by COSMO 5M, which is nested in the ECMWF-IFS global model at 0.125°grid-spacing, while initial conditions are provided by the high resolution KENDA-LETKF deterministic analysis (Schraff et al. 2016). The grid is setup with 2.8 km horizontal grid-spacing and 65 vertical levels. Forty-eight-hour forecasts are provided with two daily runs at 00 and 12 UTC. All details on the setup of the COSMO-2I and COSMO 5M models can be found at ARPAE (2020).
Data description
This work aims to evaluate the WRF-OL precipitation predictive capability by comparing the quantitative precipitation forecast (QPF) fields obtained from the model runs with the observational quantitative precipitation estimates (QPEs).
QPF fields are directly obtained from the model outputs on their native grid. Since WRF forecasts are 48 hours long we will refer to RUN1 as the first 24 hours of the forecast and as RUN2 for the subsequent 24 hours.
Two observational rainfall datasets are considered: the ground-based meteorological station networks and the Italian Radar Network (IRN), both managed by the Italian Civil Protection Department. In particular, the ground-based sensors, that are thermometers, rain gauges, hygrometers and anemometers, belong to two networks: the Functional Centers Network (FCN) and Public Administrations Unique Network (PAUN). In total, there are 5222 ground-based sensors, among which 3551 are from PAUN and 4881 are from FCN (some stations belong to both networks). They have been collecting data since 2004 and 2006, respectively. Data from 4366 rain gauges from the two ground-based networks are used in the present work.
Radar data come from the IRN mosaic operated by the Italian Civil Protection (Vulpiani et al. 2008), that covers the whole Italian territory. In particular, hourly SRT (Surface Rainfall Total) maps are obtained opportunely summing the SRI (Surface Rainfall Intensity) maps, provided every 10 minutes.
The QPEs used in this study are obtained by merging the ground based precipitation data provided by both aforementioned rain gauges networks (stations belonging to both networks are counted only once) together with SRT maps from the IRN mosaic. The merging is performed with the Rainfusion method (Pignone et al. 2013; Sinclair and Pegram 2005; Silvestro et al. 2016) producing hourly maps at 1.5 km × 1.5 km horizontal resolution.
Fuzzy logic analysis
A fuzzy logic analysis allows a comparison between observed and forecast data, with the advantage that the final error is not calculated pointwise, but allowing a spatial window in which the comparison is performed (Ebert 2008). In this study, we exploit the anywhere-in-the-window fuzzy logic approach, as described by Ebert (2008). This is a special case of “minimum coverage” technique, which is an example of Neighbourhood Observation-Neighbourhood Forecast strategy. Ten thresholds of precipitation intensities, namely, 0.1, 0.2, 0.5, 1, 2.5, 5, 7.5, 10, 12 and 15 mm/3 h are considered and the agreement between QPE and QPF is evaluated starting from a single COSMO-2I pixel (2.8 km × 2.8 km = 7.84 km2) up to squares with 65 pixels per side (182 km × 182 km \(\simeq \) 33000 km2). The COSMO-2I grid is chosen as the common grid for the validation (for both modelled and observational products), because it is the coarsest. Remind that, in the fuzzy logic approach, an event is defined when the rainfall intensity (either observed or predicted) overcomes the given threshold (Ebert 2008). Three scores are calculated: the Fractions Skill Score (FSS), the Probability Of Detection (POD), and the False Alarm Ratio (FAR) (Ebert 2008).
The FSS is the main index summarizing the potential of a fuzzy logic verification. It directly compares the forecast and observation fields on a certain area affected by an event (defined when the precipitation exceeds a certain threshold in the unit time), gradually increasing the spatial dimension of the box on which the verification is carried out. It is given by
$$ \text{FSS} = 1 - \frac {\frac{1}{N}{\sum}_{i = 1}^{N} \left( P_{\text{fcs}}-P_{\text{obs}}\right)^{2}}{\frac{1}{N}{\sum}_{i = 1}^{N} P_{\text{fcs}}^{2} + \frac{1}{N}{\sum}_{i = 1}^{N} P_{\text{obs}}^{2}}, $$
(1)
where N is the number of verification boxes in the domain under study and P is the fraction of each single box in which the event occurs (the subscripts fcs and obs stand for “forecast” and “observed”, respectively). The FSS ranges from 0 (complete disagreement) to 1 (perfect agreement). The FSS is equal to 0 if there are no expected events but they do occur or, vice versa, if no events that have been foreseen occur. The FSS value above which the forecast is considered useful (better than the random data) is given by FSSuseful = 0.5 + f0/2, where f0 is the fraction of the domain covered by the observed event (Roberts and Lean 2008). The smallest spatial window for which FSS ≥ FSSuseful is considered to be the useful scale. As the dimensions of the spatial windows increase, the index tends asymptotically to a value between 0 and 1. The closer this value is to 1, the less the forecast is biased. The FSS is sensitive to rare events, which are intense precipitation peaks on limited areas.
The POD is the ratio of the correctly forecast events and the events that actually occurred (range: 0–1, perfect value: 1), namely
$$ \text{POD} = \frac {\textit{hits}}{\textit{hits + misses}}. $$
(2)
The FAR is the proportion of forecasts of the event that did not occur (range: 0–1, perfect value: 0). It is calculated as
$$ \text{FAR} = \frac {\textit{false alarms}}{\textit{hits + false alarms}} $$
(3)
and it measures the probability of false detection. For both POD and FAR computation, contingency tables are considered following the ‘minimum coverage’ technique introduced above, for different box dimensions and rainfall intensity thresholds.
These three scores have different meanings: the FSS measures how the skill of precipitation forecasts varies with spatial scale (Roberts and Lean 2008); POD and FAR indicate, respectively, the probability of detection of a certain kind of event and the rate at which it is likely to forecast an event which actually does not occur.
This analysis is carried out with three-hourly QPE and QPF in the period between March 2018 and February 2019, on a seasonal basis, following the conventional definitions of: spring MAM, summer JJA, fall SON and winter DJF. Considering seasons instead of months enables to have larger numbers of events and, thus, robust statistics. The whole Italian territory is considered, producing charts with the performances of the first 24 hours (RUN1) and second 24 hours (RUN2) of the forecasts in terms of FSS, POD and FAR. Both QPE and QPF are interpolated on the COSMO-2I grid at 0.025° (roughly 2.8 km), excluding the sea, as discussed above.