Skip to main content

Advertisement

Log in

Capturing the time-dependence in the precipitation process for weather risk assessment

  • Original Paper
  • Published:
Stochastic Environmental Research and Risk Assessment Aims and scope Submit manuscript

Abstract

This work investigates the seasonal, long-term and time-dependence properties of the precipitation process using daily precipitation amount records in Calgary and develops occurrence–amount models which capture the complex properties of the process. Data show that: (a) the probability of precipitation occurrence not only depends on the occurrence of the previous days, but also depends on the amount values. Moreover this dependence varies with season. (b) The expected amount and its volatility depend on the occurrence as well as the amount on the previous days—these dependencies reveal seasonal patterns. (c) There is a strong long-term dependence in Calgary’s data. The proposed models in this work satisfy these properties. A large set of covariates is built to capture the complexity of various properties of the precipitation process. A grouped sequential model selection approach is used to pick the appropriate covariates which works by assigning the predictors to various groups and sequentially exploring them. This framework is assessed by comparing the simulations from the models to the observed data. This confirms a satisfactory agreement.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12

Similar content being viewed by others

References

  • Akaike H (1974) A new look at the statistical model identification. IEEE Trans Autom Control AC–19:716–723

    Article  Google Scholar 

  • Alexandridis A, Zapranis A (2013) Pricing approaches of temperature derivatives in weather derivatives. In: Weather derivatives: modeling and pricing weather-related risk. Springer, New York, p 55–85

    Chapter  Google Scholar 

  • Bellone E, Hughes JP, Guttorp P (2000) A hidden Markov model for downscaling synoptic atmospheric patterns to precipitation amounts. Clim Res 15:1–12

    Article  Google Scholar 

  • Caballero R, Jewson S, Brix A (2002) Long memory in surface air temperature: detection, modeling, and application to weather derivative valuation. Clim Res 21:127–140

    Article  Google Scholar 

  • Campbell S (2005) Weather forecasting for weather derivatives. J Am Stat Assoc Appl Case Stud 100(469):6–16

    Article  CAS  Google Scholar 

  • Chetner S and The Agroclimatic Atlas Working Group (2003) Agroclimatic atlas of Alberta, 1971 to 2000. Alberta Agriculture, Food and Rural Development, Agdex 071-1, Edmonton

  • Cigizoglu H, Bayazit M (1998) Application of Gamma autoregressive model to analysis of dry periods. J Hydrol Eng 3(3):218–221

    Article  Google Scholar 

  • Chin EH, Miller JF (1980) On the conditional distribution of daily precipitation amounts. Mon Weather Rev 108:1462–1464

    Article  Google Scholar 

  • Engle RF, Russell JR (1998) Autoregressive conditional duration: a new model for irregularly spaced transaction data. Econometrica 66:1127–1162

    Article  Google Scholar 

  • Fernandez B, Salas JD (1986) Periodic Gamma autoregressive processes for operational hydrology. Water Resour Res 22(10):1385–1396

    Article  Google Scholar 

  • Gaver DP, Lewis PAW (1980) First order autoregressive Gamma sequences and point processes. Adv Appl Probab 12:727–745

    Article  Google Scholar 

  • Gronewold A, Stow C, Crooks J, Hunter T (2013) Quantifying parameter uncertainty and assessing the skill of exponential dispersion rainfall simulation models. Int J Climatol 33(3):746–757

    Article  Google Scholar 

  • Hastie T, Tibshirani R, Friedman JH (2009) The elements of statistical learning, 2nd edn. Springer, New York

    Book  Google Scholar 

  • Hosseini R, Le N, Zidek J (2012) Time-varying Markov models for binary temperature series in agrorisk management. J Agric Biol Ecol Stat 17(2):283–305

    Article  Google Scholar 

  • Hosseini A, Fallahnezhad MS, Zare-Mehrjardi Y, Hosseini R (2012) Seasonal autoregressive models for estimating the probability of frost in Rafsanjan. J Nuts Relat Sci 3(2):45–52

    Google Scholar 

  • Hosseini R, Le N, Zidek J (2011a) Selecting a binary Markov model for a precipitation process. Environ Ecol Stat 18(4):795–820

    Article  Google Scholar 

  • Hosseini R, Le N, Zidek J (2011b) A characterization of categorical Markov chains. J Stat Theory Pract 5(2):261–284

    Article  Google Scholar 

  • Hosseini R, Takemura A, Hosseini A (2014) Nonlinear time-varying stochastic models for agroclimate risk assessment. Environ Ecol Stat 22(2):227–246

    Article  Google Scholar 

  • Hosseini R, Newlands NK, Dean CB, Takemura A (2015) Statistical modeling of soil moisture, integrating satellite remote-sensing (SAR) and ground-based data. Remote Sens 7(3):2752–2780

    Article  Google Scholar 

  • Huggins-Rawlins N (2003) Agroclimatic atlas of Alberta: agricultural climate elements. http://www1.agric.gov.ab.ca/$department/deptdocs.nsf/all/ sag6301. Accessed 1 Mar 2015

  • Hutton JL (1990) Nonnegative time series models for dry river flow. J Appl Probab 27:171–182

    Article  Google Scholar 

  • Ing C, Yang C (2014) Predictor selection for positive autoregressive processes. J Am Stat Assoc Theory Methods 109(505):243–253

    Article  CAS  Google Scholar 

  • Katz RW (1977) Precipitation as a chain dependent process. J Appl Meteorol 16:671–676

    Article  Google Scholar 

  • Kedem B, Fokianos K (2002) Regression models for time series analysis. In: Wiley series in probability and statistics. New York

  • Khalili K, Nazeri Tahoudi M, Mirabbasi R, Ahmadi F (2015) Investigation of spatial and temporal variability of precipitation in Iran over the last half century. Stoch Environ Res Risk Assess. doi:10.1007/s00477-015-1095-4

  • Lennartsson J, Baxevani A, Chen D (2008) Modelling precipitation in Sweden using multiple step Markov chains and a composite model. J Hydrol 363(1–4):42–59

    Article  Google Scholar 

  • Li Z, Brissette F, Chen J (2014) Assessing the applicability of six precipitation probability distribution models on the Loess Plateau of China. Int J Climatol 34(2):462–471

    Article  Google Scholar 

  • Liu J, Li Y, Sadiq R, Deng Y (2014) Quantifying influence of weather indices on PM2.5 based on relation map. Stoch Environ Res Risk Assess 28(6):1323–1331

    Article  Google Scholar 

  • Luca G, Gallo G (2009) Time-varying mixing weights in mixture autoregressive conditional duration models. Econom Rev 28(1–3):102–120

    Google Scholar 

  • Masala G (2014) Rainfall derivatives pricing with an underlying semi-Markov model for precipitation occurrences. Stoch Environ Res Risk Assess 28:717–727

    Article  Google Scholar 

  • Mékis E, Hogg WD (1999) Rehabilitation and analysis of Canadian daily precipitation time series. Atmos Ocean 37:53–85

    Article  Google Scholar 

  • Miller TD (1999) Growth stages of wheat: identification and understanding improve crop management. Texas A&M Agrilife Extension. http://varietytesting.tamu.edu/wheat/docs/mime-5.pdf. Accessed 1 Mar 2015

  • Mohapl J (2002) A precipitation occurrence model. Stoch Environ Res Risk Assess 16(2):143–154

    Article  Google Scholar 

  • Ramírez-Cobo P, Marzo X, Olivares-Nadal AV, Francoso JÁ, Carrizosa E, Pita F (2014) The Markovian arrival process: a statistical model for daily precipitation amounts. J Hydrol 510:459–471

    Article  Google Scholar 

  • Rodriguez-Iturbe I, Cox DR, Isham V (1987) Some models for rainfall based on stochastic point processes. Proc R Soc Lond A 410(1839):269–288

    Article  Google Scholar 

  • Sanso B, Guenni L (1999) A stochastic model for tropical rainfall at a single location. J Hydrol 214:64–73

    Article  Google Scholar 

  • Sanso B, Guenni L (2000) A non-stationary multisite model for rainfall. J Am Stat Assoc 95(452):1089–1100

    Google Scholar 

  • Sarlak N (2008) Annual streamflow modelling with asymmetric distribution function. Hydrol Process 22:3403–3409

    Article  Google Scholar 

  • Schwartz G (1978) Estimating the dimension of a model. Ann Stat 6:461–464

    Article  Google Scholar 

  • Sigrist F, Künsch HR, Stahel WA (2012) A dynamic nonstationary spatio-temporal model for short term prediction of precipitation. Ann Appl Stat 6(4):1452–1477

    Article  Google Scholar 

  • Sim CH (1987) A mixed Gamma ARMA(1, 1) model for river flow time series. Water Resour Res 23:32–36

    Article  Google Scholar 

  • Stern RD, Coe R (1984) A model fitting analysis of daily rainfall data. J R Stat Soc A147:1–34

    Google Scholar 

  • Stidd CK (1973) Estimating the precipitation climate. Water Resour Res 9:1235–1241

    Article  Google Scholar 

  • Svec J, Stevenson M (2007) Modelling and forecasting temperature based weather derivatives. Glob Finance J 18:185–204

    Article  Google Scholar 

  • Tabari H, Taye MT, Willems P (2015) Statistical assessment of precipitation trends in the Upper Blue Nile River Basin. Stoch Environ Res Risk Assess 29(7):1751–1761

    Article  Google Scholar 

  • Tesfaye YG, Meerschaert MM, Anderson PL (2006) Identification of periodic autoregressive moving average models and their application to the modeling of river flows. Water Resour Res 42(1):W01419

    Article  Google Scholar 

  • Tong H (1990) Non-linear time series, a dynamical systems approach. Oxford University Press, New York

    Google Scholar 

  • Wilks DS (1998) Multisite generalization of a daily stochastic precipitation generation model. J Hydrol 210:178–191

    Article  Google Scholar 

  • Wilks DS (1999) Interannual variability and extreme-value characteristics of several stochastic daily precipitation models. Agric For Meteorol 93(3):153–169

    Article  Google Scholar 

  • Wilks DS, Wilby RL (1999) The weather generation game: a review of stochastic weather models. Prog Phys Geogr 23(3):329–357

    Article  Google Scholar 

Download references

Acknowledgments

We would like to thank the Associate Editor and the anonymous referees for their careful review and valuable suggestions. The authors are also thankful to Prof. M. S. Owlia from Yazd University and Prof. A. Mohammadpour from Amirkabir University for fruitful discussions on the topic.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Alireza Hosseini.

Appendices

Appendix 1: Model implementation, computation and simulation

As discussed in Sect. 3.1, the proposed model in this work can be estimated using an extension of the GLMs to the time series context via the concept of partial likelihood (Kedem and Fokianos 2002; Hosseini et al. 2011b). The model discussed in this work is implemented with the free statistical software R (https://www.r-project.org/) and the estimation is fast for a single model fitted to our data (less than 0.1 s on a laptop with CPU: 2.9 GHz Intel Core i7, RAM: 8 GB). The parameter estimates for the occurrence and amount components with their standard errors are given in Tables 5 and 6. However the model selection can be computationally intensive because of the large number of covariates (lags, seasonal lags, non-linear lags, etc.). As discussed in Sect. 3.2, there are more than 80 covariates considered here and therefore more than \(2^{80} \approx 10^{24}\) models could be potentially compared—which is not computationally feasible. The GSMS approach overcomes this issue by grouping the data into smaller groups and sequentially exploring each group. For example, for a group with 10 covariates, we need to compare only \(2^{10}=1024\) models. The model selection task for this work was performed in less than 2 h. One way to make the computation faster is to use parallel computing as for a given group, e.g., with \(2^{10}=1024\) subsets, we can partition the power set into 8 partitions of size 128 and use separate cores to fit the models in each partition. By performing parallel computing with 8 cores we reduced the model selection time to less than 20 min on the same computer.

Table 5 Occurrence component parameters’ estimates and standard errors
Table 6 Amount component parameters’ estimates and standard errors

Appendix 2: Application to agro-climate risk

An important application of modeling precipitation series concerns weather risk assessment which plays a central role in many sectors, e.g., energy, agriculture, tourism, aviation and retail industry. Attention to hedging the weather risk using weather derivatives—which are financial instruments whose payoff are based on a specified weather event—has increased significantly from the turn of the century. Temperature weather derivatives are the most commonly used derivatives in the industry and various daily models are suggested in the literature for this purpose. These models usually consider a seasonal trend. Some authors, e.g., Campbell (2005) and Hosseini et al. (2012) use Fourier series for this purpose and Alexandridis and Zapranis (2013) use wavelet networks. The autocorrelation in the temperature series is captured by AR or moving average methods (Caballero et al. 2002; Campbell 2005; Svec and Stevenson 2007; Alexandridis and Zapranis 2013). Caballero et al. (2002) observed that autocorrelation in the temperature series in some cities in the US depends on the season and they suggested fitting a separate model to each season as an ad hoc solution. Hosseini et al. (2012, 2014) also observed this property and developed models with time-varying autocorrelation. Similar issues were seen when dealing with binary weather processes such as the binary temperature series of frost occurrence (below and above zero) and Hosseini et al. (2012) suggested models which capture the seasonal variations in the first- and second-order transition probabilities. This work showed similar time-varying properties of precipitation occurrence and amount and developed time-varying models which capture them. Below we show how these models can be utilized in practice.

We can use simulations from the developed models to calculate the risk of arbitrary hazard events or find the distribution of quantities of interest. An example of a hazard event is: there is a period of 1 month without any precipitation in the coming year. An example of a quantity of interest is: the total precipitation during the coming spring; or the number of wet days during the same period.

The simulation of future precipitation is performed as follows. First we estimate the parameters \(\mathbf{\beta }^{o}\) and \(\mathbf{\beta }^{a}\) by applying the extension of GLMs which also estimates the dispersion parameter for the amount model as discussed in Kedem and Fokianos (2002) (the dispersion parameter for the first model is equal to 1 and does not need to be estimated). At time \(t-1\) to simulate 1 day ahead, i.e., the precipitation amount at time \(t{\text {:}}\) (1) we calculate the covariate processes \(Z^{o}(t-1)\) and \(Z^{a}(t-1)\) which depend only on past observations, (2) we obtain the conditional means for the occurrence and amount, \(E\{O(t)| {\mathcal {F}}(t-1)\}\) and \(E\{Y(t)=1 | Y(t)>0,\,{\mathcal {F}}(t-1)\},\) using Eqs. 2 and 3, (3) the first conditional mean is the probability of precipitation at time t. We use a Bernoulli distribution to simulate the occurrence value, O(t),  which is 0 with that probability and 1 otherwise, (4) using the dispersion parameter and the conditional mean, the Gamma distribution is specified. Using this distribution, we simulate an amount value, A(t),  (5) the final value is obtained by multiplying O(t) and \(A(t){\text {:}}\,Y(t)=O(t)A(t).\) Appendix 1 provides more details about fitting the model, simulating from it and applications in agroclimate risk analysis.

Here we calculate the probability of the following event:

Event A: the total precipitation during the frost-free period during 2007 in Calgary is less than 150 mm and there are no dry periods longer than 2 weeks.

According to Agroclimatic Atlas of Alberta (Huggins 2003; Chetner 2003), the frost-free period in Calgary is considered to be between 20 May 20 and 14 September. In order to calculate the probability of Event A, we simulate 1000 series from the last available date (31 December 2006) to the end of 2007 and we calculate the proportion of the series for which the event occurs. This proportion is 30 %. However due to the parameter uncertainty in the models, this number is an estimate rather than an exact value. To obtain confidence interval for this value, we can use the extension of the GLMs methodology to time series (Kedem and Fokianos 2002) or parametric and non-parametric bootstrap methods. See Hosseini et al. (2011a, 2012) for more details on these approaches for binary series and Hosseini et al. (2012, 2014) for the continuous series of temperature. The 95 % confidence intervals obtained from these various approaches were similar. The non-parametric bootstrap confidence interval is (0.26, 0.33).

We can also use the models developed in this paper to obtain the distribution of quantities of interest, e.g., the longest dry spell in the growing season or frost-free period (Miller 1999). We define two quantities of interest:

Quantity B: longest dry spell in the frost-free period;

Quantity C: accumulated rain amount during the frost-free period.

In Fig. 13 the dashed curves are kernel density estimates fitted to the distribution of the observed quantities between the year 1900 and 2006. We also simulate 100 chains from the model for the same period (1900–2006). Then we build a density curve for each simulation (transparent grey). The figure shows good agreement between the model simulated densities and the data estimated density for both quantities.

Fig. 13
figure 13

(Left) the dashed curve shows the density of the longest dry spell in the frost-free period estimated from observed data between 1900 and 2006. 100 simulations of the model for the same period are used to estimate 100 density curves denoted in grey. (Right) the dashed curve shows the density of the accumulated precipitation amount (mm) in the frost-free period estimated from observed data between 1900 and 2006 by finding each year’s accumulated rain during the frost-free period and fitting a density curve. 100 simulations of the model for the same period are used to estimate 100 density curves (grey). The grey color is transparent and looks darker where there are multiple grey curves overlapping

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Hosseini, A., Hosseini, R., Zare-Mehrjerdi, Y. et al. Capturing the time-dependence in the precipitation process for weather risk assessment. Stoch Environ Res Risk Assess 31, 609–627 (2017). https://doi.org/10.1007/s00477-016-1285-8

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00477-016-1285-8

Keywords

Navigation