Abstract
Motivated by a large dataset containing time series of weekly number of rainy days collected over two thousand locations across Europe and Russia for the period 2000–2010, we propose a new class of ARMA-like model for time series of bounded counts, which can also handle extra-binomial variation. We abbreviate this model as bvARMA, as it is based upon a novel operation referred to as binomial variation. After having discussed important stochastic properties and proposed a model-fitting approach relying on maximum likelihood estimation, we apply the bvARMA model family to the rainy-days time series. Results show that both bvAR and bvMA models are adequate and exhibit a similar performance. Furthermore, bvARMA results outperform those obtained by fitting ordinary discrete ARMA (NDARMA) models of the same order.
Similar content being viewed by others
Notes
It should be noted that the model proposed by Blight (1989) only sporadically meets an ARMA autocorrelation structure.
Out of 1713 stations, 603 are assigned to an AR model and 691 to an MA model, for both bvARMA and NDARMA. In contrast, 319 stations were assigned to bvAR(1) and NDMA(1) models whereas 100 stations to bvMA(1) and NDAR(1) models.
References
Agnew MD, Goodess CM, Hemming D, Giannakopoulos C, Bindi M, Dibari C, El-Askary H, El-Hattab M, El-Raey M, Ferrise R, Harzallah A, Hatzak M, Kostopoulou E, Lionello P, Abed SS, Sánchez-Arcilla A, Senouci M, Sommer R, Zoheir Taleb M, Tanzarella A (2013) Stakeholders. In: Navarra A, Tubiana L (eds) Regional assessment of climate change in the mediterranean. Springer International Publishing, Berlin, pp 23–37
Berchtold A (2002) High-order extensions of the double chain Markov model. Stoch models 18:193–227
Blight PA (1989) Time series formed from the superposition of discrete renewal processes. J Appl Probab 26:189–195
Chang TJ, Kavvas ML, Delleur JW (1984) Modeling of sequences of wet and dry days by binary discrete autoregressive moving average processes. J Clim Appl Meteor 23:1367–1378
Cui Y, Lund R (2009) A new look at time series of counts. Biometrika 96:781–792
Deckmyn A, Minka TP, Brownrigg R, Becker RA, Wilks AR (2017) maps: draw geographical maps. R package version 320
Delleur JW, Chang TJ, Kavvas ML (1989) Simulation models of sequences of dry and wet days. J Irrig Drain Eng 115:344–357
Ehelepola NBD, Ariyaratne K, Buddhadasa WMNP, Ratnayake S, Wickramasinghe M (2015) A study of the correlation between dengue and weather in Kandy City, Sri Lanka (2003–2012) and lessons learned. Infect Dis Poverty 4:42
Grunwald G, Hyndman RJ, Tedesco L, Tweedie RL (2000) Non-gaussian conditional linear AR(1) models. Aust Nz J Statist 42:479–495
Jacobs PA, Lewis PAW (1983) Stationary discrete autoregressive-moving average time series generated by mixtures. J Time Ser Anal 4:19–36
Khoo WC, Ong SH, Biswas A (2017) Modeling time series of counts with a new class of INAR(1) model. Stat Pap 58:393–416
Klein Tank AMG, Wijngaard JB, Können GP, Böhm R, Demarée G, Gocheva A, Mileta M, Pashiardis S, Hejkrlik L, Kern-Hansen C, Heino R, Bessemoulin P, Müller-Westermeier G, Tzanakou M, Szala S, Pálsdóttir T, Fitzgerald D, Rubin S, Capaldo M, Maugeri M, Leitass A, Bukantis A, Aberfeld R, van Engelen AFV, Forland E, Mietus M, Coelho F, Mares C, Razuvaev V, Nieplova E, Cegnar T, Antonio López J, Dahlström B, Moberg A, Kirchhofer W, Ceylan A, Pachaliuk O, Alexander LV, Petrovic P (2002) Daily dataset of 20th-century surface air temperature and precipitation series for the european climate assessment. Int J Climatol 22:1441–1453
Lacombe G, McCartney M (2014) Uncovering consistencies in indian rainfall trends observed over the last half century. Clim Change 2:287–299
Maldonado AD, Aguilera PA, Salmerón A (2016) Continuous Bayesian networks for probabilistic environmental risk mapping. Stoch Environ Res Risk Assess 30:1441–1455
Pavlopoulos H, Karlis D (2008) INAR(1) modeling of overdispersed count series with an environmental application. Environmetrics 19:369–393
Pegram GGS (1980) An autoregressive model for multilag markov chains. J Appl Probab 17:350–362
Pohl B, Macron C, Monerie PA (2017) Fewer rainy days and more extreme rainfall by the end of the century in Southern Africa. Sci Rep 7:46466
R Core Team (2017) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna
Scotto MG, Weiß CH, Silva ME, Pereira I (2014) Bivariate binomial autoregressive models. J Multivar Anal 125:233–251
Steutel FW, van Harn K (1979) Discrete analogues of self-decomposability and stability. Ann Probab 7:893–899
Thyregod P, Carstensen J, Madsen H, Arnbjerg-Nielsen K (1999) Integer valued autoregressive models for tipping bucket rainfall measurements. Environmetrics 10:395–411
Tian D, Martinez CJ, Asefa T (2016) Improving short-term urban water demand forecasts with reforecast analog ensembles. J Water Resour Plan Manage 142:04016008
Weiß CH (2013) Serial dependence of NDARMA processes. Comput Stat Data Anal 68:213–238
Weiß CH, Göb R (2008) Measuring serial dependence in categorical time series. AStA Adv Stat Anal 92:71–89
WHO (2014) Climatic factors and the occurrence of dengue fever, dysentery and leptospirosis in Sri Lanka 1996–2010: a retrospective study. Technical report 65
Wickham H (2009) ggplot2: elegant graphics for data analysis. Chapman & Hall/CRC monographs on statistics and applied probability. Springer, New York
Zucchini W, MacDonald IL (2009) Hidden Markov models for time series: an introduction using R. Chapman & Hall/CRC monographs on statistics & applied probability. CRC Press, Boca Raton
Acknowledgements
The authors thank the Associate Editor and the referees for carefully reading the article and for their comments, which greatly improved the article.
Author information
Authors and Affiliations
Corresponding author
Additional information
This research was supported by the German Academic Exchange Service (DAAD) and the Fundação para a Ciência e a Tecnologia (FCT), under the program “Ações Integradas Luso-Alemãs” and the Grants 57212119 and A-38/16. This work was partially supported by FCT, with national (MEC) and European structural funds through the programs FEDER, under the partnership agreement PT2020–within IEETA/UA project UID/CEC/00127/2013 (Instituto de Engenharia Electrónica e Informática de Aveiro, Aveiro) and CIDMA/UA project UID/MAT/04106/2013 (Centro de I&D em Matemática e Aplicações, Aveiro). S. Gouveia acknowledges the postdoctoral grant by FCT (ref. SFRH/BPD/87037/2012).
Appendices
Proofs
1.1 Proof of Theorem 2
For \(k\ge 1\), by conditioning,
Let us now look at the cross-covariances \(a_{k}:= Cov[X_{t}, \epsilon _{t-k}]\) with \(k\in {\mathbb {Z}}\), which satisfies \(a_{k}=0\) for \(k<0\). At lag 0, we obtain by conditioning (in analogy to above)
For lags \(k\ge 1\),
If we now define \(b_{k}:=a_{k}/\sigma _{\epsilon }^{2}\) for \(k=0,1,\dots , {q }\), we get the recursion stated in Theorem 2.
1.2 Derivation of conditional variance
Using (8) for the conditional factorial moments, we obtain
So we obtain
Note that the first-order autoregressive case (\({p }=1\)) gives
Again from (8) with \(r=2\), we obtain the unconditional 2nd factorial moment
Using the stationarity, it follows that
So the unconditional variance \(\sigma ^{2}=V[X_t]\) of the stationary process is given by
So Eq. (9) for the BID follows.
1.3 Cohen’s \(\kappa \) for bvAR(1) and bvMA(1) models
The computation of Cohen’s \(\kappa (h)\) (2) requires knowledge about the bivariate joint distributions with lag h. The bvAR(1) process constitutes a finite Markov chain with the 1-step-ahead transition probabilities \(p_{k|l}\) given in Eq. 11; let \(\mathbf{P }=(p_{k|l})_{k,l=0,\ldots ,n}\) denote the corresponding transition matrix. Then the h-step-ahead transition probabilities \(P(X_t=k\ |\ X_{t-h}=l)\) are the entries of the matrix \(\mathbf{P }^h\), and the stationary marginal distribution is given by the solution of the invariance equation \(\mathbf{P }\, \varvec{\pi }\ =\ \varvec{\pi }\), i. e., \(P(X_t=j)=\pi _j\).
For the bvMA(1) model, the stationary marginal distribution equals
see Example 2. Since \(X_t\) and \(X_{t-h}\) are independent for lags \(h>1\), we then have \(\kappa (h)=0\); so it suffices to compute the bivariate probabilities \(P(X_t=k, X_{t-1}=l)\) for time lag 1. Using that
and denoting \(p_{\epsilon ,u}:=P(\epsilon _t=u)\), we obtain
Simulation results from Sect. 4.2
Rights and permissions
About this article
Cite this article
Gouveia, S., Möller, T.A., Weiß, C.H. et al. A full ARMA model for counts with bounded support and its application to rainy-days time series. Stoch Environ Res Risk Assess 32, 2495–2514 (2018). https://doi.org/10.1007/s00477-018-1584-3
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00477-018-1584-3