Abstract
Coherent forecasting for discrete-valued stationary time series is considered in this article. In the context of count time series, different methods of coherent forecasting such as median forecasting and mode forecasting are used to obtain \(h\)-step ahead coherent forecasting. However, there are not many existing works in the context of categorical time series. Here, we consider the case of a finite number of categories with different possible models, such as the Pegram’s operator-based ARMA(\(p\),\(q\)) model, the mixture transition distribution model and the logistic regression model, and study their \(h\)-step ahead coherent forecasting. Some theoretical results are derived along with some numerical examples. To facilitate comparison among the three models, we use some forecasting measures. The procedure is illustrated using one real-life categorical data, namely the infant sleep status data.
Similar content being viewed by others
References
Agresti, A.: Categorical data analysis. Wiley, New Jersey (2002)
Al-Osh, M., Alzaid, A.A.: First-order integer-valued autoregressive (INAR(1)) process. J. Time Ser. Anal. 8(3), 261–275 (1987)
Alzaid, A., Al-Osh, M.: An integer-valued pth-order autoregressive structure (INAR(p)) process. J. Appl. Probab. 27, 314–324 (1990)
Atkinson, K.: An introduction to numerical analysis. Wiley, New Delhi (2008)
Berchtold, A., Raftery, A.E.: The mixture transition distribution model for high-order Markov chains and non-gaussian time series. Stat. Sci. 17(3), 328–356 (2002)
Biswas, A., Guha, A.: Time series analysis of categorical data using auto-mutual information. J. Stat. Plan. Inference 139(9), 3076–3087 (2009)
Biswas, A., Song, P.X.-K.: Discrete-valued ARMA processes. Stat. Prob. Lett. 79(17), 1884–1889 (2009)
Brockwell, P.J., Davis, R.A.: Time series: theory and methods. Springer, Berlin (2002)
Bu, R., McCabe, B.: Model selection, estimation and forecasting in INAR(p) models: a likelihood-based Markov chain approach. Int. J. Forecast. 24(1), 151–162 (2008)
Carruth, J., Tygert, M., Ward, R.: A comparison of the discrete Kolmogorov–Smirnov statistic and the euclidean distance (2012). arXiv:1206.6367
Fokianos, K., Kedem, B.: Regression theory for categorical time series. Stat. Sci. 18(3), 357–376 (2003)
Freeland, R.K., McCabe, B.P.: Forecasting discrete valued low count time series. Int. J. Forecast. 20(3), 427–434 (2004)
Jacobs, P.A., Lewis, P.A.: Discrete time series generated by mixtures III: Autoregressive processes (DAR(p)). Technical report, Naval Postgraduate School, Monterey (1978c)
Jacobs, P.A., Lewis, P.A.: Discrete time series generated by mixtures. I: correlational and runs properties. J. Royal Stat. Soc. Ser. B (Methodological) 40(1), 94–105 (1978a)
Jacobs, P.A., Lewis, P.A.: Discrete time series generated by mixtures II: asymptotic properties. J. Royal Stat. Soc. Ser. B (Methodological) 40(2), 222–228 (1978b)
Jacobs, P.A., Lewis, P.A.: Stationary discrete autoregressive–moving average time series generated by mixtures. J. Time Ser. Anal. 4(1), 19–36 (1983)
Jung, R.C., Tremayne, A.R.: Coherent forecasting in integer time series models. Int. J. Forecast. 22(2), 223–238 (2006)
McKenzie, E.: Discrete variate time series. Handbook of statistics 21, 573–606 (2003)
McKenzie, E.: Some simple models for discrete variate time series. J. Am. Water Resour. Assoc. 21(4), 645–650 (1985)
McKenzie, E.: Some ARMA models for dependent sequences of Poisson counts. Adv. Appl. Probab. 20(4), 822–835 (1988)
Pegram, G.: An autoregressive model for multilag Markov chains. J. Appl. Probab. 17(2), 350–362 (1980)
Raftery, A.E.: A model for high-order Markov chains. J. Royal Stat. Soc. Ser. B (Methodological) 47(3), 528–539 (1985)
Silva, N., Pereira, I., Silva, M.E.: Forecasting in INAR(1) model. REVSTAT Stat. J. 7(1), 119–134 (2009)
Stoffer, D.S., Scher, M.S., Richardson, G.A., Day, N.L., Coble, P.A.: A Walsh–Fourier analysis of the effects of moderate maternal alcohol consumption on neonatal sleep-state cycling. J. Am. Stat. Assoc. 83(404), 954–963 (1988)
Stoffer, D.S., Tyler, D.E., Wendt, D.A.: The spectral envelope and its applications. Stat. Sci. 15(3), 224–253 (2000)
Weiß, C.H., Göb, R.: Measuring serial dependence in categorical time series. Adv. Stat. Anal. 92(1), 71–89 (2008)
Weiß, C.H.: Empirical measures of signed serial dependence in categorical time series. J. Stat. Comput. Simul. 81(4), 411–429 (2011)
Weiß, C.H.: Serial dependence of NDARMA process. Comput. Stat. Data Anal. 68(1), 213–238 (2013)
Acknowledgments
The authors wish to thank the three anonymous referees and the associate editor for their careful reading and constructive suggestions which led to this improved version of the paper.
Author information
Authors and Affiliations
Corresponding author
Appendix
Appendix
1.1 Appendix A : Proof of Theorem 2
From the model (3.2), the 1-step ahead conditional distribution is given by
with \(\eta _{1l}=\phi _{l}\), \(l=1,\ldots ,p\). Then the two-step ahead conditional distribution is given by
where \(\varvec{\eta _2}=\varvec{\Phi }\varvec{\phi }\). So the result is true for \(h=2\). Let it be true for \((h-1)\), that is \(\varvec{\eta _{h-1}}=\varvec{\Phi }^{h-2}\varvec{\phi }\). Then by induction it is straightforward to show that the \(h\)-step ahead conditional distribution is given by (3.5).
1.2 Appendix B : Proof of Theorem 3
To prove the Theorem 3, it is enough to show that \(\displaystyle \lim _{h\rightarrow \infty }\eta _{hi}=0\) for all \(i\). To show this we use the result that for any \(n \times n\) matrix \(A\) with its eigenvalues \(\lambda _1, \lambda _2,\ldots ,\lambda _s\), \(\displaystyle \lim _{k \rightarrow \infty }A^k=0\) if the spectral radius of \(A\), \(\rho (A)<1\) where \(\rho (A)=\max \{\left| \lambda _1\right| , \left| \lambda _2\right| ,\ldots ,\left| \lambda _s\right| \}\) (See Atkinson 2008). Outline of the proof is given follows.
From the Jordan normal theorem, for any \(n \times n\) matrix \(A\), there exist a non-singular matrix \(V\) and a block diagonal matrix \(J\) such that
for
where the \(m_{i}\times m_{i}\) matrix \(J_{m_{i}}(\lambda _{i})\) being
Now
and, since \(J\) is block diagonal,
Now a standard result on the \(k\)th power of an \(m \times m\) Jordan block states that, for \(k \ge m \),
Since \(\rho (A)<1\), i.e., \(|\lambda _i|<1\) for all \(i\) and \(\displaystyle \lim _{k \rightarrow \infty }{k \atopwithdelims ()i}\lambda ^{k-i}=0\), and hence \(\displaystyle \lim _{k \rightarrow \infty } J_m^k(\lambda )=0\), . This implies that \(\displaystyle \lim _{k \rightarrow \infty }J^k=0\). Therefore,
Note that the eigenvalues of \(\varvec{\Phi }\) are \(\phi _1, \ldots ,\phi _p\) all of which lie between 0 and 1, and hence \(\displaystyle \lim _{h \rightarrow \infty }\varvec{\Phi }^h=0\). Consequently \(\displaystyle \lim _{h \rightarrow \infty } \varvec{\eta }_h=\displaystyle \lim _{h \rightarrow \infty }\varvec{\Phi }^{h-1}\varvec{\phi } =(\displaystyle \lim _{h \rightarrow \infty }\varvec{\Phi }^{h-1})\varvec{\phi }=0\).
1.3 Appendix C: Pegram’s MA(2) model
Here for \(h=1\),
where
and
Similarly for \(h=2\),
where
and
And for \(h>2,\; P(Y_{n+h}=C_{i}| Y_{n}, Y_{n-1}, \ldots )=p_{i}\).
Rights and permissions
About this article
Cite this article
Maiti, R., Biswas, A. Coherent forecasting for stationary time series of discrete data. AStA Adv Stat Anal 99, 337–365 (2015). https://doi.org/10.1007/s10182-014-0243-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10182-014-0243-3