A dynamic programming segmentation procedure for hydrological and environmental time series

Original Paper

Abstract

We present a procedure for the segmentation of hydrological and environmental time series. The procedure is based on the minimization of Hubert’s segmentation cost or various generalizations of this cost. This is achieved through a dynamic programming algorithm, which is guaranteed to find the globally optimal segmentations with K=1, 2, ..., K max segments. Various enhancements can be used to speed up the basic dynamic programming algorithm, for example recursive computation of segment errors and “block segmentation”. The “true” value of K is selected through the use of the Bayesian information criterion. We evaluate the segmentation procedure with experiments which involve artificial as well as temperature and river discharge time series.

Keywords

Time series Segmentation Change point Dynamic programming River discharge 

References

  1. Akaike H (1974) A new look at the statistical model identification. IEEE Trans Automat Control 19:716–723CrossRefGoogle Scholar
  2. Auger IE, Lawrence CE (1989) Algorithms for the optimal identification of segment neighborhoods. Bull Math Biol 51:39–54PubMedGoogle Scholar
  3. Basseville M, Nikiforov IV (1993) Detection of abrupt changes—theory and application. Prentice-Hall, Englewood CliffsGoogle Scholar
  4. Beeferman D, Berger A, Lafferty J (1999) Statistical models for text segmentation. Machine learning 34:177–210CrossRefGoogle Scholar
  5. Bellman R (1961) On the approximation of curves by line segments using dynamic programming. Commun ACM 4:284CrossRefGoogle Scholar
  6. Braun JV, Mueller H-G (1998) Statistical methods for DNA sequence segmentation. Stat Sci 13:142–162CrossRefGoogle Scholar
  7. Braun JV, Braun RK, Mueller H-G (2000) Multiple changepoint fitting via quasilikelihood, with application to DNA sequence segmentation. Biometrika 87:301–314CrossRefGoogle Scholar
  8. Chen H-L (2002) Testing hydrologic time series for stationarity. J Hydr Eng 7:129–136CrossRefGoogle Scholar
  9. Chen J, Gupta AK (1998) Information criterion and change point problem for regular models. Tech. Rep. No. 98-05, Department of Math. and Stat., Bowling Green State U., OhioGoogle Scholar
  10. Fortin V, Perreault L, Salas JD (2004) Retrospective analysis and forecasting of streamflows using a shifting level model. J Hydrol 296:135–163CrossRefGoogle Scholar
  11. Fortin V, Perreault L, Salas JD (2004) Analyse retrospective et prevision des debits en presence de changement de regime. 57e Congres Annuel de l’ Association Canadienne des Ressources HydriquesGoogle Scholar
  12. Himberg J, Korpiaho K, Mannila H, Tikanmaki J, Toivonen HTT (2001) Time series segmentation for context recognition in mobile devices. Proc. of ICDM 2001, pp 203–210Google Scholar
  13. Hipel KW, McLeod AI (1994) Time series modelling of water resources and environmental systems. Elsevier, AmsterdamCrossRefGoogle Scholar
  14. Hubert P (1997) Change points in meteorological analysis. In: Subba Rao T, Priestley MB, Lessi O (eds) Applications of time series analysis in astronomy and meteorology. Chapman and Hall, LondonGoogle Scholar
  15. Hubert P (2000) The segmentation procedure as a tool for discrete modeling of hydrometeorogical regimes. Stoch Env Res Risk Ass 14:297–304CrossRefGoogle Scholar
  16. Jackson B et al (2005) An algorithm for optimal partitioning of data on an interval. IEEE Signal Proc Lett 12:105–108CrossRefGoogle Scholar
  17. Kay SM (1998) Fundamentals of statistical signal processing: detection theory. Prentice-Hall, Englewood CliffsGoogle Scholar
  18. Kearns M, Mansour Y, Ng AY, Ron D (1997) An experimental and theoretical comparison of model selection methods. Machine Learn 27:7–50CrossRefGoogle Scholar
  19. Kehagias Ath (2004) A hidden Markov model segmentation procedure for hydrological and environmental time series. Stoch Environ Res Risk Assess 18:117–130CrossRefGoogle Scholar
  20. Keogh E et al (2001) An online algorithm for segmenting time series. In: Proceedings of the 2001 IEEE International Conference on Data Mining, pp 289–296Google Scholar
  21. Keogh E, Chu S, Hart D, Pazzani M (2003) Segmenting time series: a survey and novel approach, In: Data mining in time series databases. World Scientific Publishing Company, SingaporeGoogle Scholar
  22. Mann ME, Bradley RS, Hughes MK (1999) Northern hemisphere temperatures during the past millennium: inferences, uncertainties, and limitations. Geophys Res Lett 26:759–762CrossRefGoogle Scholar
  23. Ninomiya Y (2003) Asymptotic theory for change-point model and its application to model selection. Tech. Report, Graduate School of Mathematics, Kyusho Univ., JapanGoogle Scholar
  24. Pavlidis T, Horowitz SL (1974) Segmentation of Plane Curves. IEEE Trans Comput 23:860–870CrossRefGoogle Scholar
  25. Schwarz G (1978) Estimating the dimension of a model. Ann Stat 6:461–464CrossRefGoogle Scholar
  26. Sveinsson OGB, Salas JD, Boes DC, Pielke RA Sr (2003) Modeling the dynamics of long term variability of hydroclimatic processes. J Hydrometeorol 4:489–505CrossRefGoogle Scholar
  27. Yao Y-C (1988) Estimating the number of change-points via Schwarz’ criterion. Stat Prob Lett 6:181–189CrossRefGoogle Scholar

Copyright information

© Springer-Verlag 2005

Authors and Affiliations

  1. 1.Faculty of EngineeringAristotle University of ThessalonikiThessalonikiGreece

Personalised recommendations