A dynamic programming segmentation procedure for hydrological and environmental time series
Original Paper
First Online:
- 209 Downloads
- 29 Citations
Abstract
We present a procedure for the segmentation of hydrological and environmental time series. The procedure is based on the minimization of Hubert’s segmentation cost or various generalizations of this cost. This is achieved through a dynamic programming algorithm, which is guaranteed to find the globally optimal segmentations with K=1, 2, ..., K max segments. Various enhancements can be used to speed up the basic dynamic programming algorithm, for example recursive computation of segment errors and “block segmentation”. The “true” value of K is selected through the use of the Bayesian information criterion. We evaluate the segmentation procedure with experiments which involve artificial as well as temperature and river discharge time series.
Keywords
Time series Segmentation Change point Dynamic programming River dischargeReferences
- Akaike H (1974) A new look at the statistical model identification. IEEE Trans Automat Control 19:716–723CrossRefGoogle Scholar
- Auger IE, Lawrence CE (1989) Algorithms for the optimal identification of segment neighborhoods. Bull Math Biol 51:39–54PubMedGoogle Scholar
- Basseville M, Nikiforov IV (1993) Detection of abrupt changes—theory and application. Prentice-Hall, Englewood CliffsGoogle Scholar
- Beeferman D, Berger A, Lafferty J (1999) Statistical models for text segmentation. Machine learning 34:177–210CrossRefGoogle Scholar
- Bellman R (1961) On the approximation of curves by line segments using dynamic programming. Commun ACM 4:284CrossRefGoogle Scholar
- Braun JV, Mueller H-G (1998) Statistical methods for DNA sequence segmentation. Stat Sci 13:142–162CrossRefGoogle Scholar
- Braun JV, Braun RK, Mueller H-G (2000) Multiple changepoint fitting via quasilikelihood, with application to DNA sequence segmentation. Biometrika 87:301–314CrossRefGoogle Scholar
- Chen H-L (2002) Testing hydrologic time series for stationarity. J Hydr Eng 7:129–136CrossRefGoogle Scholar
- Chen J, Gupta AK (1998) Information criterion and change point problem for regular models. Tech. Rep. No. 98-05, Department of Math. and Stat., Bowling Green State U., OhioGoogle Scholar
- Fortin V, Perreault L, Salas JD (2004) Retrospective analysis and forecasting of streamflows using a shifting level model. J Hydrol 296:135–163CrossRefGoogle Scholar
- Fortin V, Perreault L, Salas JD (2004) Analyse retrospective et prevision des debits en presence de changement de regime. 57e Congres Annuel de l’ Association Canadienne des Ressources HydriquesGoogle Scholar
- Himberg J, Korpiaho K, Mannila H, Tikanmaki J, Toivonen HTT (2001) Time series segmentation for context recognition in mobile devices. Proc. of ICDM 2001, pp 203–210Google Scholar
- Hipel KW, McLeod AI (1994) Time series modelling of water resources and environmental systems. Elsevier, AmsterdamCrossRefGoogle Scholar
- Hubert P (1997) Change points in meteorological analysis. In: Subba Rao T, Priestley MB, Lessi O (eds) Applications of time series analysis in astronomy and meteorology. Chapman and Hall, LondonGoogle Scholar
- Hubert P (2000) The segmentation procedure as a tool for discrete modeling of hydrometeorogical regimes. Stoch Env Res Risk Ass 14:297–304CrossRefGoogle Scholar
- Jackson B et al (2005) An algorithm for optimal partitioning of data on an interval. IEEE Signal Proc Lett 12:105–108CrossRefGoogle Scholar
- Kay SM (1998) Fundamentals of statistical signal processing: detection theory. Prentice-Hall, Englewood CliffsGoogle Scholar
- Kearns M, Mansour Y, Ng AY, Ron D (1997) An experimental and theoretical comparison of model selection methods. Machine Learn 27:7–50CrossRefGoogle Scholar
- Kehagias Ath (2004) A hidden Markov model segmentation procedure for hydrological and environmental time series. Stoch Environ Res Risk Assess 18:117–130CrossRefGoogle Scholar
- Keogh E et al (2001) An online algorithm for segmenting time series. In: Proceedings of the 2001 IEEE International Conference on Data Mining, pp 289–296Google Scholar
- Keogh E, Chu S, Hart D, Pazzani M (2003) Segmenting time series: a survey and novel approach, In: Data mining in time series databases. World Scientific Publishing Company, SingaporeGoogle Scholar
- Mann ME, Bradley RS, Hughes MK (1999) Northern hemisphere temperatures during the past millennium: inferences, uncertainties, and limitations. Geophys Res Lett 26:759–762CrossRefGoogle Scholar
- Ninomiya Y (2003) Asymptotic theory for change-point model and its application to model selection. Tech. Report, Graduate School of Mathematics, Kyusho Univ., JapanGoogle Scholar
- Pavlidis T, Horowitz SL (1974) Segmentation of Plane Curves. IEEE Trans Comput 23:860–870CrossRefGoogle Scholar
- Schwarz G (1978) Estimating the dimension of a model. Ann Stat 6:461–464CrossRefGoogle Scholar
- Sveinsson OGB, Salas JD, Boes DC, Pielke RA Sr (2003) Modeling the dynamics of long term variability of hydroclimatic processes. J Hydrometeorol 4:489–505CrossRefGoogle Scholar
- Yao Y-C (1988) Estimating the number of change-points via Schwarz’ criterion. Stat Prob Lett 6:181–189CrossRefGoogle Scholar
Copyright information
© Springer-Verlag 2005