Advertisement

Optimization and Engineering

, Volume 10, Issue 2, pp 253–271 | Cite as

A novel pattern extraction method for time series classification

  • Xiaohang Zhang
  • Jun WuEmail author
  • Xuecheng Yang
  • Haiying Ou
  • Tingjie Lv
Article

Abstract

Multivariate time series classification is of significance in machine learning area. In this paper, we present a novel time series classification algorithm, which adopts triangle distance function as similarity measure, extracts some meaningful patterns from original data and uses traditional machine learning algorithm to create classifier based on the extracted patterns. During the stage of pattern extraction, Gini function is used to determine the starting position in the original data and the length of each pattern. In order to improve computing efficiency, we also apply sampling method to reduce the searching space of patterns. The common datasets are used to check our algorithm and compare with the naive algorithms. Experimental results are shown to reveal that much improvement can be gained in terms of interpretability, simplicity and accuracy.

Keywords

Time series classification Patterns extraction Triangle distance Gini function 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Aach J, Church GM (2001) Aligning gene expression time series with time warping algorithms. Bioninformatics 17:495–508 CrossRefGoogle Scholar
  2. Abarbanel HDI, Carroll TA, Pecora LM, Sidorowich JJ, Tsimring LS (1994) Predicting physical variables in time-delay embedding. Phys Rev E 49:1840–1853 CrossRefGoogle Scholar
  3. Alcock RJ, Manolopoulos Y (1999) Time-series similarity queries employing a feature-based approach. In: Proceeding of the 7th Hellenic conference on informatics, Ioannina, Greece Google Scholar
  4. Alonso Gonzalez CJ, Rodriguez Diez JJ (2000) Time series classification by boosting interval based literals. Intel Artif Rev Iberoam Intel Artif 11:2–11 Google Scholar
  5. Ashkenazy Y, Ivanov PC, Havlin S, Peng CK, Goldberger AL, Stanley HE (2001) Magnitude and signal correlations in heartbeat fluctuations. Phys Rev Lett 86:1900–1903 CrossRefGoogle Scholar
  6. Berndt D, Clifford J (1994) Using dynamic time warping to find patterns in time series. In: AAAI workshop on knowledge discovery in databases, pp 229–248 Google Scholar
  7. Boshoff HFV, Grotepass M (1991) The fractal dimension of fricative speech sounds. In: Proceedings of the South African symposium on communication and signal processing, pp 12–61 Google Scholar
  8. Buchler JR, Kollath Z, Serre T, Mattei J (1996) Nonlinear analysis of the lightcurve of the variable star R Scuti. Astrophys J:462–489 Google Scholar
  9. Casdagli M, Mackay RS (1989) Nonlinear prediction of chaotic time series. Physica D 35:335–356 zbMATHCrossRefMathSciNetGoogle Scholar
  10. Chu S, Keogh E, Hart D, Pazzani M (2002) Iterative deepening dynamic time warping for time series. In: Proceeding of SIAM international conference on data mining, pp 195–212 Google Scholar
  11. Ding Q, Zhuang Z, Zhu L, Zhang Q (1999) Application of the chaos, fractal and wavelet theories to the feature extraction of passive acoustic signal. Acta Acust 24:197–203 Google Scholar
  12. Farmer JD, Sidorowich JJ (1988) Exploiting chaos to predict the future and reduce noise. In: Lee YC (ed) Evolution, learning, and cognition. World Scientific, Singapore, pp 277–330 Google Scholar
  13. Geurts P (2001) Pattern extraction for time series classification. In: Principles of data mining and knowledge discovery. LNAI, vol 2168. Springer, Berlin, pp 115–127 CrossRefGoogle Scholar
  14. Kadous MW (1999) Learning comprehensible descriptions of multivariate time series. In: Proceedings of the 16th international conference on machine learning, pp 454–463 Google Scholar
  15. Kadtke J (1995) Classification of highly noisy signals using global dynamical models. Phys Lett A 203:196–202 zbMATHCrossRefMathSciNetGoogle Scholar
  16. Keogh E, Kasetty S (2003) On the need for time series data mining benchmarks: a survey and empirical demonstration. Data Mining Knowl Discov 7(4):349–371 CrossRefMathSciNetGoogle Scholar
  17. Keogh E, Ratanamahatana CA (2005) Exact indexing of dynamic time warping. Knowl Inf Syst 7(3):358–386 CrossRefGoogle Scholar
  18. Kudo M, Toyama J, Shimbo M (1999) Multidimensional curve classification using passing-through regions. Pattern Recognit Lett 20(11–13):1103–1111 CrossRefGoogle Scholar
  19. Kyusung K, Parlos AG (2002) Induction motor fault diagnosis based on neuropredictors and wavelet signal processing. IEEE/ASME Trans Mechatron 7(2):201–219 CrossRefGoogle Scholar
  20. Manganaris S (1997) Supervised classification with temporal data. PhD thesis, Vanderbilt University Google Scholar
  21. Petry A, Augusto D, Barone C (2002) Speaker identification using nonlinear dynamical features. Chaos Solitons Fractals 13:221–231 zbMATHCrossRefGoogle Scholar
  22. Povinelli RJ, Johnson MT, Lindgren AC, Ye J (2004) Time series classification using Gaussian mixture models of reconstructed phase spaces. IEEE Trans Knowl Data Eng 16(6):779–783 CrossRefGoogle Scholar
  23. Rabiner L, Juang B (1986) An introduction to hidden Markov models. IEEE Mag Accoust Speech Signal Process 3(1):4–16 Google Scholar
  24. Rumelhart DE, MacClelland JL (1986) Parallel distributed processing: explorations in the microstructure of cognition, vol 1: foundations. MIT Press/Bradford Books, Cambridge Google Scholar
  25. Schulte-Frohlinde V, Ashkenazy Y, Ivanov PC, Glass L, Goldberger AL, Stanley HE (2001) Noise effects on the complex patterns of abnormal heartbeats. Phys Rev Lett 87:068104 CrossRefGoogle Scholar
  26. Sciamarella D, Mindlin GB (1999) Topological structure of chaotic flows from human speech chaotic data. Phys Rev Lett 82:1450–1453 CrossRefGoogle Scholar
  27. UCI KDD archive (2007) http://kdd.ics.uci.edu
  28. Vlachos M, Kollios G, Gunopulos D (2002) Discovering similar multidimensional trajectories. In: Proceeding of international conference on data engineering, pp 673–684 Google Scholar
  29. Yi BK, Faloutsos C (2002) Fast time sequence indexing for arbitrary Lp norms. In: Proceedings of international conference on very large databases, pp 385–394 Google Scholar

Copyright information

© Springer Science+Business Media, LLC 2008

Authors and Affiliations

  • Xiaohang Zhang
    • 1
  • Jun Wu
    • 1
    Email author
  • Xuecheng Yang
    • 1
  • Haiying Ou
    • 1
  • Tingjie Lv
    • 1
  1. 1.School of Economics and ManagementBeijing University of Posts and TelecommunicationsBeijingChina

Personalised recommendations