Advertisement

Discovering Key Sequences in Time Series Data for Pattern Classification

  • Peter Funk
  • Ning Xiong
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4065)

Abstract

This paper addresses the issue of discovering key sequences from time series data for pattern classification. The aim is to find from a symbolic database all sequences that are both indicative and non-redundant. A sequence as such is called a key sequence in the paper. In order to solve this problem we first we establish criteria to evaluate sequences in terms of the measures of evaluation base and discriminating power. The main idea is to accept those sequences appearing frequently and possessing high co-occurrences with consequents as indicative ones. Then a sequence search algorithm is proposed to locate indicative sequences in the search space. Nodes encountered during the search procedure are handled appropriately to enable completeness of the search results while removing redundancy. We also show that the key sequences identified can later be utilized as strong evidences in probabilistic reasoning to determine to which class a new time series most probably belongs.

Keywords

Time Series Time Series Data Pattern Classification Mining Sequential Pattern Open List 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Agrawal, R., Srikant, R.: Mining sequential patterns. In: Proceedings of the 11th International Conference on Data Engineering, pp. 3–14 (1995)Google Scholar
  2. 2.
    Chan, K.P., Fu, A.W.: Efficient time series matching by wavelets. In: Proceedings of the International Conference on Data Engineering, pp. 126–133 (1999)Google Scholar
  3. 3.
    Fayyad, U., Piatetsky-Shapiro, G., Smyth, P.: From data mining to knowledge discovery. In: Advances in Knowledge Discovery and Data Mining, pp. 1–36. MIT Press, Cambridge (1996)Google Scholar
  4. 4.
    Garofalakis, M.N., Rajeev, R., Shim, K.: SPIRIT: Sequential sequential pattern mining with regular expression constraints. In: Proceedings of the 25th International Conference on Very Large Databases, pp. 223–234 (1999)Google Scholar
  5. 5.
    Hayashi, A., Mizuhara, Y., Suematsu, N.: Embedding time series data for classification. In: Perner, P., Imiya, A. (eds.) MLDM 2005. LNCS, vol. 3587, pp. 356–365. Springer, Heidelberg (2005)CrossRefGoogle Scholar
  6. 6.
    Hetland, M. L.: A survey of recent methods for efficient retrieval of similar time sequences. In: Last, M., Kandel, A., Bunke, H. (eds.): Data Mining in Time Series Databases. World Scientific (2004) Google Scholar
  7. 7.
    Huang, C.-F., Chen, Y.-C., Chen, A.-P.: An association mining method for time series and its application in the stock prices of TFT-LCD industry. In: Perner, P. (ed.) ICDM 2004. LNCS, vol. 3275, pp. 117–126. Springer, Heidelberg (2004)CrossRefGoogle Scholar
  8. 8.
    Huhtala, Y., Kärkkäinen, J., Toivonen, H.: Mining for similarities in aligned time series using wavelets. In: Data Mining and Knowledge Discovery: Theory, Tools, and Technology, Orlando, FL. SPIE Proceedings Series, vol. 3695, pp. 150–160 (1999)Google Scholar
  9. 9.
    Keogh, E., Chakrabarti, K., Pazzani, M., Mehrotra, S.: Locally adaptive dimensionality reduction for indexing large time series databases. In: Proceedings of ACM SIGMOD Conference on Management of Data, Santa Barbara, CA, pp. 151–162 (2001)Google Scholar
  10. 10.
    Keogh, E., Chakrabarti, K., Pazzani, M., Mehrotra, S.: Dimensionality reduction for fast similarity search in large time series databases. Journal of Knowledge and Information Systems (2001)Google Scholar
  11. 11.
    Last, M., Klein, Y., Kandel, A.: Knowledge discovery in time series databases. IEEE Trans. Systems, Man, and Cybernetics — Part B: Cybernetics 31, 160–169 (2001)CrossRefGoogle Scholar
  12. 12.
    Nilsson, M., Funk, P.: A case-based classification of respiratory sinus arrhythmia. In: Funk, P., González Calero, P.A. (eds.) ECCBR 2004. LNCS, vol. 3155, pp. 673–685. Springer, Heidelberg (2004)CrossRefGoogle Scholar
  13. 13.
    Park, S., Chu, W.W., Yoon, J., Hsu, C.: Efficient search for similar subsequences of different lengths in sequence databases. In: Proceedings of the International Conference on Data Engineering, pp. 23–32 (2000)Google Scholar
  14. 14.
    Pray, K.A., Ruiz, C.: Mining expressive temporal associations from complex data. In: Perner, P., Imiya, A. (eds.) MLDM 2005. LNCS, vol. 3587, pp. 384–394. Springer, Heidelberg (2005)CrossRefGoogle Scholar
  15. 15.
    Ray, A.: Symbolic dynamic analysis of complex systems for anomaly detection. Signal Processing 84, 1115–1130 (2004)MATHCrossRefGoogle Scholar
  16. 16.
    von Schéele, B.: Classification Systems for RSA, ETCO2 and other physiological parameters. PBM Stressmedicine, Technical report (1999), http://www.pbmstressmedicine.se
  17. 17.
    Srikant, R., Agrawal, R.: Mining sequential patterns: Generalizations and performance improvements. In: Proceedings of the 5th International Conference on Extending Database Technology, pp. 3–17 (1996)Google Scholar
  18. 18.
    Tung, A.K.H., Lu, H., Han, J., Feng, L.: Breaking the barrier of transactions: Mining inter-transaction association rules. In: Proceedings of ACM Conference on Knowledge Discovery and Data Mining, pp. 297–301 (1999)Google Scholar
  19. 19.
    Wu, Y., Agrawal, D., Abbadi, A.E.: A comparison of DFT and DWT based similarity search in time series databases. In: Proceedings of the 9th ACM CIKM Conference on Information and Knowledge Management, McLean, VA, pp. 488–495 (2000)Google Scholar
  20. 20.
    Yoon, H., Yang, K., Shahabi, C.: Feature subset selection and feature ranking for multivariate time series. IEEE Trans. Knowledge and Data Engineering 17, 1186–1198 (2005)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Peter Funk
    • 1
  • Ning Xiong
    • 1
  1. 1.Department of Computer Science and ElectronicsMälardalen UniversityVästeråsSweden

Personalised recommendations