Abstract
Given a set of multivariate time series, the clustering problem is to discover inherent groupings in the data according to how similar or dissimilar the time series are to one another. Existing time series clustering algorithms can be divided into three types: raw-based, feature-based, and model-based. In this paper, a model-based multivariate time series clustering algorithm is proposed. It performs its task in three steps: (i) data transformation; (ii) discovery of temporal patterns in the time series, using confidence values to represent the relationships between different variables; and (iii) clustering of the multivariate time series based on the degree to which they exhibit the patterns discovered in (ii). To evaluate its performance, the proposed algorithm is tested on both synthetic and real data. The results suggest that it is a promising algorithm for multivariate time series clustering.
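The three steps can be illustrated with a minimal Python sketch. The abstract does not specify the transformation, the exact pattern definition, or the clustering procedure, so everything concrete here is an assumption made for illustration: equal-width discretization stands in for the data transformation, a lag-1 conditional frequency between symbols of two variables stands in for the confidence value, and a small k-means routine stands in for the clustering step. The function names (`discretize`, `confidence_features`, `kmeans`, `make_series`) are hypothetical and are not taken from the paper.

```python
import random
from itertools import product


def discretize(series, n_bins=2):
    """Step (i), assumed: equal-width binning into symbols 0..n_bins-1."""
    lo, hi = min(series), max(series)
    width = (hi - lo) / n_bins or 1.0  # guard against a constant series
    return [min(int((v - lo) / width), n_bins - 1) for v in series]


def confidence_features(mts, n_bins=2, lag=1):
    """Step (ii), assumed: for every ordered pair of variables (i, j) and
    every symbol pair (a, b), estimate
    conf(a -> b) = P(var_j[t + lag] = b | var_i[t] = a).
    `mts` is a list of equal-length variables; `lag` must be >= 1."""
    symbols = [discretize(var, n_bins) for var in mts]
    features = []
    for i, j in product(range(len(mts)), repeat=2):
        if i == j:
            continue
        pairs = list(zip(symbols[i][:-lag], symbols[j][lag:]))
        for a, b in product(range(n_bins), repeat=2):
            n_a = sum(1 for x, _ in pairs if x == a)
            n_ab = sum(1 for x, y in pairs if x == a and y == b)
            features.append(n_ab / n_a if n_a else 0.0)
    return features


def dist2(p, q):
    """Squared Euclidean distance between two feature vectors."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans(points, k, n_iter=20):
    """Step (iii), assumed: k-means with deterministic farthest-point init."""
    centers = [points[0]]
    while len(centers) < k:
        centers.append(max(points, key=lambda p: min(dist2(p, c) for c in centers)))
    labels = [0] * len(points)
    for _ in range(n_iter):
        labels = [min(range(k), key=lambda c: dist2(p, centers[c])) for p in points]
        for c in range(k):
            members = [p for p, l in zip(points, labels) if l == c]
            if members:
                centers[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels


def make_series(sign, length=200, seed=0):
    """Synthetic two-variable series in which y[t + 1] = sign * x[t]."""
    rng = random.Random(seed)
    x = [rng.gauss(0, 1) for _ in range(length)]
    y = [0.0] + [sign * v for v in x[:-1]]
    return [x, y]


# Three series where y follows x, then three where y mirrors x.
dataset = [make_series(+1, seed=s) for s in range(3)]
dataset += [make_series(-1, seed=s) for s in range(3, 6)]
vectors = [confidence_features(mts) for mts in dataset]
labels = kmeans(vectors, k=2)
print(labels)
```

In this sketch each multivariate series is summarized by how strongly it exhibits each inter-variable pattern, so two series are grouped together when the same relationships between variables hold in both, regardless of their raw values.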
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Zhou, PY., Chan, K.C.C. (2014). A Model-Based Multivariate Time Series Clustering Algorithm. In: Peng, WC., et al. Trends and Applications in Knowledge Discovery and Data Mining. PAKDD 2014. Lecture Notes in Computer Science, vol 8643. Springer, Cham. https://doi.org/10.1007/978-3-319-13186-3_72
DOI: https://doi.org/10.1007/978-3-319-13186-3_72
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-13185-6
Online ISBN: 978-3-319-13186-3
eBook Packages: Computer Science, Computer Science (R0)