A Comparative Study of Similarity Measures for Time Series Classification

  • Sho Yoshida
  • Basabi ChakrabortyEmail author
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10091)


Time series data are found everywhere in the real world and their analysis is needed in many practical situations. Multivariate time series data poses problem for analysis due to its dynamic nature and traditional machine learning algorithms for static data become unsuitable for direct application. A measure to assess the similarity of two time series is essential in time series processing and a lot of measures have been developed. In this work a comparative study of some of the most popular similarity measures has been done with 43 benchmark data set from UCR time series repository. It has been found that, on the average over the different data sets, DTW performs better in terms of classification accuracy but it has high computational cost. A simple processing technique for reducing the data set to lower computational cost without much degradation in classification accuracy is proposed and studied. A new similarity measure is also proposed and its efficiency is examined compared to other measures.


  1. 1.
    Kantz, H., Schreiber, T.: Nonlinear Time Series Analysis. Cambridge University Press, Cambridge (2004)zbMATHGoogle Scholar
  2. 2.
    Buza, K., Nanopoulos, A., Schmidt-Thieme, L.: Time series classification based on individual error prediction. In: IEEE International Conference on Computational Science and Engineering (2010)Google Scholar
  3. 3.
    Chakraborty, B.: Feature selection and classification techniques for multivariate time series. In: Proceedings of ICICIC 2007, September 2007Google Scholar
  4. 4.
    Zhang, X., Wu, J., et al.: A novel pattern extraction method for time series classification. Optimiz. Engin. 10(2), 253–271 (2009)MathSciNetCrossRefzbMATHGoogle Scholar
  5. 5.
    Buza, K., Nanopoulos, A., Schmidt-Thieme, L.: Fusion of similarity measures for time series classification. In: Corchado, E., Kurzyński, M., Woźniak, M. (eds.) HAIS 2011. LNCS (LNAI), vol. 6679, pp. 253–261. Springer, Heidelberg (2011). doi: 10.1007/978-3-642-21222-2_31 CrossRefGoogle Scholar
  6. 6.
    Chakraborty, B.: A proposal for classification of multi sensor time series databased on time delay embedding. In: Proceedings of 8th International Conference on Sensing Technology ICST, Liverpool, England, pp. 31–35, September 2014Google Scholar
  7. 7.
    Chakraborty, B., Manabe, Y.: An efficient approach for person authentication using online signature verification. In: Proceedings of SKIMA 2008, pp. 12–17 (2008)Google Scholar
  8. 8.
    Keogh, E., Zhu, Q., Hu, B., Hao, Y., Xi, X., Wei, L., Ratanamahatana, C.A.: The UCR Time Series Classification/Clustering (2011).
  9. 9.
    Serra, J., Arcios, J.: An empirical evaluation of similarity measures for time series classification. Knowl. Based Syst. 67, 305–314 (2014)CrossRefGoogle Scholar
  10. 10.
    Giusti, R., Batista, G.: An empirical comparison of dissimilarity measures for time series classification. In: Brazilian Conference on Intelligent Systems (BRACIS 2013) (2013)Google Scholar
  11. 11.
    Xing, Z., Pei, J., Keogh, E.: A brief survey on sequence classification. ACM SIGKDD Explor. Newslett. 12(1), 40–48 (2010)CrossRefGoogle Scholar
  12. 12.
    Lal, T.N., et al.: Support vector channel selection in BCI. IEEE Trans. Biomed. Eng. 51(6), 1003–1010 (2004)CrossRefGoogle Scholar
  13. 13.
    Ye, L., Keogh, E.: Time series shapelets: a new primitive for data mining. In: Proceedings of ACM SIGKDD International Conference of Knowledge Discovery and Data Mining (2009)Google Scholar
  14. 14.
    Yoon, H., Yang, K., Sahabi, C.: Feature subset selection and feature ranking for multivariate time series. IEEE Trans. Knowl. Data Eng. 17(9), 1186–1198 (2005)CrossRefGoogle Scholar
  15. 15.
    Kim, S.B., Han, K.S., et al.: Some effective techniques for naive bayes text classification. IEEE Trans. Knowl. Data Eng. 18(11), 1457–1466 (2006)CrossRefGoogle Scholar
  16. 16.
    Wang, X., Mueen, A., Ding, H., Trajcevski, G., Scheuermann, P., Keogh, E.: Experimental comparison of representation methods and distance measures for time series data. Data Min. Knowl. Discov. 26, 275–309 (2013)MathSciNetCrossRefGoogle Scholar
  17. 17.
    Levenshtein, I.: Binary codes capable of correcting deletions, inssertions and reversals. Sov. Phys. Dokl. 10, 707–710 (1966)Google Scholar
  18. 18.
    Marteau, F.: Time warp edit distance with stiffness adjustment for time series matching. IEEE Trans. PAMI 31, 306–318 (2009)CrossRefGoogle Scholar
  19. 19.
    Alligood, K., Sauer, T., Yorke, J.A.: Chaos: An Introduction to Dynamical Systems. Springer, New York (1997)CrossRefzbMATHGoogle Scholar
  20. 20.
    Manabe, Y., Chakraborty, B.: Identity detection from online handwriting time series. In: Proceedings of SMCia08, pp. 365–370 (2007)Google Scholar
  21. 21.
    Aberbanel, H.D.I.: Analysis of Observed Chaotic Data’. Springer, New York (1996)CrossRefGoogle Scholar

Copyright information

© Springer International Publishing AG 2017

Authors and Affiliations

  1. 1.Graduate School of Software and Information ScienceIwate Prefectural UniversityTakizawamuraJapan

Personalised recommendations