Advertisement

Sequence Mining-Based Support Vector Machine with Decision Tree Approach for Efficient Time Series Data Classification

  • D. SenthilEmail author
  • G. Suseendran
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 1016)

Abstract

The growing demand for an efficient approach to classify time series data is bringing forth numerous research efforts in data mining field. Popularly known applications like business, medical and meteorology and so on, typically involves majority of data type in the form of time series. Hence, it is crucial to identify and scope out the potential of time series data owing to its importance on understanding the past trend as well as predicting about what would occur in future. To efficiently analyze the time series data, a system design based on Sliding Window Technique-Improved Association Rule Mining (SWT-IARM) with Enhanced Support Vector Machine (ESVM) has been largely adopted in the recent past. However, it does not provide a high accuracy for larger size of the dataset along with huge number of attributes. To solve this problem the proposed system designed a Sequence Mining algorithm-based Support Vector Machine with Decision Tree algorithm (SM-SVM with DT) for efficient time series analysis. In this proposed work, the larger size of the dataset is considered along with huge number of attributes. The preprocessing is performed using Kalman filtering. The hybrid segmentation method is proposed by combining a clustering technique and Particle Swarm Optimization (PSO) algorithm. Based on the sequence mining algorithm, the rule discovery is performed to reduce the computational complexity prominently by extracting the most frequent and important rules. In order to provide better time series classification results, the Support Vector Machine with Decision Tree (SVM-DT) method is utilized. Finally, the Pattern matching-based modified Spearmen’s rank correlation coefficient technique is introduced to provide more similarity and classification results for the given larger time series dataset accurately. The experimental results shows that the proposed system achieves better accuracy, time complexity and rule discovery compared with the existing system.

Keywords

Classification Time series Hybrid segmentation Pattern matching Accuracy 

References

  1. 1.
    Wei, W. W. (2006) Time series analysis. In The Oxford handbook of quantitative methods in psychologyGoogle Scholar
  2. 2.
    Lu, C. J., Lee, T. S., & Chiu, C. C. (2009). Financial time series forecasting using independent component analysis and support vector regression. Decision Support Systems, 47(2), 115–125.CrossRefGoogle Scholar
  3. 3.
    Fu, T. C. (2011). A review on time series data mining. Engineering Applications of Artificial Intelligence, 24(1), 164–181.CrossRefGoogle Scholar
  4. 4.
    Esling, P., & Agon, C. (2012). Time-series data mining. ACM Computing Surveys (CSUR), 45(1), 1–32.CrossRefGoogle Scholar
  5. 5.
    Senthil, D., & Suseendran, G. (2018). Efficient time series data classification using sliding window technique based improved association rule mining with enhanced support vector machine. Submitted to International Journal of Engineering & Technology, 7(2), 218–223.Google Scholar
  6. 6.
    Povinelli, R. J., Johnson, M. T., Lindgren, A. C., & Ye, J. (2004). Time series classification using Gaussian mixture models of reconstructed phase spaces. IEEE Transactions on Knowledge and Data Engineering, 16(6), 779–783.CrossRefGoogle Scholar
  7. 7.
    Keogh, E., Chu, S., Hart, D., & Pazzani, M. (2001). An online algorithm for segmenting time series. In Data Mining, 2001. ICDM 2001, Proceedings IEEE International Conference, pp. 289–296.Google Scholar
  8. 8.
    Schmidt, R., & Gierl, L. (2005). A prognostic model for temporal courses that combines temporal abstraction and case–based reasoning. International Journal of Medical Informatics, 74(2–4), 307–315.CrossRefGoogle Scholar
  9. 9.
    Ghalwash, M. F., & Obradovic, Z. (2012). Early classification of multivariate temporal observations by extraction of interpretable shapelets. BMC Bioinformatics, 13(1), 1–12.CrossRefGoogle Scholar
  10. 10.
    Thiyagaraj, M., & Suseendran, G. (2017) Review of chronic kidney disease based on data mining proceedings of the 11th INDIACom; INDIACom–2017; IEEE Conference ID: 40353 2017 4th International Conference on “Computing for Sustainable Global Development”, 01st–03rd March, 2017 Bharati Vidyapeeth’s Institute of Computer Applications and Management (BVICAM), New Delhi (INDIA), pp. 2873–2878.Google Scholar
  11. 11.
    Thiyagaraj, M., & Suseendran, G. (2017). Survey on heart disease prediction system based on data mining techniques. Indian Journal of Innovations and Developments, 6(1), pp. 1–9.Google Scholar
  12. 12.
    Rohini, K., & Suseendran, G. (2016). Aggregated K means clustering and decision tree algorithm for spirometry data. Indian Journal of Science and Technology, 9(44), 1–6.CrossRefGoogle Scholar
  13. 13.
    Thiyagaraj, M., & Suseendran, G. (2018). An efficient heart disease prediction system using modified firefly algorithm based radial basis function with support vector machine. International Journal of Engineering & Technology, 7(2), 1040–1045.Google Scholar
  14. 14.
    Senthil, D., & Suseendran, G. (2017). Data mining techniques using time series analysis. In Proceedings of the 11th INDIACom; INDIACom–2017; IEEE Conference ID: 40353 2017 4th International Conference on “Computing for Sustainable Global Development”, 01st–03rd March, 2017 Bharati Vidyapeeth’s Institute of Computer Applications and Management (BVICAM), New Delhi (INDIA), pp. 2864–2872.Google Scholar

Copyright information

© Springer Nature Singapore Pte Ltd. 2020

Authors and Affiliations

  1. 1.Department of Computer ScienceVels Institute of Science, Technology & Advanced Studies (VISTAS)ChennaiIndia
  2. 2.Department of Information and Technology, School of Computing SciencesVels Institute of Science, Technology & Advanced Studies (VISTAS)ChennaiIndia

Personalised recommendations