Binary Shapelet Transform for Multiclass Time Series Classification
Shapelets have recently been proposed as a new primitive for time series classification. A shapelet is a subsequence of a time series that best splits the data into its classes. In the original research, shapelets were found recursively within a decision tree by enumerating the search space. Subsequent research indicated that using shapelets as the basis for transforming datasets leads to more accurate classifiers.
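To make the idea of a shapelet-based transform concrete, the following is a minimal sketch, not the authors' implementation. It assumes the usual definition of the distance between a shapelet and a series as the smallest Euclidean distance between the shapelet and any equal-length window of the series. The function names `shapelet_distance` and `shapelet_transform` are illustrative, and the normalisation of subsequences commonly applied in the shapelet literature is omitted for brevity.

```python
import math


def shapelet_distance(shapelet, series):
    """Smallest Euclidean distance between the shapelet and any
    window of the series that has the same length as the shapelet."""
    m = len(shapelet)
    if m == 0 or m > len(series):
        raise ValueError("shapelet must be non-empty and no longer than the series")
    best = math.inf
    for start in range(len(series) - m + 1):
        sq_sum = 0.0
        for offset in range(m):
            diff = series[start + offset] - shapelet[offset]
            sq_sum += diff * diff
        best = min(best, sq_sum)
    return math.sqrt(best)


def shapelet_transform(shapelets, dataset):
    """Map each series to a vector of its distances to each shapelet.
    Row i, column j holds the distance from series i to shapelet j."""
    return [[shapelet_distance(s, x) for s in shapelets] for x in dataset]
```

In this sketch, each series becomes a fixed-length feature vector with one attribute per shapelet, so any standard classifier can be trained on the transformed data.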
Both of these approaches evaluate how well a shapelet splits all the classes. However, a shapelet is often most useful for distinguishing the class of its source series from all other classes. To assess this conjecture, we evaluate a one-vs-all encoding scheme. This technique simplifies the quality assessment calculations, speeds up execution by allowing more frequent early abandonment, and increases accuracy on multi-class problems. We also propose an alternative shapelet evaluation scheme, which we demonstrate significantly speeds up the full search.
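The one-vs-all encoding can be illustrated with a short sketch. It assumes that shapelet quality is measured by information gain over the best distance threshold, as in the original decision-tree work. The function names `binary_entropy` and `one_vs_all_info_gain` are hypothetical, and the early abandonment mentioned above is not shown. Labels are reduced to "same class as the shapelet's source series" versus "any other class", so only two counts need to be tracked per split.

```python
import math


def binary_entropy(pos, neg):
    """Entropy in bits of a two-class count (pos, neg)."""
    total = pos + neg
    h = 0.0
    for count in (pos, neg):
        if count:
            p = count / total
            h -= p * math.log2(p)
    return h


def one_vs_all_info_gain(distances, labels, target_class):
    """Best information gain over all distance thresholds, treating
    target_class as positive and every other class as negative."""
    pairs = sorted(zip(distances, (lab == target_class for lab in labels)))
    n = len(pairs)
    total_pos = sum(1 for _, is_target in pairs if is_target)
    total_neg = n - total_pos
    parent = binary_entropy(total_pos, total_neg)

    best_gain = 0.0
    left_pos = left_neg = 0
    for i in range(n - 1):
        if pairs[i][1]:
            left_pos += 1
        else:
            left_neg += 1
        # A threshold can only fall between two distinct distances.
        if pairs[i][0] == pairs[i + 1][0]:
            continue
        n_left = i + 1
        n_right = n - n_left
        children = (n_left / n) * binary_entropy(left_pos, left_neg) \
            + (n_right / n) * binary_entropy(total_pos - left_pos, total_neg - left_neg)
        best_gain = max(best_gain, parent - children)
    return best_gain
```

With three classes, a shapelet that lies close to every series of its own class and far from all others obtains the maximum possible binary gain, even though it does not separate the remaining two classes from each other.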