Skip to main content

Can Automated Smoothing Significantly Improve Benchmark Time Series Classification Algorithms?

  • Conference paper
  • First Online:
Hybrid Artificial Intelligent Systems (HAIS 2019)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11734))

Included in the following conference series:

Abstract

tl;dr: no, it cannot, at least not on average on the standard archive problems. We assess whether using six smoothing algorithms (moving average, exponential smoothing, Gaussian filter, Savitzky-Golay filter, Fourier approximation and a recursive median sieve) could be automatically applied to time series classification problems as a preprocessing step to improve the performance of three benchmark classifiers (1-Nearest Neighbour with Euclidean and Dynamic Time Warping distances, and Rotation Forest). We found no significant improvement over unsmoothed data even when we set the smoothing parameter through cross validation. We are not claiming smoothing has no worth. It has an important role in exploratory analysis and helps with specific classification problems where domain knowledge can be exploited. What we observe is that the automatic application does not help to improve classification performance and that we cannot explain the improvement of other time series classification algorithms over the baseline classifiers simply as a function of the absence of smoothing.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://www.timeseriesclassification.com.

  2. 2.

    https://github.com/TonyBagnall/uea-tsc.

  3. 3.

    http://www.timeseriesclassification.com/Smoothing.php.

References

  1. Chen, Y., Hu, B., Keogh, E.: Time series classification under more realistic assumption. In Proceedings 13th SIAM International Conference on Data Mining (2013)

    Google Scholar 

  2. Bagnall, A., Lines, J., Bostrom, A., Large, J., Keogh, E.: The great time series classification bake off: a review and experimental evaluation of recent algorithmic advances. Data Min. Knowl. Disc. 31(3), 606–660 (2017)

    Article  MathSciNet  Google Scholar 

  3. Bangham, J.: Data-sieving hydrophobicity plots. Anal. Biochem. 174(1), 142–145 (1988)

    Article  Google Scholar 

  4. Bangham, J., Harvey, R., Ling, P., Aldridge, R.: Morphological scale-space preserving transforms in many dimensions. J. Electron. Imaging 5, 283–299 (1996)

    Article  Google Scholar 

  5. Bangham, J., Ling, P., Harvey, R.: Scale-space from nonlinear filters. IEEE Trans. Pattern Anal. Mach. Intell. 18(5), 520–528 (1996)

    Article  Google Scholar 

  6. Benavoli, A., Corani, G., Mangili, F.: Should we really use post-hoc tests based on mean-ranks? J. Mach. Learn. Res. 17, 1–10 (2016)

    MathSciNet  MATH  Google Scholar 

  7. Betta, G., Capriglione, D., Cerro, G., Ferrigno, L., Miele, G.: The effectiveness of Savitzky-Golay smoothing method for spectrum sensing in cognitive radios. In: Proceedings of the 2015 18th AISEM Annual Conference, pp. 1–4 (2015)

    Google Scholar 

  8. Chen, Y., et al.: The UEA-UCR time series classification archive (2015). http://www.cs.ucr.edu/~eamonn/time_series_data/

  9. Chou, Y.: Statistical Analysis. Holt International (1975)

    Google Scholar 

  10. Cooley, J., Lewis, P., Welch, P.: The fast fourier transform and its applications. IEEE Trans. Educ. 12(1), 27–34 (1969)

    Article  Google Scholar 

  11. Demšar, J.: Statistical comparisons of classifiers over multiple data sets. J. Mach. Learn. Res. 7, 1–30 (2006)

    MathSciNet  MATH  Google Scholar 

  12. Fernandes, B., Colletta, G., Ferreira, L., Dutra, O.: Utilization of Savitzky-Golay filter for power line interference cancellation in an embedded electrocardiographic monitoring platform. In: Proceedings IEEE International Symposium on Medical Measurements and Applications, pp. 7–12 (2017)

    Google Scholar 

  13. García, S., Herrera, F.: An extension on “statistical comparisons of classifiers over multiple data sets” for all pairwise comparisons. J. Mach. Learn. Res. 9, 2677–2694 (2008)

    MATH  Google Scholar 

  14. Lines, J., Taylor, S., Bagnall, A.: HIVE-COTE: the hierarchical vote collective of transformation-based ensembles for time series classification. In Proceedings IEEE International Conference on Data Mining (2016)

    Google Scholar 

  15. Ratanamahatana, C., Keogh, E.: Three myths about dynamic time warping data mining. In: Proceedings 5th SIAM International Conference on Data Mining (2005)

    Google Scholar 

  16. Rodriguez, J., Kuncheva, L., Alonso, C.: Rotation forest: a new classifier ensemble method. IEEE Trans. Pattern Anal. Mach. Intell. 28(10), 1619–1630 (2006)

    Article  Google Scholar 

  17. Savitzky, A., Golay, M.: Smoothing and differentiation of data by simplified least squares procedures. Anal. Chem. 36(8), 1627–1639 (1964)

    Article  Google Scholar 

  18. Schafer, R.: What is a Savitzky-Golay filter? IEEE Signal Process. Mag. 28(4), 111–117 (2011)

    Article  Google Scholar 

  19. Tan, C., Herrman, C., Forestier, G., Webb, G., Petitjean, F.: Efficient search of the best warping window for dynamic time warping. In: Proceedings 18th SIAM International Conference on Data Mining (2018)

    Chapter  Google Scholar 

Download references

Acknowledgement

This work is supported by the UK Engineering and Physical Sciences Research Council (EPSRC) [grant number EP/M015807/1] and Biotechnology and Biological Sciences Research Council [grant number BB/M011216/1]. The experiments were carried out on the High Performance Computing Cluster supported by the Research and Specialist Computing Support service at the University of East Anglia and using a Titan X Pascal donated by the NVIDIA Corporation.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Anthony Bagnall .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Large, J., Southam, P., Bagnall, A. (2019). Can Automated Smoothing Significantly Improve Benchmark Time Series Classification Algorithms?. In: Pérez García, H., Sánchez González, L., Castejón Limas, M., Quintián Pardo, H., Corchado Rodríguez, E. (eds) Hybrid Artificial Intelligent Systems. HAIS 2019. Lecture Notes in Computer Science(), vol 11734. Springer, Cham. https://doi.org/10.1007/978-3-030-29859-3_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-29859-3_5

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-29858-6

  • Online ISBN: 978-3-030-29859-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics