Can Automated Smoothing Significantly Improve Benchmark Time Series Classification Algorithms?

Large, James; Southam, Paul; Bagnall, Anthony

doi:10.1007/978-3-030-29859-3_5

James Large¹³,
Paul Southam¹³ &
Anthony Bagnall¹³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11734))

Included in the following conference series:

International Conference on Hybrid Artificial Intelligence Systems

1375 Accesses
2 Citations

Abstract

tl;dr: no, it cannot, at least not on average on the standard archive problems. We assess whether using six smoothing algorithms (moving average, exponential smoothing, Gaussian filter, Savitzky-Golay filter, Fourier approximation and a recursive median sieve) could be automatically applied to time series classification problems as a preprocessing step to improve the performance of three benchmark classifiers (1-Nearest Neighbour with Euclidean and Dynamic Time Warping distances, and Rotation Forest). We found no significant improvement over unsmoothed data even when we set the smoothing parameter through cross validation. We are not claiming smoothing has no worth. It has an important role in exploratory analysis and helps with specific classification problems where domain knowledge can be exploited. What we observe is that the automatic application does not help to improve classification performance and that we cannot explain the improvement of other time series classification algorithms over the baseline classifiers simply as a function of the absence of smoothing.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

References

Chen, Y., Hu, B., Keogh, E.: Time series classification under more realistic assumption. In Proceedings 13th SIAM International Conference on Data Mining (2013)
Google Scholar
Bagnall, A., Lines, J., Bostrom, A., Large, J., Keogh, E.: The great time series classification bake off: a review and experimental evaluation of recent algorithmic advances. Data Min. Knowl. Disc. 31(3), 606–660 (2017)
Article MathSciNet Google Scholar
Bangham, J.: Data-sieving hydrophobicity plots. Anal. Biochem. 174(1), 142–145 (1988)
Article Google Scholar
Bangham, J., Harvey, R., Ling, P., Aldridge, R.: Morphological scale-space preserving transforms in many dimensions. J. Electron. Imaging 5, 283–299 (1996)
Article Google Scholar
Bangham, J., Ling, P., Harvey, R.: Scale-space from nonlinear filters. IEEE Trans. Pattern Anal. Mach. Intell. 18(5), 520–528 (1996)
Article Google Scholar
Benavoli, A., Corani, G., Mangili, F.: Should we really use post-hoc tests based on mean-ranks? J. Mach. Learn. Res. 17, 1–10 (2016)
MathSciNet MATH Google Scholar
Betta, G., Capriglione, D., Cerro, G., Ferrigno, L., Miele, G.: The effectiveness of Savitzky-Golay smoothing method for spectrum sensing in cognitive radios. In: Proceedings of the 2015 18th AISEM Annual Conference, pp. 1–4 (2015)
Google Scholar
Chen, Y., et al.: The UEA-UCR time series classification archive (2015). http://www.cs.ucr.edu/~eamonn/time_series_data/
Chou, Y.: Statistical Analysis. Holt International (1975)
Google Scholar
Cooley, J., Lewis, P., Welch, P.: The fast fourier transform and its applications. IEEE Trans. Educ. 12(1), 27–34 (1969)
Article Google Scholar
Demšar, J.: Statistical comparisons of classifiers over multiple data sets. J. Mach. Learn. Res. 7, 1–30 (2006)
MathSciNet MATH Google Scholar
Fernandes, B., Colletta, G., Ferreira, L., Dutra, O.: Utilization of Savitzky-Golay filter for power line interference cancellation in an embedded electrocardiographic monitoring platform. In: Proceedings IEEE International Symposium on Medical Measurements and Applications, pp. 7–12 (2017)
Google Scholar
García, S., Herrera, F.: An extension on “statistical comparisons of classifiers over multiple data sets” for all pairwise comparisons. J. Mach. Learn. Res. 9, 2677–2694 (2008)
MATH Google Scholar
Lines, J., Taylor, S., Bagnall, A.: HIVE-COTE: the hierarchical vote collective of transformation-based ensembles for time series classification. In Proceedings IEEE International Conference on Data Mining (2016)
Google Scholar
Ratanamahatana, C., Keogh, E.: Three myths about dynamic time warping data mining. In: Proceedings 5th SIAM International Conference on Data Mining (2005)
Google Scholar
Rodriguez, J., Kuncheva, L., Alonso, C.: Rotation forest: a new classifier ensemble method. IEEE Trans. Pattern Anal. Mach. Intell. 28(10), 1619–1630 (2006)
Article Google Scholar
Savitzky, A., Golay, M.: Smoothing and differentiation of data by simplified least squares procedures. Anal. Chem. 36(8), 1627–1639 (1964)
Article Google Scholar
Schafer, R.: What is a Savitzky-Golay filter? IEEE Signal Process. Mag. 28(4), 111–117 (2011)
Article Google Scholar
Tan, C., Herrman, C., Forestier, G., Webb, G., Petitjean, F.: Efficient search of the best warping window for dynamic time warping. In: Proceedings 18th SIAM International Conference on Data Mining (2018)
Chapter Google Scholar

Download references

Acknowledgement

This work is supported by the UK Engineering and Physical Sciences Research Council (EPSRC) [grant number EP/M015807/1] and Biotechnology and Biological Sciences Research Council [grant number BB/M011216/1]. The experiments were carried out on the High Performance Computing Cluster supported by the Research and Specialist Computing Support service at the University of East Anglia and using a Titan X Pascal donated by the NVIDIA Corporation.

Author information

Authors and Affiliations

University of East Anglia, Norwich Research Park, Norwich, UK
James Large, Paul Southam & Anthony Bagnall

Authors

James Large
View author publications
You can also search for this author in PubMed Google Scholar
Paul Southam
View author publications
You can also search for this author in PubMed Google Scholar
Anthony Bagnall
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Anthony Bagnall .

Editor information

Editors and Affiliations

University of León, León, Spain
Hilde Pérez García
University of León, León, Spain
Lidia Sánchez González
University of León, León, Spain
Manuel Castejón Limas
University of A Coruña, Ferrol, Spain
Héctor Quintián Pardo
University of Salamanca, Salamanca, Spain
Emilio Corchado Rodríguez

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Large, J., Southam, P., Bagnall, A. (2019). Can Automated Smoothing Significantly Improve Benchmark Time Series Classification Algorithms?. In: Pérez García, H., Sánchez González, L., Castejón Limas, M., Quintián Pardo, H., Corchado Rodríguez, E. (eds) Hybrid Artificial Intelligent Systems. HAIS 2019. Lecture Notes in Computer Science(), vol 11734. Springer, Cham. https://doi.org/10.1007/978-3-030-29859-3_5

Download citation

DOI: https://doi.org/10.1007/978-3-030-29859-3_5
Published: 26 August 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-29858-6
Online ISBN: 978-3-030-29859-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics