Recurrence and Self-attention vs the Transformer for Time-Series Classification: A Comparative Study

  • Conference paper
  • Artificial Intelligence in Medicine (AIME 2022)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 13263)

Abstract

Recently, the transformer has established itself as the state of the art in text processing and has demonstrated impressive results in image processing, leading to a decline in the use of recurrence in neural network models. As established in the seminal paper, Attention Is All You Need, recurrence can be removed in favor of a simpler model using only self-attention. While transformers have shown themselves to be robust in a variety of text- and image-processing tasks, these tasks all have one thing in common: they are inherently non-temporal. Although transformers are also finding success in modeling time-series data, they have limitations compared to recurrent models. We explore a class of problems involving classification and prediction from time-series data and show that recurrence combined with self-attention can meet or exceed the performance of the transformer architecture. This class of problem, temporal classification and the prediction of labels through time from time-series data, is of particular importance to medical data sets, which are often time-series based (source code: https://github.com/imics-lab/recurrence-with-self-attention).
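The repository above contains the authors' implementation. As a rough illustration of the idea being compared, the following is a minimal PyTorch sketch of recurrence combined with self-attention: an LSTM encodes the sequence, and multi-head self-attention is applied over its hidden states. The layer sizes, head count, pooling, and sensor-style input shape are illustrative assumptions, not the paper's exact configuration.

```python
import torch
import torch.nn as nn

class LSTMSelfAttention(nn.Module):
    """Recurrent encoder (LSTM) followed by self-attention over its hidden states."""
    def __init__(self, n_features, hidden_size=128, n_heads=4, n_classes=12):
        super().__init__()
        self.lstm = nn.LSTM(n_features, hidden_size, batch_first=True)
        self.attn = nn.MultiheadAttention(hidden_size, n_heads, batch_first=True)
        self.fc = nn.Linear(hidden_size, n_classes)

    def forward(self, x):              # x: (batch, time, features)
        h, _ = self.lstm(x)            # recurrence: hidden state at every timestep
        a, _ = self.attn(h, h, h)      # self-attention over the recurrent states
        return self.fc(a.mean(dim=1))  # average-pool over time, then classify

# Hypothetical usage: windows of multi-channel sensor data (23 channels, 12 classes)
model = LSTMSelfAttention(n_features=23)
logits = model(torch.randn(8, 500, 23))  # batch of 8 windows, 500 timesteps each
```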

Notes

  1. https://www.physionet.org/content/mitdb/1.0.0/.

  2. https://www.deepdyve.com/lp/de-gruyter/nutzung-der-ekg-signaldatenbank-cardiodat-der-ptb-ber-das-internet-uemKpjIFzM.

References

  1. Australian Bureau of Meteorology (BOM): Australia, rain tomorrow. Australian BOM National Weather Observations
  2. Bahdanau, D., Cho, K., Bengio, Y.: Neural machine translation by jointly learning to align and translate. arXiv 1409, September 2014
  3. Banos, O., et al.: Design, implementation and validation of a novel open framework for agile development of mobile health applications. Biomed. Eng. OnLine 14(2), S6 (2015)
  4. Cheng, J., Dong, L., Lapata, M.: Long short-term memory-networks for machine reading. In: Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pp. 551–561, January 2016
  5. Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997). https://doi.org/10.1162/neco.1997.9.8.1735
  6. Katrompas, A., Metsis, V.: Enhancing LSTM models with self-attention and stateful training. In: Arai, K. (ed.) IntelliSys 2021. LNNS, vol. 294, pp. 217–235. Springer, Cham (2022). https://doi.org/10.1007/978-3-030-82193-7_14
  7. Lin, Z., et al.: A structured self-attentive sentence embedding, March 2017
  8. Luong, M.T., Pham, H., Manning, C.: Effective approaches to attention-based neural machine translation, August 2015
  9. Qin, Y., Song, D., Cheng, H., Cheng, W., Jiang, G., Cottrell, G.: A dual-stage attention-based recurrent neural network for time series prediction, April 2017
  10. Rahman, L., Mohammed, N., Al Azad, A.K.: A new LSTM model by introducing biological cell state. In: 2016 3rd International Conference on Electrical Engineering and Information Communication Technology (ICEEICT), pp. 1–6 (2016)
  11. De Vito, S.: Air quality data set. https://archive.ics.uci.edu/ml/datasets/Air+quality
  12. Shaw, P., Uszkoreit, J., Vaswani, A.: Self-attention with relative position representations, pp. 464–468, January 2018
  13. Vaswani, A., et al.: Attention is all you need. In: 31st Conference on Neural Information Processing Systems (NIPS 2017), June 2017
  14. Vavoulas, G., Chatzaki, C., Malliotakis, T., Pediaditis, M., Tsiknakis, M.: The MobiAct dataset: recognition of activities of daily living using smartphones. In: Proceedings of the International Conference on Information and Communication Technologies for Ageing Well and e-Health, pp. 143–151. SciTePress (2016)
  15. Wang, J., Yang, Y., Mao, J., Huang, Z., Huang, C., Xu, W.: CNN-RNN: a unified framework for multi-label image classification. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition, April 2016
  16. Wu, N., Green, B., Ben, X., O’Banion, S.: Deep transformer models for time series forecasting: the influenza prevalence case (2020)
  17. Zhao, H., Jia, J., Koltun, V.: Exploring self-attention for image recognition, pp. 10073–10082, June 2020

Author information

Correspondence to Vangelis Metsis.


A Appendix: Detailed Classification Report Results

(See Tables 4 and 5).

Table 4. Experimental results on the mHealth data set. Cat: Category, Acc: Accuracy, Prec: Precision, Rec: Recall, F1: F1-score, MacAvg: Macro Average, WAvg: Weighted Average
Table 5. Experimental results on the ECG classification data set. Cat: Category, Acc: Accuracy, Prec: Precision, Rec: Recall, F1: F1-score, MacAvg: Macro Average, WAvg: Weighted Average, 0: Non-Ectopic, 1: Supraventricular Ectopic, 2: Ventricular Ectopic, 3: Fusion, 4: Unknown
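For reference, the macro average reported in these tables is the unweighted mean of the per-class scores, while the weighted average weights each class by its support (the number of true instances). A report in this layout can be produced with scikit-learn's classification_report; the labels below are hypothetical and do not reproduce the paper's results.

```python
from sklearn.metrics import classification_report

# Hypothetical predictions for the 5-class ECG task
# (0: Non-Ectopic, 1: Supraventricular Ectopic, 2: Ventricular Ectopic,
#  3: Fusion, 4: Unknown)
y_true = [0, 0, 0, 1, 1, 2, 2, 2, 3, 4]
y_pred = [0, 0, 1, 1, 1, 2, 2, 3, 3, 4]

# Prints per-class precision/recall/F1 plus macro and weighted averages,
# i.e., the same columns as Tables 4 and 5
print(classification_report(y_true, y_pred, digits=4))
```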


Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Cite this paper

Katrompas, A., Ntakouris, T., Metsis, V. (2022). Recurrence and Self-attention vs the Transformer for Time-Series Classification: A Comparative Study. In: Michalowski, M., Abidi, S.S.R., Abidi, S. (eds) Artificial Intelligence in Medicine. AIME 2022. Lecture Notes in Computer Science, vol 13263. Springer, Cham. https://doi.org/10.1007/978-3-031-09342-5_10

  • DOI: https://doi.org/10.1007/978-3-031-09342-5_10

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-09341-8

  • Online ISBN: 978-3-031-09342-5

  • eBook Packages: Computer Science, Computer Science (R0)
