Seasonality in Infection Predictions Using Interpretable Models for High Dimensional Imbalanced Datasets

Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 12721)


Seasonality plays a significant role in the prevalence of infectious diseases. We evaluate the performance of different approaches used to deal with seasonality in clinical prediction models, including a new proposal based on sliding windows. Class imbalance, high dimensionality and interpretable models are also considered since they are common traits of clinical datasets.

We tested these approaches with four datasets: two created synthetically and two extracted from the MIMIC-III database. Our results corroborate that clinical prediction models for infections can be improved by considering the effect of seasonality. However, the techniques employed to obtain the best results are highly dependent on the dataset.
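The abstract does not describe how the sliding-window proposal works. As a minimal sketch only, assuming a window is a run of consecutive calendar months that wraps around the end of the year, the training data for a given target month could be selected as follows. The function names, the `month` field and the `half_width` parameter are hypothetical, chosen for illustration, and are not taken from the paper.

```python
def months_in_window(center_month, half_width):
    """Return the calendar months (1-12) within half_width of center_month.

    The window wraps around the year, so a window centred on January
    with half_width=1 covers December, January and February.
    """
    return {
        ((center_month - 1 + offset) % 12) + 1
        for offset in range(-half_width, half_width + 1)
    }


def select_window(records, center_month, half_width=1):
    """Keep only the records whose 'month' field falls inside the window.

    'records' is assumed to be a list of dicts with an integer 'month' key;
    this layout is an assumption made for the example.
    """
    window = months_in_window(center_month, half_width)
    return [r for r in records if r["month"] in window]


# Toy, made-up records used only to show the call pattern.
toy_records = [
    {"month": 12, "label": 1},
    {"month": 6, "label": 0},
    {"month": 2, "label": 0},
]
winter_subset = select_window(toy_records, center_month=1, half_width=1)
```

A model would then be fitted on each windowed subset. A wider window gives each model more samples, which matters when the positive class is rare, at the cost of mixing in data from less similar months.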


Keywords: Seasonality, Concept drift, Clinical prediction models



Copyright information

© Springer Nature Switzerland AG 2021

Authors and Affiliations

  1. Computer Science Faculty, University of Murcia, Murcia, Spain
