Seasonality in Infection Predictions Using Interpretable Models for High Dimensional Imbalanced Datasets
Seasonality plays a significant role in the prevalence of infectious diseases. We evaluate the performance of different approaches used to deal with seasonality in clinical prediction models, including a new proposal based on sliding windows. We also consider class imbalance, high dimensionality and the need for interpretable models, since these are common in clinical settings.
We tested these approaches with four datasets: two created synthetically and two extracted from the MIMIC-III database. Our results corroborate that clinical prediction models for infections can be improved by considering the effect of seasonality. However, the techniques employed to obtain the best results are highly dependent on the dataset.
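The abstract does not describe how the sliding windows are built, so the following is only a minimal sketch of one possible reading: training records are selected from a window of calendar months centred on the month of interest, wrapping around the year boundary. The names `months_in_window` and `select_by_window`, the `half_width` parameter and the toy records are all hypothetical and are not taken from the paper.

```python
from datetime import date


def months_in_window(center_month: int, half_width: int) -> list[int]:
    """Calendar months (1-12) within half_width months of center_month.

    The window wraps around the year, so January with half_width=1
    covers December, January and February.
    """
    return [((center_month - 1 + offset) % 12) + 1
            for offset in range(-half_width, half_width + 1)]


def select_by_window(records, center_month: int, half_width: int):
    """Keep only the records whose date falls in the seasonal window."""
    window = set(months_in_window(center_month, half_width))
    return [r for r in records if r["date"].month in window]


# Toy records (illustrative only, not data from the paper).
records = [
    {"date": date(2019, 12, 15), "infected": 1},
    {"date": date(2020, 1, 10), "infected": 0},
    {"date": date(2020, 2, 3), "infected": 1},
    {"date": date(2020, 7, 21), "infected": 0},
]

# Records that a model for January would be trained on under this assumption.
january_training = select_by_window(records, center_month=1, half_width=1)
```

Under this assumption, one model would be fitted per window position. Narrow windows track seasonal change closely but leave fewer positive cases to learn from, which matters when the classes are imbalanced.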
Keywords: Seasonality, Concept drift, Clinical prediction models