Skip to main content

Modeling and Forecasting Tuberculosis Cases Using Machine Learning and Deep Learning Approaches: A Comparative Study

  • Conference paper
  • First Online:
Data Management, Analytics and Innovation (ICDMAI 2022)

Abstract

Due to the prevalence of tuberculosis, it has become a source of concern in the world. From day to day, new cases are still discovered. To help governments and health policymakers make appropriate decisions and impose restrictions to reduce the prevalence of tuberculosis, efficient forecasting methods are required to be applied. Machine learning and deep learning techniques are commonly used due to their abilities in producing accurate results. In this paper, seven different models are applied to accomplish this objective, including SARIMAX, LSTM, CNN-LSTM Hybrid, MLP network, SVR, XGboost, and RF Regression models. These models are applied to forecast pulmonary positive, negative, and TB incidence cases. The models with the lowest errors are then chosen and used to forecast the number of pulmonary positive, negative, and TB incidence cases. The results of the experiments showed that the CNN-LSTM Hybrid and MLP networks achieved the lowest forecasting errors compared to the other models and were chosen for forecasting pulmonary negative, positive, and TB incidence cases from 2020 to 2029. The forecasting results revealed that there would be 117.861557 new pulmonary negative incidences, 153.029385 new pulmonary positive incidences, and 414.4704 new tuberculosis incidence cases.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. R. Miggiano, M. Rizzi, D.M. Ferraris, Mycobacterium tuberculosis pathogenesis, infection prevention and treatment. Pathogens 9(5) (2020)

    Google Scholar 

  2. I.D. Page, R. Byanyima, S. Hosmane, N. Onyachi, C. Opira, M. Richardson, R. Sawyer, A. Sharman, D.W. Denning, Chronic pulmonary aspergillosis commonly complicates treated pulmonary tuberculosis with residual cavitation. Eur. Respir. J. 53(3) (2019)

    Google Scholar 

  3. W.H.O. (WHO), Tuberculosis. http://www.who.int/news-room/fact-sheets/detail/tuberculosis. Last accessed 30 May 2021

  4. H. Mohajan, Tuberculosis is a Fatal Disease Among Some Developing Countries of the World (2014)

    Google Scholar 

  5. W.H.O. (WHO), Tuberculosis. http://www.who.int/news-room/fact-sheets/detail/tuberculosis. Last accessed 25 June 2021

  6. W.H.O.(WHO), Tuberculosis. https://www.who.int/countries/yem/tuberculosis. Last accessed 20 Apr 2021

  7. D.R. Silva, M. Muñoz-Torrico, R. Duarte, T. Galvão, E.H. Bonini, F.F. Arbex, M.A. Arbex, V.M. Augusto, M.F. Rabahi, F.C.D.Q. Mello, Risk factors for tuberculosis: diabetes, smoking, alcohol use, and the use of other drugs. J. Brasil. Pneumol. 44,145–152 (2018)

    Google Scholar 

  8. J. Brownlee, Introduction to time series forecasting with python: how to prepare data and develop models to predict the future, in Machine Learning Mastery (2017)

    Google Scholar 

  9. N. Talkhi, N.A. Fatemi, Z. Ataei, M.J. Nooghabi, Modeling and forecasting number of confirmed and death caused covid-19 in Iran: a comparison of time series forecasting methods. Biomed. Sig. Process. Control 66, 102494 (2021)

    Google Scholar 

  10. Q. Liu, Z. Li, Y. Ji, L. Martinez, U. H. Zia, A. Javaid, W. Lu, J. Wang, Forecasting the seasonality and trend of pulmonary tuberculosis in Jiangsu province of China using advanced statistical time-series analyses. Infect. Drug Resist. 12, 2311 (2019)

    Google Scholar 

  11. H. Wang, C. Tian, W. Wang, X. Luo, Time-series analysis of tuberculosis from 2005 to 2017 in China. Epidemiol. Infect.146(8), 935–939 (2018)

    Google Scholar 

  12. Z.-Q. Li, H.-Q. Pan, Q. Liu, H. Song, J.-M. Wang, Comparing the performance of time series models with or without meteorological factors in predicting incident pulmonary tuberculosis in Eastern China. Infect. Dis. Poverty 9(1), 1–11 (2020)

    Google Scholar 

  13. S. Ade, W. Békou, M. Adjobimey, O. Adjibode, G. Ade, A.D. Harries, S. Angoon, Tuberculosis case finding in Benin, 2000–2014 and beyond: a retrospective cohort and time-series study, in Tuberculosis Research, and Treatment (2016)

    Google Scholar 

  14. V. Vapnik, The Nature of Statistical Learning Theory, 2nd edn. (Springer Science & Business Media, USA, 2013)

    Google Scholar 

  15. H. Wu, Y. Cai, Y. Wu, R. Zhong, Q. Li, J. Zheng, D. Lin, Y. Li, Time series analysis of weekly influenza-like illness rate using a one-year period of factors in random forest regression. Biosci. Trends (2017)

    Google Scholar 

  16. J. Brownlee, XGBoost with python: gradient boosted trees with XGBoost and scikit-learn, in Machine Learning Mastery (2016)

    Google Scholar 

  17. A. Zeroual, F. Harrou, A. Dairi, Y. Sun, Deep learning methods for forecasting covid-19 time-series data: a comparative study. Chaos Solitons Fractals 140, 110121 (2020)

    Google Scholar 

  18. W. Lu, J. Li, Y. Li, A. Sun, J. Wang, A CNN-LSTM-based model to forecast stock prices, in Complexity (2020)

    Google Scholar 

  19. M.W. Gardner, S. Dorling.: Artificial neural networks (the multilayer perceptron)—a review of applications in the atmospheric sciences. Atmosp. Environ. 32(14–15), 2627–2636 (1998)

    Google Scholar 

  20. B. Abdualgalil, S. Abraham, Applications of machine learning algorithms and performance comparison: a review, in 2020 International Conference on Emerging Trends in Information Technology and Engineering (ic-ETITE) (IEEE, Vellore, India, 2020), pp. 1–6

    Google Scholar 

  21. U. Khair, H. Fahmi, S. Al-Hakim, R. Rahim, Forecasting error calculation with mean absolute deviation and mean absolute percentage error. J. Phys. Conf. Ser. 930(1), 012002 (2017). IOP Publishing, Medan, Sumatera Utara, Indonesia

    Google Scholar 

  22. T. Chai, R.R. Draxler, Root mean square error (RMSE) or mean absolute error (MAE)?—arguments against avoiding rmse in the literature. Geosci. Model Dev. 7(3), 1247–1250 (2014)

    Google Scholar 

Download references

Acknowledgements

Our thanks to the tuberculosis program of the General Administration of Epidemiological Surveillance (GAES), Sana’a, YEMEN for giving us the dataset.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Bilal Abdualgalil .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Abdualgalil, B., Abraham, S., Ismael, W.M., George, D. (2023). Modeling and Forecasting Tuberculosis Cases Using Machine Learning and Deep Learning Approaches: A Comparative Study. In: Goswami, S., Barara, I.S., Goje, A., Mohan, C., Bruckstein, A.M. (eds) Data Management, Analytics and Innovation. ICDMAI 2022. Lecture Notes on Data Engineering and Communications Technologies, vol 137. Springer, Singapore. https://doi.org/10.1007/978-981-19-2600-6_11

Download citation

Publish with us

Policies and ethics