Machine Learning Algorithms for Anemia Disease Prediction

  • Manish JaiswalEmail author
  • Anima Srivastava
  • Tanveer J. Siddiqui
Conference paper
Part of the Lecture Notes in Electrical Engineering book series (LNEE, volume 524)


The remarkable advances in health industry have led to a significant production of data in everyday life. These data require processing to extract useful information, which can be useful for analysis, prediction, recommendations, and decision making. Data mining and machine learning techniques are used to transform the available data into valuable information. In medical science, disease prediction at the right time is the central problem for professionals for prevention and effective treatment plan. Sometimes, in the absence of accuracy this may lead to death. In this study, we investigate supervised machine learning algorithms—Naive Bayes, random forest, and decision tree algorithm—for prediction of anemia using CBC (complete blood count) data collected from pathology centers. The results show that Naive Bayes technique outperforms in terms of accuracy as compared to C4.5 and random forest.


Anemia Classification algorithms Decision making Complete blood count (CBC) 


  1. 1.
    Arun, V., et al. (2015). Privacy of health information in telemedicine on private cloud. International Journal of Family Medicine & Medical Science Research.Google Scholar
  2. 2.
    Provenzano, R., Lerma, E. V., & Szczech, L. (2018). Management of anemia. Springer.Google Scholar
  3. 3.
    Ezzati, M., Lopez, Ad, Rodgers, A., & Murray, C. J. L. (2004). Comparative quantification of health risks: Global and regional burden of disease attributable to selected major risk factors. Geneva: World Health Organization.Google Scholar
  4. 4.
    Balarajan, Y., et al. (2011). Anaemia in low-income and middle-income countries.CrossRefGoogle Scholar
  5. 5.
    Haas, J. D., Brownlie, T. (2001). Iron deficiency and reduced work capacity: A critical review of the research to determine a causal relationship. The Journal of Nutrition.Google Scholar
  6. 6.
    Kozuki, N., Lee, A. C., & Katz, J. (2012). Child health epidemiology reference group. Moderate to severe, but not mild, maternal anemia is associated with increased risk of small-for-gestational-age outcomes. The Journal of Nutrition.Google Scholar
  7. 7.
    Steer, P. J. (2000). Maternal hemoglobin concentration and birth weight. The American Journal of Clinical Nutrition.Google Scholar
  8. 8.
    Shilpa, S. A., Nagori, M., & Kshirsaga, V. (2011). Classification of anemia using data mining techniques. In Swarm, evolutionary, and memetic computing (pp. 113–121). Springer.Google Scholar
  9. 9.
    Amin, N., & Habib, A. (2015). Comparison of different classification techniques using WEKA for hematological data. American Journal of Engineering Research, 4(3), 55–61.Google Scholar
  10. 10.
    Dogan, S., & Turkoglu, I. (2008). Iron deficiency anemia detection from hematology parameters by using decision tree. International Journal of Science and Technology, 85–92.Google Scholar
  11. 11.
    Abdullah, M., & Al-Asmari, S. (2016). Anemia types prediction based on data mining classification algorithms. In Communication, management and information technology. London: Taylor & Francis Group.Google Scholar
  12. 12.
    Jerez-Aragonés, J. M., et al. (2003). A combined neural network and decision trees model for prognosis of breast cancer relapse. Artificial Intelligence in Medicine, 45–63.CrossRefGoogle Scholar
  13. 13.
    Podgorelec, V., et al. (2002). Decision trees: An overview and their use in medicine. Journal of Medical Systems, 445–463.CrossRefGoogle Scholar

Copyright information

© Springer Nature Singapore Pte Ltd. 2019

Authors and Affiliations

  • Manish Jaiswal
    • 1
    Email author
  • Anima Srivastava
    • 1
  • Tanveer J. Siddiqui
    • 1
  1. 1.Department of Electronics and CommunicationUniversity of AllahabadAllahabadIndia

Personalised recommendations