Abstract
Vector borne diseases like malaria fever is one of the most elevating issues in medical domain. Accurate identification of a patient from the given set of samples and classification becomes one of the challenging task when dealing with imbalanced datasets. Many conventional machine learning and data mining algorithms are shows poor performance to classify skewed distributed data because they are trained very well with the majority class samples only. Proposing an ensemble method called majority voting defined with a set of machine learning algorithms namely decision tree—C4.5, Naive Bayesian and K-Nearest Neighbor (KNN) classifiers. Classification of samples can be done based on the majority voting of classifiers. Experiment results stating that voting ensemble method shows classification accuracy of 95.2% on imbalanced malaria disease data whereas dealing with balanced malaria disease data voting ensembler shows 92.1% of accuracy. Consequently voting shows 100% classification report on precision, Recall and F1-Score on imbalanced malaria disease data sets whereas on balanced malaria disease data voting shows 96% of Precision, Recall and F1-Score metrics.
Keywords
- Malaria disease
- Balanced data
- Imbalanced data
- Voting ensembler
This is a preview of subscription content, access via your institution.
Buying options






References
Bui TQ, Pham HM (2016) Web based GIS for spatial pattern detection: application to malaria incidence in Vietnam. Bui Pham Springer Plus 5(1014):1–14
MacLeod DA, Jones A, Di Giuseppe F, Caminade C, Morse AP (2015) Demonstration of successful malaria forecasts for Botswana using an operational seasonal climate model. Environ Res Lett 10:044005, 1–11 (IOP Publishing)
Rahman MZ, Roytman L, Kadik A, Rosy DA (2015) Environmental data analysis and remote sensing for early detection of dengue and malaria. In: Proceedings of SPIE, vol 9112, pp 1–9
WHO Malaria Report (2016) http://www.who.int/mediacentre/factsheets/fs387/en/
Pengfei J, Chunkai Z, Zhenyu H (2014) A new sampling approach for classification of imbalanced data sets with high density. In: IEEE—BigComp, pp 217–222
Ditzler G, Polikar R (2012) Incremental learning of concept drift from streaming imbalanced data. IEEE Trans Knowl Data Eng, pp 1–30
Nugroho HA, Akbar SA, Murhandarwati EEH (2015) Feature extraction and classification for detection malaria parasites in thin blood smear. In: IEEE 2nd international conference on information technology, computer and electrical engineering (ICITACEE), pp 197–201
Das DK, Maiti AK, Chakraborty C (2015) Automated system for characterization and classification of malaria-infected stages using light microscopic images of thin blood smears. J Microsc 257(3):238–252
Ruiz D, Brun C, Connor SJ, Omumbo JA, Lyon B, Thomson MC (2014) Testing a multi-malaria-model ensemble against 30 years of data in the Kenyan highlands. Malaria J 13:206, 1–14
Smith T, Ross A, Maire N, Chitnis N, Studer A, Hardy D, Brooks A, Penny M, Tanner M (2012) Ensemble modeling of the likely public health impact of pre-erythrocytic malaria vaccine. PLOS Med 9(1):1–20
Pandit P, Anand A (2016, August) Artificial neural networks for detection of malaria in RBCs. ArXiv: 1608.06627)
Bbosa F, Wesonga R, Jehopio P (2016) Clinical malaria diagnosis: rule based Classification statistical prototype. Springer Plus 5:939
Wu C, Wong PJY (2016) Multi-dimensional discrete Halanay inequalities and the global stability of the disease free equilibrium of a discrete delayed malaria model. Adv Differ Equ 2016:113
Tsai M-H, Tsai M-H, Yu S-S, Chan Y-K, Jen C-C (2015) Blood smear image based malaria parasite and infected-erythrocyte detection and segmentation. Transactional Processing Systems. J Med Syst 39:118. https://doi.org/10.1007/s10916-015-0280-9
Rahmanti FZ, Ningrum NK, Imania NK, Purnomo MH (2015, November) Plasmodium vivax classification from digitalization microscopic thick blood film using combination of second order statistical feature extraction and K-Nearest Neighbour (K-NN) classifier method. In: IEEE 4th international conference on instrumentation, communications, information technology, and biomedical engineering (ICICI-BME), Bandung, pp 2–3
Charpe KC, Bairagi V (2015) Automated malaria parasite and there stage detection in microscopic blood images. In: IEEE sponsored 9th international conference on intelligent systems and control (ISCO)
Somasekar J, Reddy BE (2015) Segmentation of erythrocytes infected with malaria parasites for the diagnosis using microscopy imaging. Comput Electr Eng, pp 336–351 (Elsevier)
Cameron E, Battle KE, Bhatt S, Weiss DJ, Bisanzio D, Mappin B, Dalrymple U, Hay SI, Smith DL, Griffin JT, Wenger EA, Eckhoff PA, Smith TA, Penny MA, Gething PW (2015) Defining the relationship between infection prevalence and clinical incidence of Plasmodium falciparum malaria. Nat Commun 6:8170, 1–10
Krawczyk B (2016) Learning from imbalanced data: open challenges and future directions. Prog Artif Intell, pp 1–12
Deng X, Zhong W, Ren J, Zeng D, Zhang H (2016) An imbalanced data classification method based on automatic clustering under-sampling. IEEE Trans, pp 1–8
Ali A, Shamsuddin SM, Ralescu AL (2013) Classification with class imbalance problem: a review. Int J Adv Soft Comput Appl 5(3):1–30
Poolsawad N, Kambhampati C, Cleland JGF (2014) Balancing class for performance of classification with a clinical dataset. In: Proceedings of the World Congress on engineering, vol 1, pp 1–6
Rahman MM, Davis DN (2013) Addressing the class imbalance problem in medical datasets. Int J Mach Learn Comput 3(2):224–228
Haixiang G, Yijing L, Shang J, Mingyun G, Yuanyue H, Bing G (2016) Learning from class-imbalanced data: review of methods and applications. Expert Syst Appl, pp 1–49
Jamal S, Periwal V, Scaria V (2013) Predictive modeling of anti-malarial molecules inhibiting apicoplast formation. BMC Bioinform 14:55, 1–8
Andrade BB, Reis-Filho A, Souza-Neto SM, Clarencio J, Carmargo LMA, Barral A, Barral-Netto M (2010) Severe Plasmodium vivax malaria exhibits marked inflammatory imbalance. Malaria J 9:13, 1–8
Dubey R, Zhou J, Wanga Y, Thompson PM, Ye J (2014) Analysis of sampling techniques for imbalanced data: An n = 648 ADNI study. Elsevier Neuro Image 87:220–241
Ng WWY, Hu J, Yeung DS, Yin S, Roli F (2015) Diversified sensitivity-based under sampling for imbalance classification problems. IEEE Trans Cybern, pp 1–11
Roumani YF, May JH, Strum DP, Vargas LG (2013) Classifying highly imbalanced ICU data. Health care Manag Sci 16:119–128
Pengfei J, Chunkai Z, Zhenyu H (2014) A new sampling approach for classification of imbalanced data sets with high density. In: IEEE transaction, pp 217–222
Garcia V, Sanchez JS, Mollineda RA (2012) On the effectiveness of preprocessing methods when dealing with different levels of class imbalance. Knowl Based Syst 25:13–21 (Elsevier)
Thongkam J, Xu G, Zhang Y, Huang F (2009) Toward breast cancer survivability prediction model through improving training space. Expert Syst Appl 36:12200–12209 (Elsevier)
Zhao X-M, Li X, Chen L, Aihara K (2007) Protein classification with imbalanced data. Wiley InterSci 70:125–1132
López V, Fernandez A, Garcia S, Palade V, Herrera F (2013) An insight into classification with imbalanced data: empirical results and current trends on using data intrinsic characteristics. Inf Sci 250:113–141 (Elsevier)
Ma L, Fan S (2017) CURE-SMOTE algorithm and hybrid algorithm for feature selection and parameter optimization based on random forests. BMC Bioinform 18:169
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Sajana, T., Narasingarao, M.R. (2019). Majority Voting Algorithm for Diagnosing of Imbalanced Malaria Disease. In: Pandian, D., Fernando, X., Baig, Z., Shi, F. (eds) Proceedings of the International Conference on ISMAC in Computational Vision and Bio-Engineering 2018 (ISMAC-CVB). ISMAC 2018. Lecture Notes in Computational Vision and Biomechanics, vol 30. Springer, Cham. https://doi.org/10.1007/978-3-030-00665-5_4
Download citation
DOI: https://doi.org/10.1007/978-3-030-00665-5_4
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-00664-8
Online ISBN: 978-3-030-00665-5
eBook Packages: EngineeringEngineering (R0)