Classification of Heart Disease Using Naïve Bayes and Genetic Algorithm

  • Santosh Kumar
  • G. Sahoo
Conference paper
Part of the Smart Innovation, Systems and Technologies book series (SIST, volume 32)


Data mining techniques have been widely used to extract useful knowledge from medical databases. In data mining, classification is a supervised learning task used to build models that describe important data classes, with the class attribute guiding the construction of the classifier. Naïve Bayes is a simple, popular, efficient, and effective algorithm for pattern recognition. Medical databases are typically large. When a data set contains redundant or irrelevant attributes, classification may produce less accurate results. Heart disease is the leading cause of death in India as well as in other parts of the world. Hence there is a need for a decision support system that helps clinicians take precautionary measures. In this paper we propose a new algorithm that combines Naïve Bayes with a genetic algorithm for effective classification. Experimental results show that our algorithm enhances accuracy in the diagnosis of heart disease.
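The abstract does not spell out how the two techniques are combined, so the following is only a minimal sketch under stated assumptions: the genetic algorithm acts as a wrapper that searches over attribute subsets (bit masks), and the fitness of a subset is the validation accuracy of a categorical Naïve Bayes classifier trained on it. The data are synthetic, and every name here (`train_nb`, `predict_nb`, `accuracy`, `ga_select`, `make_data`) and every parameter value is hypothetical rather than taken from the paper.

```python
import math
import random
from collections import Counter, defaultdict


def train_nb(rows, labels, feats):
    """Count class and per-feature value frequencies for the selected features."""
    class_counts = Counter(labels)
    value_counts = defaultdict(Counter)  # (feature, class) -> value frequencies
    values = defaultdict(set)            # feature -> distinct values seen
    for row, y in zip(rows, labels):
        for f in feats:
            value_counts[(f, y)][row[f]] += 1
            values[f].add(row[f])
    return class_counts, value_counts, values


def predict_nb(model, row, feats):
    """Return the class with the highest log-posterior (Laplace smoothing)."""
    class_counts, value_counts, values = model
    total = sum(class_counts.values())
    best, best_score = None, -math.inf
    for c, n_c in class_counts.items():
        score = math.log(n_c / total)
        for f in feats:
            seen = value_counts[(f, c)][row[f]]
            score += math.log((seen + 1) / (n_c + len(values[f])))
        if score > best_score:
            best, best_score = c, score
    return best


def accuracy(mask, train, test):
    """Fitness: accuracy on `test` of Naive Bayes trained on the masked features."""
    feats = [i for i, bit in enumerate(mask) if bit]
    if not feats:
        return 0.0  # an empty attribute subset cannot classify anything
    model = train_nb(train[0], train[1], feats)
    hits = sum(predict_nb(model, r, feats) == y for r, y in zip(*test))
    return hits / len(test[1])


def ga_select(train, valid, n_feats, pop_size=12, generations=15,
              p_mut=0.1, seed=0):
    """Assumed GA: truncation selection, one-point crossover, bit-flip mutation."""
    rng = random.Random(seed)
    pop = [[rng.randint(0, 1) for _ in range(n_feats)] for _ in range(pop_size)]

    def fitness(mask):
        return accuracy(mask, train, valid)

    for _ in range(generations):
        pop.sort(key=fitness, reverse=True)
        parents = pop[: pop_size // 2]        # the better half survives unchanged
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, n_feats)   # one-point crossover position
            child = a[:cut] + b[cut:]
            child = [bit ^ (rng.random() < p_mut) for bit in child]
            children.append(child)
        pop = parents + children
    return max(pop, key=fitness)


def make_data(n, rng):
    """Synthetic data: features 0-1 track the label, features 2-5 are noise."""
    rows, labels = [], []
    for _ in range(n):
        y = rng.randint(0, 1)
        informative = [y if rng.random() < 0.9 else 1 - y for _ in range(2)]
        noise = [rng.randint(0, 2) for _ in range(4)]
        rows.append(informative + noise)
        labels.append(y)
    return rows, labels


rng = random.Random(42)
train, valid = make_data(200, rng), make_data(100, rng)
best_mask = ga_select(train, valid, n_feats=6)
print("selected mask:", best_mask)
print("accuracy, selected attributes:", accuracy(best_mask, train, valid))
print("accuracy, all attributes:", accuracy([1] * 6, train, valid))
```

Because the same validation split both drives the search and reports the score, the printed accuracy is optimistic; a fair comparison would hold out a separate test set or use cross-validation.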


Keywords: Naïve Bayes, Genetic algorithm, Heart disease, Data mining



Copyright information

© Springer India 2015

Authors and Affiliations

  1. Department of Computer Science and Engineering, Birla Institute of Technology, Mesra, Ranchi, India
