Abstract
Heart disease is a leading cause of death in the world. Heart disease is the number one killer in both urban and rural areas. Predicting the outcome of disease is the challenging task. Data mining can be can be used to automatically infer diagnostic rules and help specialists to make diagnosis process more reliable. Several data mining techniques are used by researchers to help health care professionals to predict the heart disease. Random forest is an ensemble and most accurate learning algorithm, suitable for medical applications. Chi square feature selection measure is used to evaluate between variables and determines whether they are correlated or not. In this paper, we propose a classification model which uses random forest and chi square to predict heart disease. We evaluate our approach on heart disease data sets. The experimental results demonstarte that our approach improve classification accuracy compared to other classification approaches, and the presented model can help health care professional for predicting heart disease.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Polat, K., Gunes, S., Tosun, S.: Diagnosis of heart disease using artificial immune recognition system and fuzzy weight preprocessing. Pattern Recognit. 39, 2186–193 (2006)
Das, R., Turkoglu, I., Sengur, A.: Effective diagnosis of heart disease through network ensembles. Expert Syst. Appl. 36, 7675–7680 (2009)
Anooj, P.K.: Clinical decision support system: risk level prediction of heart disease using weighted fuzzy rules. J. King Saud Univ. CIS, 24, 27–40 (2012)
Detrano, R., Janosi, A., Stein burn, W., et al.: International application of new probability algorithm for the diagnosis of CAD. Am. J. Cardiol. 64(5), 304–310 (1989)
Shouman, M., Turner, T., Stocker, R.: Using decision tree for diagnosing heart disease patients. In: 9th Australian Data Mining Conference, Australia, vol 121. ACM (2011)
Tu, M.C. et al.: Effective diagnosis of heart disease through bagging approach. In: Biomedical Engineering and Approach, pp. 1–4, BMEI 2009, IEEE (2009)
Andreeva: Data modeling and specific rule generation via data mining techniques. In: International Conference on Computer System and Technologies, Comsystech 2006, pp. 1–6 (2006)
Saaol times, Monthly magazine, Modifiable risk factors of heart disease, pp. 6–10, July (2015)
home.etf.rs/~vm/os/dmsw/Random%20Forest.pptx. Last Accessed 10 Aug 2015
Forman, G.: An extensive empirical study of feature selection metrics for text classification. J. Mach. Learn. Res. 3, 1289–1305 (2003)
Sonwang, P., et al.: Computer network security based on SVM approach. In: 11th International Conference on Control, Automation, and Systems
Med Calc: www.medcalc.org. Last Accessed 5 Aug 2015
UCI machine learning repository: archive.ics.uci.edu/ml. Last Accessed 15 Aug 2015
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Jabbar, M.A., Deekshatulu, B.L., Chandra, P. (2016). Prediction of Heart Disease Using Random Forest and Feature Subset Selection. In: Snášel, V., Abraham, A., Krömer, P., Pant, M., Muda, A. (eds) Innovations in Bio-Inspired Computing and Applications. Advances in Intelligent Systems and Computing, vol 424. Springer, Cham. https://doi.org/10.1007/978-3-319-28031-8_16
Download citation
DOI: https://doi.org/10.1007/978-3-319-28031-8_16
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-28030-1
Online ISBN: 978-3-319-28031-8
eBook Packages: EngineeringEngineering (R0)