An Efficient Prediction Model for Diabetic Database Using Soft Computing Techniques
Organizations aim at harnessing predictive insights, using the vast real-time data stores that they have accumulated through the years, using data mining techniques. Health sector, has an extremely large source of digital data - patient-health related data-store, which can be effectively used for predictive analytics. This data, may consists of missing, incorrect and sometimes incomplete values sets that can have a detrimental effect on the decisions that are outcomes of data analytics. Using the PIMA Indians Diabetes dataset, we have proposed an efficient imputation method using a hybrid combination of CART and Genetic Algorithm, as a preprocessing step. The classical neural network model is used for prediction, on the preprocessed dataset. The accuracy achieved by the proposed model far exceeds the existing models, mainly because of the soft computing preprocessing adopted. This approach is simple, easy to understand and implement and practical in its approach.
Unable to display preview. Download preview PDF.
- 1.Mitra, S., Acharya, T.: Data Mining, Multimedia, Soft-computing and Bioinformatics. Wiley Interscience, Hoboken (2004)Google Scholar
- 3.Zhang, S., Qin, Z., Ling, C.X., Sheng, S.: Missing is Useful: Missing Values in Cost Sensitive Decision Trees. IEEE Transactions on Knowledge and Data Engineering 17(12) (2005)Google Scholar
- 4.Rady, E.A., Abd, El-Monsef, M.M.E., Abd, El-Latif, W.A.: A Modified Rough Set Approach to Incomplete Information Systems. Journal of Applied Mathematics and Decision Sciences 2007, article ID 58248 (2007)Google Scholar
- 5.Satya Kumar, D.V.R., Sriram, K., Rao, K.M., Murty, U.S.: Management of Filariasis Using Prediction Rules Derived from Data Mining. In: Bioinformation by Biomedical Informatics Publishing Group (2005)Google Scholar
- 6.Palaniappan, S., Awang, R.: Intelligent Heart Disease Prediction System using Data Mining Techniques. International Journal of Computer Science and Network Security 8(8) (2008)Google Scholar
- 7.Liu, P., Lei, L.: A Review of Missing Data Treatment Methods. Intelligent Information Management Systems and Technologies 1(3), 412–419 (2005)Google Scholar
- 8.Mehala, B., Ranjit Jeba Thangaiah, P., Vivekanandan, K.: Selecting Scalable Algorithms to Deal with Missing Values. International Journal of Recent Trends in Engineering 1(2) (2009)Google Scholar
- 9.Adbdella, M., Marwala, T.: Treatment of Missing Data Using Neural Networks and Genetic Algorithms. In: International Joint Conference on Neural Networks, Canada (2005)Google Scholar
- 10.Magnani, M.: Techniques for Dealing with Missing Data in Knowledge Discovery Tasks, Department of Computer Science, University of Bologna (2004)Google Scholar
- 11.Acuna, E., Rodriguez, C.: The Treatment of Missing Values and its Effect in the Classifier Accuracy. In: Multiscale Methods in Science and Engineering, pp. 639–647. Springer, Heidelberg (2004)Google Scholar