An Efficient Prediction Model for Diabetic Database Using Soft Computing Techniques

  • Veena H. Bhat
  • Prasanth G. Rao
  • P. Deepa Shenoy
  • K. R. Venugopal
  • L. M. Patnaik
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5908)

Abstract

Organizations aim at harnessing predictive insights, using the vast real-time data stores that they have accumulated through the years, using data mining techniques. Health sector, has an extremely large source of digital data - patient-health related data-store, which can be effectively used for predictive analytics. This data, may consists of missing, incorrect and sometimes incomplete values sets that can have a detrimental effect on the decisions that are outcomes of data analytics. Using the PIMA Indians Diabetes dataset, we have proposed an efficient imputation method using a hybrid combination of CART and Genetic Algorithm, as a preprocessing step. The classical neural network model is used for prediction, on the preprocessed dataset. The accuracy achieved by the proposed model far exceeds the existing models, mainly because of the soft computing preprocessing adopted. This approach is simple, easy to understand and implement and practical in its approach.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Mitra, S., Acharya, T.: Data Mining, Multimedia, Soft-computing and Bioinformatics. Wiley Interscience, Hoboken (2004)Google Scholar
  2. 2.
    Little, R.J.A., Rubin, D.B.: Statistical Analysis with Missing Data. John Wiley, New York (1987)MATHGoogle Scholar
  3. 3.
    Zhang, S., Qin, Z., Ling, C.X., Sheng, S.: Missing is Useful: Missing Values in Cost Sensitive Decision Trees. IEEE Transactions on Knowledge and Data Engineering 17(12) (2005)Google Scholar
  4. 4.
    Rady, E.A., Abd, El-Monsef, M.M.E., Abd, El-Latif, W.A.: A Modified Rough Set Approach to Incomplete Information Systems. Journal of Applied Mathematics and Decision Sciences 2007, article ID 58248 (2007)Google Scholar
  5. 5.
    Satya Kumar, D.V.R., Sriram, K., Rao, K.M., Murty, U.S.: Management of Filariasis Using Prediction Rules Derived from Data Mining. In: Bioinformation by Biomedical Informatics Publishing Group (2005)Google Scholar
  6. 6.
    Palaniappan, S., Awang, R.: Intelligent Heart Disease Prediction System using Data Mining Techniques. International Journal of Computer Science and Network Security 8(8) (2008)Google Scholar
  7. 7.
    Liu, P., Lei, L.: A Review of Missing Data Treatment Methods. Intelligent Information Management Systems and Technologies 1(3), 412–419 (2005)Google Scholar
  8. 8.
    Mehala, B., Ranjit Jeba Thangaiah, P., Vivekanandan, K.: Selecting Scalable Algorithms to Deal with Missing Values. International Journal of Recent Trends in Engineering 1(2) (2009)Google Scholar
  9. 9.
    Adbdella, M., Marwala, T.: Treatment of Missing Data Using Neural Networks and Genetic Algorithms. In: International Joint Conference on Neural Networks, Canada (2005)Google Scholar
  10. 10.
    Magnani, M.: Techniques for Dealing with Missing Data in Knowledge Discovery Tasks, Department of Computer Science, University of Bologna (2004)Google Scholar
  11. 11.
    Acuna, E., Rodriguez, C.: The Treatment of Missing Values and its Effect in the Classifier Accuracy. In: Multiscale Methods in Science and Engineering, pp. 639–647. Springer, Heidelberg (2004)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2009

Authors and Affiliations

  • Veena H. Bhat
    • 1
  • Prasanth G. Rao
    • 1
  • P. Deepa Shenoy
    • 1
  • K. R. Venugopal
    • 1
  • L. M. Patnaik
    • 2
  1. 1.University Visvesvaraya College of EngineeringBangalore UniversityBangaloreIndia
  2. 2.Vice Chancellor, Defence Institute of Advanced TechnologyDeemed UniversityPuneIndia

Personalised recommendations