A Comparison of Machine Learning Methods Applicable to Healthcare Claims Fraud Detection

  • Nnaemeka Obodoekwe
  • Dustin Terence van der HaarEmail author
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 918)


The healthcare industry has become a very important pillar in the modern society but has witnessed an increase in fraudulent activities. Traditional fraud detection methods have been used to detect potential fraud, but for certain cases they have been insufficient and time consuming. Data mining which has emerged as a very important process in knowledge discovery has been successfully applied in the health insurance claims fraud detection. We implemented a prototype that comprised different methods and a comparison of each of the methods was carried out to determine which method is most suited for the Medicare dataset. We found that while ensemble methods and neural net performed, the logistic regression and the naive bayes model did not perform well as depicted in the result.


Healthcare Fraud detection Machine learning 


  1. 1.
    Yu, H.: Impacts of rising health care costs on families with employment-based private insurance: a national analysis with state fixed effects. Health Serv. Res. 47(5), 2012–2030 (2012)CrossRefGoogle Scholar
  2. 2.
    Singh, A.: Fraud in insurance on rise. Technical report, Ernst & Young (2011)Google Scholar
  3. 3.
    Davis, L.E.: Growing health care fraud drastically affects all of us, October 2017Google Scholar
  4. 4.
    Rabiul, J., Nabeel, M., Ahsan, H., Sifat, M.: An evaluation of data processing solutions considering preprocessing and “special” features. In: 11th International Conference on Signal-Image Technology & Internet-Based Systems (2015)Google Scholar
  5. 5.
    McLeod, S.: Maslow’s hierarchy of needs. Simply Psychol. 1 (2007)Google Scholar
  6. 6.
    Bush, J., Sandridge, L., Treadway, C., Vance, K., Coustasse, A.: Medicare fraud, waste and abuse. In: Business and Health Administration Association Annual Conference (2017)Google Scholar
  7. 7.
    Thornton, D., van Capelleveen, G., Poel, M., van Hillegersberg, J., Mueller, R.M.: Outlier-based health insurance fraud detection for U.S. medicaid data. In: 16th International Conference on Enterprise Information Systems (2014)Google Scholar
  8. 8.
    Branting, L.K., Reeder, F., Gold, J., Champney, T.: Graph analytics for healthcare fraud risk estimation. In: Advances in Social Networks Analysis and Mining (ASONAM) (2016)Google Scholar
  9. 9.
    Bauder, R.A., Khoshgoftaar, T.M.: A probabilistic programming approach for outlier detection in healthcare claims. 15th IEEE International Conference on Machine Learning and Applications (ICMLA) (2016)Google Scholar
  10. 10.
    Bauder, R.A., Khoshgoftaar, T.M.: Medicare fraud detection using machine learning methods. In: 16th IEEE International Conference on Machine Learning and Applications (2017)Google Scholar

Copyright information

© Springer Nature Switzerland AG 2019

Authors and Affiliations

  • Nnaemeka Obodoekwe
    • 1
  • Dustin Terence van der Haar
    • 1
    Email author
  1. 1.Academy of Computer Science and Software EngineeringUniversity of JohannesburgJohannesburgSouth Africa

Personalised recommendations