A survey on the state of healthcare upcoding fraud analysis and detection

  • Richard Bauder
  • Taghi M. Khoshgoftaar
  • Naeem Seliya
Article

Abstract

From its infancy in the 1910s, healthcare group insurance continues to increase, creating a consistently rising burden on the government and taxpayers. The growing number of people enrolled in healthcare programs such as Medicare, along with the enormous volume of money in the healthcare industry, increases the appeal for and risk of fraudulent activities. One such fraud, known as upcoding, is a means by which a provider can obtain additional reimbursement by coding a certain provided service as a more expensive service than what was actually performed. With the proliferation of data mining techniques and the recent and continued availability of public healthcare data, the application of these techniques towards fraud detection, using this increasing cache of data, has the potential to greatly reduce healthcare costs through a more robust detection of upcoding fraud. Presently, there is a sizable body of healthcare fraud detection research available but upcoding fraud studies are limited. Audit data can be difficult to obtain, limiting the usefulness of supervised learning; therefore, other data mining techniques, such as unsupervised learning, must be explored using mostly unlabeled records in order to detect upcoding fraud. This paper is specific to reviewing upcoding fraud analysis and detection research providing an overview of healthcare, upcoding, and a review of the current data mining techniques used therein.

Keywords

Healthcare Healthcare coding Upcoding Fraud and abuse Medicare Data mining 

Notes

Acknowledgments

The authors would like to thank the editor and the anonymous reviewers for their insightful comments. They would also like to thank various members of the Data Mining and Machine Learning Laboratory, Florida Atlantic University, Boca Raton, for their assistance reviewing this manuscript.

Compliance with ethical standards

Conflict of interest

The authors declare that they have no conflicts of interest.

Ethical approval

This article does not contain any studies with human participants or animals performed by any of the authors.

References

  1. Ahmad, P., Qamar, S., Rizvi, S.Q.A.: Techniques of data mining in healthcare: a review. Int. J. Comput. Appl. 120(15), 38–50 (2015)Google Scholar
  2. Aral, K.D., Güvenir, H.A., Sabuncuoğlu, İ., Akar, A.R.: A prescription fraud detection model. Comput. Methods Programs Biomed. 106(1), 37–46 (2012)CrossRefPubMedGoogle Scholar
  3. Bowblis, J.R., Brunt, C.S.: Medicare skilled nursing facility reimbursement and upcoding. Health Econ. 23(7), 821–840 (2014)CrossRefPubMedGoogle Scholar
  4. Bricker, E.: Physician upcoding: Does it happen? If so, how?. http://www.compassphs.com/blog/uncategorized/physician-upcoding-does-it-happen-if-so-how-2/ (2015)
  5. Brunt, C.S.: CPT fee differentials and visit upcoding under Medicare Part B. Health Econ. 20(7), 831–841 (2011)CrossRefPubMedGoogle Scholar
  6. Chandola, V., Sukumar, S.R., Schryver, J. C.: Knowledge discovery from massive healthcare claims data. In: Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ser. KDD ’13, pp. 1312–1320. ACM, New York, NY (2013)Google Scholar
  7. Cox, E.: A fuzzy system for detecting anomalous behaviors in healthcare provider claims. In: Intelligent Systems for Finance and Business, pp. 111–134. John Wiley & Sons (1995)Google Scholar
  8. Dave, D.M., Dadhich, P.: Applications of data mining techniques: empowering quality healthcare services. In: IJICCT (2013)Google Scholar
  9. Furlan, Š., Bajec, M.: Holistic approach to fraud management in health insurance. J. Inf. Organ. Sci. 32(2), 99–114 (2008)Google Scholar
  10. Gera, C., Joshi, K.: A survey on data mining techniques in the medicative field. Int. J. Comput. Appl. 113(13), 32–35 (2015)Google Scholar
  11. Hsia, R.Y., Antwi, Y.A.: Variation in charges for emergency department visits across California. Ann. Emerg. Med. 64(2), 120–126.e4 (2014)CrossRefPubMedPubMedCentralGoogle Scholar
  12. Johnson, M.E., Nagarur, N.: Multi-stage methodology to detect health insurance claim fraud. Health Care Manag. Sci. (2015). doi: 10.1007/s10729-015-9317-3
  13. Joudaki, H., Rashidian, A., Minaei-Bidgoli, B., Mahmoodi, M., Geraili, B., Nasiri, M., Arab, M.: Using data mining to detect health care fraud and abuse: a review of literature. Glob. J. Health Sci. 7(1), 194 (2015)Google Scholar
  14. Jürges, H., Köberlein, J.: First do no harm. Then do not cheat: DRG upcoding in German Neonatology. CESifo Group Munich, CESifo Working Paper Series 4341 (2013)Google Scholar
  15. King, K.M.: Medicare fraud: progress made, but more action needed to address Medicare fraud, waste, and abuse. http://www.gao.gov/products/GAO-14-560T (2014)
  16. Kumar, M., Ghani, R., Mei, Z.-S.: Data mining to predict and prevent errors in health insurance claims processing. In: Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ser. KDD ’10, pp. 65–74. ACM, New York, NY (2010)Google Scholar
  17. Liu, Q., Vasarhelyi, M.: Healthcare fraud detection: A survey and a clustering model incorporating geo-location information. In: 29th World Continuous Auditing and Reporting Symposium (29WCARS), Brisbane, Australia (2013)Google Scholar
  18. Lu, F., Boritz, J.E.: Detecting fraud in health insurance data: learning to model incomplete Benford’s law distributions. In: Machine Learning: ECML 2005: 16th European Conference on Machine Learning, Porto, Portugal, October 3–7, 2005. Proceedings, pp. 633–640. Springer, Berlin (2005)Google Scholar
  19. Luo, W., Gallagher, M.: Unsupervised DRG upcoding detection in healthcare databases. In: 2010 IEEE International Conference on Data Mining Workshops (ICDMW), pp. 600–605 (2010)Google Scholar
  20. Major, J.A., Riedinger, D.R.: EFD: a hybrid knowledge/statistical-based system for the detection of fraud. J. Risk Insur. 69(3), 309–324 (2002)CrossRefGoogle Scholar
  21. Morris, L.: Combating fraud in health care: an essential component of any cost containment strategy. http://content.healthaffairs.org/content/28/5/1351.full (2009)
  22. Munro, D.: Annual US healthcare spending hits $3.8 trillion. http://www.forbes.com/sites/danmunro/2014/02/02/annual-u-s-healthcare-spending-hits-3-8-trillion/ (2014)
  23. Ngufor, C., Wojtusiak, A.: Unsupervised labeling of data for supervised learning and its application to medical claims prediction. Comput. Sci. 14(2), 191–214 (2013)CrossRefGoogle Scholar
  24. Ornstein, C., Grochowski, R.J.: Top billing: meet the docs who charge Medicare top dollar for office visits. https://www.propublica.org/article/billing-to-the-max-docs-charge-medicare-top-rate-for-office-visits (2014)
  25. Peng, Y., Kou, G., Sabatka, A., Chen, Z., Khazanchi, D., Shi, Y.: Application of clustering methods to health insurance fraud detection. In: 2006 International Conference on Service Systems and Service Management vol. 1, pp. 116–120 (2006)Google Scholar
  26. Phua, C., Lee, V.C.S., Smith-Miles, K., Gayler, R.W.: A comprehensive survey of data mining-based fraud detection research. In: CoRR. arXiv:1009.6119 (2010)
  27. Rosenberg, M.A., Fryback, D.G., Katz, D.A.: A statistical model to detect DRG upcoding. Health Serv. Outcomes Res. Method. 1(3), 233–252 (2000)CrossRefGoogle Scholar
  28. Schönfelder, T., Klewer, J.: Methods to detect DRG-upcoding. Heilberufe 60, 6–12 (2008)CrossRefGoogle Scholar
  29. Silverman, E., Skinner, J.: Medicare upcoding and hospital ownership. J. Health Econ. 23(2), 369–389 (2004)CrossRefPubMedGoogle Scholar
  30. Steinbusch, P.J., Oostenbrink, J.B., Zuurbier, J.J., Schaepkens, F.J.: The risk of upcoding in casemix systems: a comparative study. Health Policy 81(23), 289–299 (2007)CrossRefPubMedGoogle Scholar
  31. Suresh, N., de Traversay, J., Gollamudi, H., Pathria, A., Tyler, M.: Detection of upcoding and code gaming fraud and abuse in prospective payment healthcare systems. US Patent 8,666,757 (2014)Google Scholar
  32. Sweeney, E.: Florida fraud case highlights concerns surrounding Medicare advantage upcoding. http://www.fiercehealthpayer.com/antifraud/story/florida-fraud-case-highlights-concerns-surrounding-medicare-advantage-upcod/2015-02-17 (2015)
  33. Tomar, D., Agarwal, S.: A survey on data mining approaches for healthcare. Int. J. Bio-Sci. Bio-Technol. 5(5), 241–266 (2013)CrossRefGoogle Scholar
  34. Travaille, P., Müller, R.M., Thornton, D., van Hillegersberg, J.: Electronic fraud detection in the US Medicaid healthcare program: lessons learned from other industries. In: 17th Americas Conference on Information Systems, AMCIS (2011)Google Scholar
  35. Witten, I .H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques. Morgan Kaufmann, Burlington (2005)Google Scholar
  36. Yamanishi, K., Takeuchi, J.-I., Williams, G., Milne, P.: On-line unsupervised outlier detection using finite mixtures with discounting learning algorithms. Data Min. Knowl. Disc. 8(3), 275–300 (2004)CrossRefGoogle Scholar

Copyright information

© Springer Science+Business Media New York 2016

Authors and Affiliations

  • Richard Bauder
    • 1
  • Taghi M. Khoshgoftaar
    • 1
  • Naeem Seliya
    • 2
  1. 1.College of Engineering and Computer ScienceFlorida Atlantic UniversityBoca RatonUSA
  2. 2.Health Safety Technologies, LLCMonticelloUSA

Personalised recommendations