Responsible Artificial Intelligence in Healthcare: Predicting and Preventing Insurance Claim Denials for Economic and Social Wellbeing


It is estimated that one out of seven health insurance claims is rejected in the US; hospitals across the country lose approximately $262 billion annually due to denied claims. This widespread problem causes serious cash-flow issues for hospitals and overburdens patients. Preventing denials before claims are submitted to insurers therefore improves profitability, accelerates the revenue cycle, and supports patients’ wellbeing. This study uses the Design Science Research (DSR) paradigm to develop a Responsible Artificial Intelligence (RAI) solution that helps hospital administrators identify claims likely to be denied. Guided by five principles, the framework applies six AI algorithms, classified as white-box and glass-box, and employs cross-validation to tune hyperparameters and determine the best model. The results show that a model based on a white-box algorithm (AdaBoost) yields an AUC of 0.83, outperforming all other models. The primary implications of this research are to (1) help providers reduce operational costs and increase the efficiency of insurance claim processes and (2) help patients focus on their recovery instead of appealing denied claims.
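The model-selection step described in the abstract, cross-validation to tune hyperparameters followed by an AUC comparison, can be sketched roughly as follows. This is an illustrative example only, not the authors' implementation: the data are synthetic (generated with scikit-learn's `make_classification` at roughly a one-in-seven positive rate), the hyperparameter grid is an assumption, the function name `tune_and_score` is hypothetical, and only AdaBoost is shown rather than all six algorithms.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import GridSearchCV, StratifiedKFold, train_test_split


def tune_and_score(random_state=0):
    """Tune AdaBoost with 5-fold CV on synthetic data and report AUC values."""
    # Synthetic stand-in for claims data (assumption): about 1 in 7 rows
    # is labeled as the positive class, i.e. a "denied" claim.
    X, y = make_classification(
        n_samples=700,
        n_features=10,
        n_informative=5,
        weights=[6 / 7, 1 / 7],
        random_state=random_state,
    )
    # Hold out a stratified test set so the final AUC is measured on unseen rows.
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.25, stratify=y, random_state=random_state
    )
    # Stratified folds keep the denied/paid ratio similar in every fold.
    cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=random_state)
    # Hypothetical grid; the paper's actual search space is not given here.
    search = GridSearchCV(
        AdaBoostClassifier(random_state=random_state),
        param_grid={"n_estimators": [25, 50], "learning_rate": [0.5, 1.0]},
        scoring="roc_auc",
        cv=cv,
    )
    search.fit(X_train, y_train)
    # AUC on the held-out set, using the predicted probability of denial.
    test_auc = roc_auc_score(y_test, search.predict_proba(X_test)[:, 1])
    return search.best_params_, search.best_score_, test_auc


if __name__ == "__main__":
    best_params, cv_auc, test_auc = tune_and_score()
    print(best_params, round(cv_auc, 3), round(test_auc, 3))
```

Comparing several algorithms would repeat the same search for each candidate and keep the one with the highest cross-validated AUC; the numbers produced on synthetic data say nothing about the 0.83 reported in the study.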



Author information



Corresponding author

Correspondence to Antoine Harfouche.


About this article

Cite this article

Johnson, M., Albizri, A. & Harfouche, A. Responsible Artificial Intelligence in Healthcare: Predicting and Preventing Insurance Claim Denials for Economic and Social Wellbeing. Inf Syst Front (2021).


Keywords

  • Artificial intelligence
  • Analytics
  • Insurance claim denials
  • Design science
  • Responsible AI
  • White-box models