Skip to main content

Advertisement

Log in

Data mining and predictive analytics applications for the delivery of healthcare services: a systematic literature review

  • Big Data Analytics in Operations & Supply Chain Management
  • Published:
Annals of Operations Research Aims and scope Submit manuscript

Abstract

With the widespread use of healthcare information systems commonly known as electronic health records, there is significant scope for improving the way healthcare is delivered by resorting to the power of big data. This has made data mining and predictive analytics an important tool for healthcare decision making. The literature has reported attempts for knowledge discovery from the big data to improve the delivery of healthcare services, however, there appears no attempt for assessing and synthesizing the available information on how the big data phenomenon has contributed to better outcomes for the delivery of healthcare services. This paper aims to achieve this by systematically reviewing the existing body of knowledge to categorize and evaluate the reported studies on healthcare operations and data mining frameworks. The outcome of this study is useful as a reference for the practitioners and as a research platform for the academia.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

References

  • Agarwal, R., & Dhar, V. (2014). Editorial-Big data, data science, and analytics: The opportunity and challenge for IS research. Information Systems Research, 25, 443–448.

    Article  Google Scholar 

  • Amarasingham, R., Patzer, R. E., Huesch, M., Nguyen, N. Q., & Xie, B. (2014). Implementing electronic health care predictive analytics: Considerations and challenges. Health Affairs, 33, 1148–1154.

    Article  Google Scholar 

  • Anderson, J. E., & Chang, D. C. (2015). Using electronic health records for surgical quality improvement in the era of big data. JAMA Surgery, 150, 24–29.

    Article  Google Scholar 

  • Bates, D. W., Saria, S., Ohno-Machado, L., Shah, A., & Escobar, G. (2014). Big data in health care: Using analytics to identify and manage high-risk and high-cost patients. Health Affairs, 33, 1123–1131.

    Article  Google Scholar 

  • Bengoa, R., Kawar, R., Key, P., Leatherman, S., Massoud, R., & Saturno, P. (2006). Quality of care: A process for making strategic choices in health systems. Geneva: World Health Organization. WHO press.

    Google Scholar 

  • Bonacina, S., Masseroli, M. & Pinciroli, F. (2005). Foreseeing promising bio-medical findings for effective applications of data mining. Biological and Medical Data Analysis, Springer.

  • Caron, F., Vanthienen, J., Vanhaecht, K., van Limbergen, E., de Weerdt, J., & Baesens, B. (2014). Monitoring care processes in the gynecologic oncology department. Computers in Biology and Medicine, 44, 88–96.

    Article  Google Scholar 

  • Carson, J. S. (2002). Model verification and validation. In Proceedings of the Winter Simulation Conference (pp. 52–58), IEEE.

  • Ceglowski, R., Churilov, L., & Wasserthiel, J. (2007). Combining data mining and discrete event simulation for a value-added view of a hospital emergency department. Journal of the Operational Research Society, 58, 246–254.

    Article  Google Scholar 

  • Chi, C.-L., Street, W. N., & Ward, M. M. (2008). Building a hospital referral expert system with a prediction and optimization-based decision support system algorithm. Journal of biomedical informatics, 41, 371–386.

    Article  Google Scholar 

  • Cornalba, C., Bellazzi, R. G., & Bellazzi, R. (2008). Building a normative decision support system for clinical and operational risk management in hemodialysis. IEEE Transactions on Information Technology in Biomedicine, 12, 678–686.

    Article  Google Scholar 

  • Delen, D., & Demirkan, H. (2013). Data, information and analytics as services. Decision Support Systems, 55, 359–363.

    Article  Google Scholar 

  • Demir, E. (2014). A decision support tool for predicting patients at risk of readmission: A comparison of classification trees, logistic regression, generalized additive models, and multivariate adaptive regression splines. Decision Sciences, 45, 849–880.

    Article  Google Scholar 

  • Dobrzykowski, D., Deilami, V. S., Hong, P., & Kim, S.-C. (2014). A structured analysis of operations and supply chain management research in healthcare (1982–2011). International Journal of Production Economics, 147, 514–530.

    Article  Google Scholar 

  • Dubey, R., Gunasekaran, A., Childe, S. J., Wamba, S. F., & Papadopoulos, T. (2016). The impact of big data on world-class sustainable manufacturing. The International Journal of Advanced Manufacturing Technology, 84, 631–645.

    Article  Google Scholar 

  • Fayyad, U., Piatetsky-Shapiro, G., & Smyth, P. (1996). From data mining to knowledge discovery in databases. AI Magazine, 17, 37.

    Google Scholar 

  • Garg, L., McClean, S., Meenan, B., & Millard, P. (2009). Non-homogeneous Markov models for sequential pattern mining of healthcare data. IMA Journal of Management Mathematics, 20, 327–344.

    Article  Google Scholar 

  • Gheorghe, M., & Petre, R. (2014). Integrating data mining techniques into telemedicine systems. Informatica Economica, 18, 120–130.

    Article  Google Scholar 

  • Glowacka, K. J., Henry, R. M., & May, J. H. (2009). A hybrid data mining/simulation approach for modelling outpatient no-shows in clinic scheduling. Journal of the Operational Research Society, 60, 1056–1068.

    Article  Google Scholar 

  • Harper, P. (2005). Combining data mining tools with health care models for improved understanding of health processes and resource utilisation. Clinical and Investigative Medicine, 28, 338.

    Google Scholar 

  • Haux, R., Ammenwerth, E., Herzog, W., & Knaup, P. (2002). Health care in the information society. A prognosis for the year 2013. International Journal of Medical Informatics, 66, 3–21.

    Article  Google Scholar 

  • Hazen, B. T., Boone, C. A., Ezell, J. D., & Jones-Farmer, L. A. (2014). Data quality for data science, predictive analytics, and big data in supply chain management: An introduction to the problem and suggestions for research and applications. International Journal of Production Economics, 154, 72–80.

    Article  Google Scholar 

  • James, B. C., & Savitz, L. A. (2011). How Intermountain trimmed health care costs through robust quality improvement efforts. Health Affairs, 30, 1185–1191.

    Article  Google Scholar 

  • Kinsman, L., Rotter, T., James, E., Snow, P., & Willis, J. (2010). What is a clinical pathway? Development of a definition to inform the debate. BMC medicine, 8, 1.

    Article  Google Scholar 

  • Koh, H. C., & Tan, G. (2011). Data mining applications in healthcare. Journal of Healthcare Information Management, 19, 65.

    Google Scholar 

  • Kontio, E., Airola, A., Pahikkala, T., Lundgren-Laine, H., Junttila, K., Korvenranta, H., et al. (2014). Predicting patient acuity from electronic patient records. Journal of Biomedical Informatics, 51, 35–40.

    Article  Google Scholar 

  • Kudyba, S., & Gregorio, T. (2010). Identifying factors that impact patient length of stay metrics for healthcare providers with advanced analytics. Health Informatics Journal, 16, 235–245.

    Article  Google Scholar 

  • Kuo, N.-W. (2011). Information technology applications for geriatric consultation services in Taiwan. International Journal of Advancements in Computing Technology, 3, 44–52.

    Article  Google Scholar 

  • Langabeer, J. R, I. I., & Helton, J. (2015). Health care operations and systesm management. Health care operations management a systems perspective (2nd ed.). Burlington, MA: Jones and Bartlett Publishers.

    Google Scholar 

  • Lavrač, N., Bohanec, M., Pur, A., Cestnik, B., Debeljak, M., & Kobler, A. (2007). Data mining and visualization for decision support and modeling of public health-care resources. Journal of Biomedical Informatics, 40, 438–447.

    Article  Google Scholar 

  • Lee, T.-T., Liu, C.-Y., Kuo, Y.-H., Mills, M. E., Fong, J.-G., & Hung, C. (2011). Application of data mining to the identification of critical factors in patient falls using a web-based reporting system. International Journal of Medical Informatics, 80, 141–150.

    Article  Google Scholar 

  • Lessmanna, S., Seowb, H., Baesenscd, B. & Thomasd, L. C. (2013). Benchmarking state-of-the-art classification algorithms for credit scoring: A ten-year update. In Credit Research Centre, Conference Archive.

  • Lin, F.-R., Chou, S.-C., Pan, S.-M., & Chen, Y.-M. (2001). Mining time dependency patterns in clinical pathways. International Journal of Medical Informatics, 62, 11–25.

    Article  Google Scholar 

  • Malik, M. M., Khan, M., & Abdallah, S. (2015). Aggregate capacity planning for elective surgeries: A bi-objective optimization approach to balance patients waiting with healthcare costs. Operations Research for Health Care, 7, 3–13.

    Article  Google Scholar 

  • Malin, B., Nyemba, S., & Paulett, J. (2011). Learning relational policies from electronic health record access logs. Journal of Biomedical Informatics, 44, 333–342.

    Article  Google Scholar 

  • Menon, A. K., Jiang, X., Kim, J., Vaidya, J., & Ohno-Machado, L. (2014). Detecting inappropriate access to electronic health records using collaborative filtering. Machine Learning, 95, 87–101.

    Article  Google Scholar 

  • Moullin, M. (2007). Performance measurement definitions: Linking performance measurement and organisational excellence. International Journal of Health Care Quality Assurance, 20, 181–183.

    Article  Google Scholar 

  • Ng, S.-K., McLachlan, G. J., & Lee, A. H. (2006). An incremental EM-based learning approach for on-line prediction of hospital resource utilization. Artificial Intelligence in Medicine, 36, 257–267.

    Article  Google Scholar 

  • Papadopoulos, T., Gunasekaran, A., Dubey, R., Altay, N., Childe, S. J., & Fosso-Wamba, S. (2016). The role of Big Data in explaining disaster resilience in supply chains for sustainability. Journal of Cleaner Production, 142, 1108–1118.

    Article  Google Scholar 

  • Patterson, J. R. F. (2009). Handbook of systems engineering and management. In A. P. Sage & W. B. Rouse (Eds.), System engineering life cycles: Life cycles for research, development, test and evaluation; acquisition; and planning and marketing. Hoboken: Wiley.

    Google Scholar 

  • Porter, M. E., & Teisberg, E. O. (2006). Redefining health care: Creating value-based competition on results. Brighton: Harvard Business Press.

    Google Scholar 

  • Rebuge, Á., & Ferreira, D. R. (2012). Business process analysis in healthcare environments: A methodology based on process mining. Information Systems, 37, 99–116.

    Article  Google Scholar 

  • Rousseau, D. M., Manning, J., & Denyer, D. (2008). 11 Evidence in management and organizational science: Assembling the field’s full weight of scientific knowledge through syntheses. The Academy of Management Annals, 2, 475–515.

    Article  Google Scholar 

  • Rovani, M., Maggi, F. M., de Leoni, M., & van der Aalst, W. M. (2015). Declarative process mining in healthcare. Expert Systems with Applications, 42, 9236–9251.

    Article  Google Scholar 

  • Rubrichi, S., & Quaglini, S. (2012). Summary of Product Characteristics content extraction for a safe drugs usage. Journal of Biomedical Informatics, 45, 231–239.

    Article  Google Scholar 

  • Saenz, M. J., & Koufteros, X. (2015). Special issue on literature reviews in supply chain management and logistics. International Journal of Physical Distribution and Logistics Management. doi:10.1108/ijpdlm-12-2014-0305.

  • Samorani, M., & Laganga, L. R. (2015). Outpatient appointment scheduling given individual day-dependent no-show predictions. European Journal of Operational Research, 240, 245–257.

    Article  Google Scholar 

  • Sargent, R. G. (2005). Verification and validation of simulation models. In Proceedings of the 37th conference on Winter simulation, winter simulation conference (pp. 130–143).

  • Seuring, S., & Gold, S. (2012). Conducting content-analysis based literature reviews in supply chain management. Supply Chain Management: An International Journal, 17, 544–555.

    Article  Google Scholar 

  • Shapiro, G., & Markoff, G. (1997). Methods for drawing statistical inferences from text and transcripts (pp. 3–31). Lawrence Erlbaum Associates, Mahwah, NJ: Text Analysis for the Social Sciences.

  • Siau, K. (2003). Health care informatics. IEEE Transactions on Information Technology in Biomedicine, 7, 1–7.

    Article  Google Scholar 

  • Spiegel, J. R., Mckenna, M. T., Lakshman, G. S., & Nordstrom, P. G. (2013). Method and system for anticipatory package shipping, USA patent application.

  • Spruit, M., Vroon, R., & Batenburg, R. (2014). Towards healthcare business intelligence in long-term care: An explorative case study in the Netherlands. Computers in Human Behavior, 30, 698–707.

    Article  Google Scholar 

  • Srinivas, K., Rani, B. K., & Govrdhan, A. (2010). Applications of data mining techniques in healthcare and prediction of heart attacks. International Journal on Computer Science and Engineering (IJCSE), 2, 250–255.

    Google Scholar 

  • Testik, M. C., Ozkaya, B. Y., Aksu, S., & Ozcebe, O. I. (2012). Discovering blood donor arrival patterns using data mining: A method to investigate service quality at blood centers. Journal of Medical Systems, 36, 579–594.

    Article  Google Scholar 

  • Vissers, J., & Beech, R. (2005). Health operations management: Patient flow logistics in health care. Abingdon-on-Thames: Routledge.

    Book  Google Scholar 

  • Waller, M. A., & Fawcett, S. E. (2013). Data science, predictive analytics, and big data: A revolution that will transform supply chain design and management. Journal of Business Logistics, 34, 77–84.

    Article  Google Scholar 

  • Wamba, S. F., Akter, S., Edwards, A., Chopin, G., & Gnanzou, D. (2015). How ‘big data’can make big impact: Findings from a systematic review and a longitudinal case study. International Journal of Production Economics, 165, 234–246.

    Article  Google Scholar 

  • Wang, G., Gunasekaran, A., Ngai, E. W. T., & Papadopoulos, T. (2016). Big data analytics in logistics and supply chain management: Certain investigations for research and applications. International Journal of Production Economics, 176, 98–110.

    Article  Google Scholar 

  • Wirth, R. & Hipp, J. (2000). CRISP-DM: Towards a standard process model for data mining. In Proceedings of the 4th International Conference on the Practical Applications of Knowledge Discovery and Data Mining, Citeseer, (pp. 29–39).

  • Yao, Y., Zhong, N., & Zhao, Y. (2008). A conceptual framework of data mining. Data Mining: Foundations and Practice. Berlin: Springer.

    Google Scholar 

  • Yoo, I., Alafaireet, P., Marinov, M., Pena-Hernandez, K., Gopidi, R., Chang, J.-F., et al. (2012). Data mining in healthcare and biomedicine: A survey of the literature. Journal of Medical Systems, 36, 2431–2448.

    Article  Google Scholar 

  • Zheng, B., Zhang, J., Yoon, S. W., Lam, S. S., Khasawneh, M., & Poranki, S. (2015). Predictive modeling of hospital readmissions using metaheuristics and data mining. Expert Systems with Applications, 42, 7110–7120.

    Article  Google Scholar 

  • Zhong, W., Chow, R., & He, J. (2012). Clinical charge profiles prediction for patients diagnosed with chronic diseases using Multi-level Support Vector Machine. Expert Systems with Applications, 39, 1474–1483.

    Article  Google Scholar 

Download references

Acknowledgements

This research is funded by a grant from the Centre of Sustainable Processes, Abu Dhabi University, United Arab Emirates.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to M. M. Malik.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Malik, M.M., Abdallah, S. & Ala’raj, M. Data mining and predictive analytics applications for the delivery of healthcare services: a systematic literature review. Ann Oper Res 270, 287–312 (2018). https://doi.org/10.1007/s10479-016-2393-z

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10479-016-2393-z

Keywords

Navigation