Abstract
With the widespread use of healthcare information systems commonly known as electronic health records, there is significant scope for improving the way healthcare is delivered by resorting to the power of big data. This has made data mining and predictive analytics an important tool for healthcare decision making. The literature has reported attempts for knowledge discovery from the big data to improve the delivery of healthcare services, however, there appears no attempt for assessing and synthesizing the available information on how the big data phenomenon has contributed to better outcomes for the delivery of healthcare services. This paper aims to achieve this by systematically reviewing the existing body of knowledge to categorize and evaluate the reported studies on healthcare operations and data mining frameworks. The outcome of this study is useful as a reference for the practitioners and as a research platform for the academia.
Similar content being viewed by others
References
Agarwal, R., & Dhar, V. (2014). Editorial-Big data, data science, and analytics: The opportunity and challenge for IS research. Information Systems Research, 25, 443–448.
Amarasingham, R., Patzer, R. E., Huesch, M., Nguyen, N. Q., & Xie, B. (2014). Implementing electronic health care predictive analytics: Considerations and challenges. Health Affairs, 33, 1148–1154.
Anderson, J. E., & Chang, D. C. (2015). Using electronic health records for surgical quality improvement in the era of big data. JAMA Surgery, 150, 24–29.
Bates, D. W., Saria, S., Ohno-Machado, L., Shah, A., & Escobar, G. (2014). Big data in health care: Using analytics to identify and manage high-risk and high-cost patients. Health Affairs, 33, 1123–1131.
Bengoa, R., Kawar, R., Key, P., Leatherman, S., Massoud, R., & Saturno, P. (2006). Quality of care: A process for making strategic choices in health systems. Geneva: World Health Organization. WHO press.
Bonacina, S., Masseroli, M. & Pinciroli, F. (2005). Foreseeing promising bio-medical findings for effective applications of data mining. Biological and Medical Data Analysis, Springer.
Caron, F., Vanthienen, J., Vanhaecht, K., van Limbergen, E., de Weerdt, J., & Baesens, B. (2014). Monitoring care processes in the gynecologic oncology department. Computers in Biology and Medicine, 44, 88–96.
Carson, J. S. (2002). Model verification and validation. In Proceedings of the Winter Simulation Conference (pp. 52–58), IEEE.
Ceglowski, R., Churilov, L., & Wasserthiel, J. (2007). Combining data mining and discrete event simulation for a value-added view of a hospital emergency department. Journal of the Operational Research Society, 58, 246–254.
Chi, C.-L., Street, W. N., & Ward, M. M. (2008). Building a hospital referral expert system with a prediction and optimization-based decision support system algorithm. Journal of biomedical informatics, 41, 371–386.
Cornalba, C., Bellazzi, R. G., & Bellazzi, R. (2008). Building a normative decision support system for clinical and operational risk management in hemodialysis. IEEE Transactions on Information Technology in Biomedicine, 12, 678–686.
Delen, D., & Demirkan, H. (2013). Data, information and analytics as services. Decision Support Systems, 55, 359–363.
Demir, E. (2014). A decision support tool for predicting patients at risk of readmission: A comparison of classification trees, logistic regression, generalized additive models, and multivariate adaptive regression splines. Decision Sciences, 45, 849–880.
Dobrzykowski, D., Deilami, V. S., Hong, P., & Kim, S.-C. (2014). A structured analysis of operations and supply chain management research in healthcare (1982–2011). International Journal of Production Economics, 147, 514–530.
Dubey, R., Gunasekaran, A., Childe, S. J., Wamba, S. F., & Papadopoulos, T. (2016). The impact of big data on world-class sustainable manufacturing. The International Journal of Advanced Manufacturing Technology, 84, 631–645.
Fayyad, U., Piatetsky-Shapiro, G., & Smyth, P. (1996). From data mining to knowledge discovery in databases. AI Magazine, 17, 37.
Garg, L., McClean, S., Meenan, B., & Millard, P. (2009). Non-homogeneous Markov models for sequential pattern mining of healthcare data. IMA Journal of Management Mathematics, 20, 327–344.
Gheorghe, M., & Petre, R. (2014). Integrating data mining techniques into telemedicine systems. Informatica Economica, 18, 120–130.
Glowacka, K. J., Henry, R. M., & May, J. H. (2009). A hybrid data mining/simulation approach for modelling outpatient no-shows in clinic scheduling. Journal of the Operational Research Society, 60, 1056–1068.
Harper, P. (2005). Combining data mining tools with health care models for improved understanding of health processes and resource utilisation. Clinical and Investigative Medicine, 28, 338.
Haux, R., Ammenwerth, E., Herzog, W., & Knaup, P. (2002). Health care in the information society. A prognosis for the year 2013. International Journal of Medical Informatics, 66, 3–21.
Hazen, B. T., Boone, C. A., Ezell, J. D., & Jones-Farmer, L. A. (2014). Data quality for data science, predictive analytics, and big data in supply chain management: An introduction to the problem and suggestions for research and applications. International Journal of Production Economics, 154, 72–80.
James, B. C., & Savitz, L. A. (2011). How Intermountain trimmed health care costs through robust quality improvement efforts. Health Affairs, 30, 1185–1191.
Kinsman, L., Rotter, T., James, E., Snow, P., & Willis, J. (2010). What is a clinical pathway? Development of a definition to inform the debate. BMC medicine, 8, 1.
Koh, H. C., & Tan, G. (2011). Data mining applications in healthcare. Journal of Healthcare Information Management, 19, 65.
Kontio, E., Airola, A., Pahikkala, T., Lundgren-Laine, H., Junttila, K., Korvenranta, H., et al. (2014). Predicting patient acuity from electronic patient records. Journal of Biomedical Informatics, 51, 35–40.
Kudyba, S., & Gregorio, T. (2010). Identifying factors that impact patient length of stay metrics for healthcare providers with advanced analytics. Health Informatics Journal, 16, 235–245.
Kuo, N.-W. (2011). Information technology applications for geriatric consultation services in Taiwan. International Journal of Advancements in Computing Technology, 3, 44–52.
Langabeer, J. R, I. I., & Helton, J. (2015). Health care operations and systesm management. Health care operations management a systems perspective (2nd ed.). Burlington, MA: Jones and Bartlett Publishers.
Lavrač, N., Bohanec, M., Pur, A., Cestnik, B., Debeljak, M., & Kobler, A. (2007). Data mining and visualization for decision support and modeling of public health-care resources. Journal of Biomedical Informatics, 40, 438–447.
Lee, T.-T., Liu, C.-Y., Kuo, Y.-H., Mills, M. E., Fong, J.-G., & Hung, C. (2011). Application of data mining to the identification of critical factors in patient falls using a web-based reporting system. International Journal of Medical Informatics, 80, 141–150.
Lessmanna, S., Seowb, H., Baesenscd, B. & Thomasd, L. C. (2013). Benchmarking state-of-the-art classification algorithms for credit scoring: A ten-year update. In Credit Research Centre, Conference Archive.
Lin, F.-R., Chou, S.-C., Pan, S.-M., & Chen, Y.-M. (2001). Mining time dependency patterns in clinical pathways. International Journal of Medical Informatics, 62, 11–25.
Malik, M. M., Khan, M., & Abdallah, S. (2015). Aggregate capacity planning for elective surgeries: A bi-objective optimization approach to balance patients waiting with healthcare costs. Operations Research for Health Care, 7, 3–13.
Malin, B., Nyemba, S., & Paulett, J. (2011). Learning relational policies from electronic health record access logs. Journal of Biomedical Informatics, 44, 333–342.
Menon, A. K., Jiang, X., Kim, J., Vaidya, J., & Ohno-Machado, L. (2014). Detecting inappropriate access to electronic health records using collaborative filtering. Machine Learning, 95, 87–101.
Moullin, M. (2007). Performance measurement definitions: Linking performance measurement and organisational excellence. International Journal of Health Care Quality Assurance, 20, 181–183.
Ng, S.-K., McLachlan, G. J., & Lee, A. H. (2006). An incremental EM-based learning approach for on-line prediction of hospital resource utilization. Artificial Intelligence in Medicine, 36, 257–267.
Papadopoulos, T., Gunasekaran, A., Dubey, R., Altay, N., Childe, S. J., & Fosso-Wamba, S. (2016). The role of Big Data in explaining disaster resilience in supply chains for sustainability. Journal of Cleaner Production, 142, 1108–1118.
Patterson, J. R. F. (2009). Handbook of systems engineering and management. In A. P. Sage & W. B. Rouse (Eds.), System engineering life cycles: Life cycles for research, development, test and evaluation; acquisition; and planning and marketing. Hoboken: Wiley.
Porter, M. E., & Teisberg, E. O. (2006). Redefining health care: Creating value-based competition on results. Brighton: Harvard Business Press.
Rebuge, Á., & Ferreira, D. R. (2012). Business process analysis in healthcare environments: A methodology based on process mining. Information Systems, 37, 99–116.
Rousseau, D. M., Manning, J., & Denyer, D. (2008). 11 Evidence in management and organizational science: Assembling the field’s full weight of scientific knowledge through syntheses. The Academy of Management Annals, 2, 475–515.
Rovani, M., Maggi, F. M., de Leoni, M., & van der Aalst, W. M. (2015). Declarative process mining in healthcare. Expert Systems with Applications, 42, 9236–9251.
Rubrichi, S., & Quaglini, S. (2012). Summary of Product Characteristics content extraction for a safe drugs usage. Journal of Biomedical Informatics, 45, 231–239.
Saenz, M. J., & Koufteros, X. (2015). Special issue on literature reviews in supply chain management and logistics. International Journal of Physical Distribution and Logistics Management. doi:10.1108/ijpdlm-12-2014-0305.
Samorani, M., & Laganga, L. R. (2015). Outpatient appointment scheduling given individual day-dependent no-show predictions. European Journal of Operational Research, 240, 245–257.
Sargent, R. G. (2005). Verification and validation of simulation models. In Proceedings of the 37th conference on Winter simulation, winter simulation conference (pp. 130–143).
Seuring, S., & Gold, S. (2012). Conducting content-analysis based literature reviews in supply chain management. Supply Chain Management: An International Journal, 17, 544–555.
Shapiro, G., & Markoff, G. (1997). Methods for drawing statistical inferences from text and transcripts (pp. 3–31). Lawrence Erlbaum Associates, Mahwah, NJ: Text Analysis for the Social Sciences.
Siau, K. (2003). Health care informatics. IEEE Transactions on Information Technology in Biomedicine, 7, 1–7.
Spiegel, J. R., Mckenna, M. T., Lakshman, G. S., & Nordstrom, P. G. (2013). Method and system for anticipatory package shipping, USA patent application.
Spruit, M., Vroon, R., & Batenburg, R. (2014). Towards healthcare business intelligence in long-term care: An explorative case study in the Netherlands. Computers in Human Behavior, 30, 698–707.
Srinivas, K., Rani, B. K., & Govrdhan, A. (2010). Applications of data mining techniques in healthcare and prediction of heart attacks. International Journal on Computer Science and Engineering (IJCSE), 2, 250–255.
Testik, M. C., Ozkaya, B. Y., Aksu, S., & Ozcebe, O. I. (2012). Discovering blood donor arrival patterns using data mining: A method to investigate service quality at blood centers. Journal of Medical Systems, 36, 579–594.
Vissers, J., & Beech, R. (2005). Health operations management: Patient flow logistics in health care. Abingdon-on-Thames: Routledge.
Waller, M. A., & Fawcett, S. E. (2013). Data science, predictive analytics, and big data: A revolution that will transform supply chain design and management. Journal of Business Logistics, 34, 77–84.
Wamba, S. F., Akter, S., Edwards, A., Chopin, G., & Gnanzou, D. (2015). How ‘big data’can make big impact: Findings from a systematic review and a longitudinal case study. International Journal of Production Economics, 165, 234–246.
Wang, G., Gunasekaran, A., Ngai, E. W. T., & Papadopoulos, T. (2016). Big data analytics in logistics and supply chain management: Certain investigations for research and applications. International Journal of Production Economics, 176, 98–110.
Wirth, R. & Hipp, J. (2000). CRISP-DM: Towards a standard process model for data mining. In Proceedings of the 4th International Conference on the Practical Applications of Knowledge Discovery and Data Mining, Citeseer, (pp. 29–39).
Yao, Y., Zhong, N., & Zhao, Y. (2008). A conceptual framework of data mining. Data Mining: Foundations and Practice. Berlin: Springer.
Yoo, I., Alafaireet, P., Marinov, M., Pena-Hernandez, K., Gopidi, R., Chang, J.-F., et al. (2012). Data mining in healthcare and biomedicine: A survey of the literature. Journal of Medical Systems, 36, 2431–2448.
Zheng, B., Zhang, J., Yoon, S. W., Lam, S. S., Khasawneh, M., & Poranki, S. (2015). Predictive modeling of hospital readmissions using metaheuristics and data mining. Expert Systems with Applications, 42, 7110–7120.
Zhong, W., Chow, R., & He, J. (2012). Clinical charge profiles prediction for patients diagnosed with chronic diseases using Multi-level Support Vector Machine. Expert Systems with Applications, 39, 1474–1483.
Acknowledgements
This research is funded by a grant from the Centre of Sustainable Processes, Abu Dhabi University, United Arab Emirates.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Malik, M.M., Abdallah, S. & Ala’raj, M. Data mining and predictive analytics applications for the delivery of healthcare services: a systematic literature review. Ann Oper Res 270, 287–312 (2018). https://doi.org/10.1007/s10479-016-2393-z
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10479-016-2393-z