Data Mining and Knowledge Discovery, Volume 29, Issue 4, pp 914–949

On mining latent treatment patterns from electronic medical records

  • Zhengxing Huang (corresponding author)
  • Wei Dong
  • Peter Bath
  • Lei Ji
  • Huilong Duan


Abstract

Clinical pathway (CP) analysis plays an important role in health-care management by ensuring specialized, standardized, normalized and sophisticated therapy procedures for individual patients. With the rapid development of hospital information systems, a large volume of electronic medical records (EMRs) has been produced, providing a comprehensive source for CP analysis. In this paper, we address the problem of using heterogeneous EMRs to assist CP analysis and improvement. Specifically, we develop a probabilistic topic model that links patient features and treatment behaviors to mine the treatment patterns hidden in EMRs. The discovered treatment patterns, as actionable knowledge representing the best practice for most patients during most of their treatment processes, form the backbone of CPs and can help physicians better understand their specialty and learn from previous experience when analyzing and improving CPs. Experimental results on a real collection of 985 EMRs from a Chinese hospital show that the proposed approach can effectively identify meaningful treatment patterns from EMRs.


Keywords

Clinical pathway analysis; Probabilistic topic models; Latent Dirichlet allocation; Pattern discovery; Electronic medical records



Acknowledgments

This work was supported by the National Natural Science Foundation of China under Grant No. 81101126, the National Hi-Tech R&D Plan of China under Grant No. 2012AA02A601, and the Fundamental Research Funds for the Central Universities under Grant No. 2014QNA5014. The authors give special thanks to all the experts who cooperated in the evaluation of the proposed method. The authors are especially thankful for the support received from the cooperating hospitals and from all the medical staff involved. The authors also thank the anonymous reviewers for their constructive comments on an earlier draft of this paper.



Copyright information

© The Author(s) 2014

Authors and Affiliations

  • Zhengxing Huang (1), corresponding author
  • Wei Dong (2)
  • Peter Bath (3)
  • Lei Ji (4)
  • Huilong Duan (1)

  1. The Key Laboratory of Biomedical Engineering, Ministry of Education, College of Biomedical Engineering and Instrument Science of Zhejiang University, Hangzhou, China
  2. Department of Cardiology, Chinese PLA General Hospital, Beijing, China
  3. Information School, University of Sheffield, Sheffield, UK
  4. IT Department, Chinese PLA General Hospital, Beijing, China
