Skip to main content

Identifying Chemotherapy Regimens in Electronic Health Record Data Using Interval-Encoded Sequence Alignment

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9105))

Abstract

Electronic health records (EHRs) play an essential role in patient management and guideline-based care. However, EHRs often do not encode therapy protocols directly, and instead only catalog the individual drug agents patients receive. In this paper, we present an automated approach for protocol identification using EHR data. We introduce a novel sequence alignment method based on the Needleman-Wunsch algorithm that models variation in treatment gaps. Using data on 178 breast cancer patients that included manually annotated chemotherapy protocols, our method successfully matched 93% of regimens based on the top score and had 98% accuracy using the top two scored regimens. These results indicate that our sequence alignment approach can accurately find chemotherapy plans in patient event logs while measuring temporal variation in treatment administration.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Lee, W.N., Das, A.K.: Local alignment tool for clinical history: temporal semantic search of clinical databases. In: AMIA Annu. Symp. Proc., pp. 437–441 (2010)

    Google Scholar 

  2. Lee, W.N.: Evaluating clinical practice variation using a knowledge-based temporal sequence alignment framework. Ph.D. Thesis. Stanford University: U.S.

    Google Scholar 

  3. Bouarfa, L., Dankelman, J.: Workflow mining and outlier detection from clinical activity logs. J. Biomed. Inform. 45(6), 1185–1190 (2012)

    Article  Google Scholar 

  4. Combi, C., Gozzi, M., Oliboni, B., Juarez, J., Marin, R.: Temporal similarity measures for querying clinical workflows. Artificial Intelligence in Medicine 46, 37–54 (2009)

    Article  Google Scholar 

  5. Montani, S., Leonardi, G.: Retrieval and clustering for supporting business process adjustment and analysis. Information Systems 40, 128–141 (2014)

    Article  Google Scholar 

  6. Needleman, S.B., Wunsch, C.D.: A general method applicable to the search for similarities in the amino acid sequence of two proteins. Journal of Molecular Biology 48(3), 443–453 (1970)

    Article  Google Scholar 

  7. Tu, S.W., Musen, M.A.: Episodic refinement of episodic skeletal-plan refinement. International Journal of Human–Computer Studies 48, 475–497 (1998)

    Article  Google Scholar 

  8. Huang, Z., Dong, W., Ji, L., Gan, C., Lu, X., Duan, H.: Discovery of clinical pathway patterns from event logs using probabilistic topic models. J. Biomed. Inform. 47, 39–57 (2014)

    Article  Google Scholar 

  9. Huang, Z., Lu, X., Duan, H.: On mining clinical pathway patterns from medical behaviors. Artif. Intell. Med. 56, 35–50 (2012)

    Article  Google Scholar 

  10. Van der Aalst, W., Weijters, T., Maruster, L.: Workflow Mining Discovering Process Models from Event Logs. IEEE Trans. Knowl. Data Eng. 16(9), 1128–1142 (2004)

    Article  Google Scholar 

  11. Hares, B., Levy, M.: Automated plan-recognition of chemotherapy protocols. In: AMIA Annu. Symp. Proc., pp. 108–114 (2011)

    Google Scholar 

  12. Batal, I., Fradkin, D., Harrison, J., Moerchen, F., Hauskrecht, M.: Mining Recent Temporal Patterns for Event Detection in Multivariate Time Series Data. In: Proceedings of Knowledge Discovery and Data Mining (KDD), Beijing, China (2012b)

    Google Scholar 

  13. Sacchi, L., Larizza, C., Combi, C., Bellazi, R.: Data mining with temporal abstractions: learning rules from time series. Data Mining and Knowledge Discovery (15) (2007)

    Google Scholar 

  14. Combi, C., Franceschet, M., Peron, A.: Representing and Reasoning about Temporal Granularities. Journal of Logic and Computation 14(1), 51–77 (2004)

    Article  MATH  MathSciNet  Google Scholar 

  15. Juárez, J.M., Guil, F., Palma, J.T., Marín, R.: Temporal similarity by measuring possibilistic uncertainty in CBR. Fuzzy Sets and Systems 160(2), 214–230 (2009)

    Article  MATH  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Haider Syed .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Syed, H., Das, A.K. (2015). Identifying Chemotherapy Regimens in Electronic Health Record Data Using Interval-Encoded Sequence Alignment. In: Holmes, J., Bellazzi, R., Sacchi, L., Peek, N. (eds) Artificial Intelligence in Medicine. AIME 2015. Lecture Notes in Computer Science(), vol 9105. Springer, Cham. https://doi.org/10.1007/978-3-319-19551-3_17

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-19551-3_17

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-19550-6

  • Online ISBN: 978-3-319-19551-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics