Abstract
Electronic health records (EHRs) play an essential role in patient management and guideline-based care. However, EHRs often do not encode therapy protocols directly and instead catalog only the individual drug agents that patients receive. In this paper, we present an automated approach for identifying protocols from EHR data. We introduce a novel sequence alignment method, based on the Needleman-Wunsch algorithm, that models variation in treatment gaps. Using data on 178 breast cancer patients with manually annotated chemotherapy protocols, our method matched 93% of regimens based on the top score and reached 98% accuracy when the two top-scoring regimens were considered. These results indicate that our sequence alignment approach can accurately identify chemotherapy plans in patient event logs while measuring temporal variation in treatment administration.
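To make the alignment idea concrete, the following is a minimal sketch of standard Needleman-Wunsch global alignment applied to sequences of drug-event tokens. It is not the interval-encoded method the abstract describes: the scores (match +1, mismatch -1, gap -1), the single-letter event tokens, the regimen names, and the helper `rank_regimens` are all assumptions made for illustration, and treatment gaps are not modeled here.

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-1):
    """Return the global alignment score of sequences a and b.

    Standard Needleman-Wunsch with a linear gap penalty. The default
    scores are illustrative assumptions, not values from the paper.
    """
    n, m = len(a), len(b)
    # score[i][j] holds the best score for aligning a[:i] with b[:j].
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap  # a[:i] aligned against nothing
    for j in range(1, m + 1):
        score[0][j] = j * gap  # b[:j] aligned against nothing
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            pair = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(
                score[i - 1][j - 1] + pair,  # align a[i-1] with b[j-1]
                score[i - 1][j] + gap,       # gap in b
                score[i][j - 1] + gap,       # gap in a
            )
    return score[n][m]


def rank_regimens(patient_events, templates):
    """Rank hypothetical regimen templates by alignment score, best first.

    templates maps a regimen name to its expected event sequence.
    Ties are broken alphabetically by name.
    """
    scored = [
        (name, needleman_wunsch(patient_events, seq))
        for name, seq in templates.items()
    ]
    return sorted(scored, key=lambda item: (-item[1], item[0]))


if __name__ == "__main__":
    # Hypothetical tokens: each letter stands for one administered agent.
    patient = ["A", "C", "A", "C", "T"]
    templates = {
        "regimen_1": ["A", "C", "A", "C", "T", "T"],
        "regimen_2": ["T", "T", "T", "T"],
    }
    for name, s in rank_regimens(patient, templates):
        print(name, s)
```

Ranking every candidate template by score, as `rank_regimens` does, mirrors the evaluation in the abstract, which reports accuracy for the top-scoring regimen and for the top two.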
References
Lee, W.N., Das, A.K.: Local alignment tool for clinical history: temporal semantic search of clinical databases. In: AMIA Annu. Symp. Proc., pp. 437–441 (2010)
Lee, W.N.: Evaluating clinical practice variation using a knowledge-based temporal sequence alignment framework. Ph.D. Thesis. Stanford University: U.S.
Bouarfa, L., Dankelman, J.: Workflow mining and outlier detection from clinical activity logs. J. Biomed. Inform. 45(6), 1185–1190 (2012)
Combi, C., Gozzi, M., Oliboni, B., Juarez, J., Marin, R.: Temporal similarity measures for querying clinical workflows. Artificial Intelligence in Medicine 46, 37–54 (2009)
Montani, S., Leonardi, G.: Retrieval and clustering for supporting business process adjustment and analysis. Information Systems 40, 128–141 (2014)
Needleman, S.B., Wunsch, C.D.: A general method applicable to the search for similarities in the amino acid sequence of two proteins. Journal of Molecular Biology 48(3), 443–453 (1970)
Tu, S.W., Musen, M.A.: Episodic refinement of episodic skeletal-plan refinement. International Journal of Human–Computer Studies 48, 475–497 (1998)
Huang, Z., Dong, W., Ji, L., Gan, C., Lu, X., Duan, H.: Discovery of clinical pathway patterns from event logs using probabilistic topic models. J. Biomed. Inform. 47, 39–57 (2014)
Huang, Z., Lu, X., Duan, H.: On mining clinical pathway patterns from medical behaviors. Artif. Intell. Med. 56, 35–50 (2012)
Van der Aalst, W., Weijters, T., Maruster, L.: Workflow Mining: Discovering Process Models from Event Logs. IEEE Trans. Knowl. Data Eng. 16(9), 1128–1142 (2004)
Hares, B., Levy, M.: Automated plan-recognition of chemotherapy protocols. In: AMIA Annu. Symp. Proc., pp. 108–114 (2011)
Batal, I., Fradkin, D., Harrison, J., Moerchen, F., Hauskrecht, M.: Mining Recent Temporal Patterns for Event Detection in Multivariate Time Series Data. In: Proceedings of Knowledge Discovery and Data Mining (KDD), Beijing, China (2012)
Sacchi, L., Larizza, C., Combi, C., Bellazzi, R.: Data mining with temporal abstractions: learning rules from time series. Data Mining and Knowledge Discovery 15 (2007)
Combi, C., Franceschet, M., Peron, A.: Representing and Reasoning about Temporal Granularities. Journal of Logic and Computation 14(1), 51–77 (2004)
Juárez, J.M., Guil, F., Palma, J.T., Marín, R.: Temporal similarity by measuring possibilistic uncertainty in CBR. Fuzzy Sets and Systems 160(2), 214–230 (2009)
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Syed, H., Das, A.K. (2015). Identifying Chemotherapy Regimens in Electronic Health Record Data Using Interval-Encoded Sequence Alignment. In: Holmes, J., Bellazzi, R., Sacchi, L., Peek, N. (eds) Artificial Intelligence in Medicine. AIME 2015. Lecture Notes in Computer Science, vol 9105. Springer, Cham. https://doi.org/10.1007/978-3-319-19551-3_17
DOI: https://doi.org/10.1007/978-3-319-19551-3_17
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-19550-6
Online ISBN: 978-3-319-19551-3
eBook Packages: Computer Science, Computer Science (R0)