Abstract
Extracting relevant information from medical conversations and providing it to doctors and patients might help in addressing doctor burnout and patient forgetfulness. In this paper, we focus on extracting the Medication Regimen (dosage and frequency for medications) discussed in a medical conversation. We frame the problem as a Question Answering (QA) task and perform comparative analysis over: a QA approach, a new combined QA and Information Extraction approach, and other baselines. We use a small corpus of 6,692 annotated doctor-patient conversations for the task. Clinical conversation corpora are costly to create, difficult to handle (because of data privacy concerns), and thus scarce. We address this data scarcity challenge through data augmentation methods, using publicly available embeddings and pretrain part of the network on a related task (summarization) to improve the model’s performance. Compared to the baseline, our best-performing models improve the dosage and frequency extractions’ ROUGE-1 F1 scores from 54.28 and 37.13 to 89.57 and 45.94, respectively. Using our best-performing model, we present the first fully automated system that can extract Medication Regimen tags from spontaneous doctor-patient conversations with about \(\sim \)71% accuracy.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
This prevents overfitting and repetition when converting all the numbers to words.
- 2.
This can happen when a different form of a medication (e.g. abbreviation, generic or brand name) is used in the conversation compared to the annotation.
- 3.
- 4.
- 5.
Since we had high quality human written transcripts and our ASR transcripts did not contain spelling mistakes (as long as the word was correctly recognized), string matching worked well during testing.
- 6.
References
Alsentzer, E., Murphy, J., Boag, W., Weng, W.H., Jindi, D., Naumann, T., McDermott, M.: Publicly available clinical BERT embeddings. In: Proceedings of the 2nd Clinical Natural Language Processing Workshop, Association for Computational Linguistics, Minneapolis, Minnesota, USA, pp. 72–78 (2019)
del Carmen, M.G., Herman, J., Rao, S., Hidrue, M.K., Ting, D., Lehrhoff, S.R., Lenz, S., Heffernan, J., Ferris, T.G.: Trends and factors associated with physician burnout at a multispecialty academic faculty practice organization. JAMA Netw. Open 2(3), e190554–e190554 (2019)
Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, vol. 1 (Long and Short Papers), pp. 4171–4186 (2019)
Finley, G., Edwards, E., Robinson, A., Brenndoerfer, M., Sadoughi, N., Fone, J., Axtmann, N., Miller, M., Suendermann-Oeft, D.: An automated medical scribe for documenting clinical encounters. In: Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Demonstrations, pp. 11–15 (2018)
GAO: Medical records: fees and challenges associated with patients’ access. United States Government Accountability Office, Report to Congressional Committees GAO-18-386 (2018)
Jeblee, S., Khattak, F.K., Crampton, N., Mamdani, M., Rudzicz, F.: Extracting relevant information from physician-patient dialogues for automated clinical note taking. In: Proceedings of the Tenth International Workshop on Health Text Mining and Information Analysis (LOUHI 2019), pp. 65–74 (2019)
Kannan, A., Chen, K., Jaunzeikare, D., Rajkomar, A.: Semi-supervised learning for information extraction from dialogue. In: Interspeech, pp. 2077–2081 (2018)
Kessels, R.P.: Patients’ memory for medical information. J. R. Soc. Med. 96(5), 219–222 (2003)
Kim, S., Dalmia, S., Metze, F.: Cross-attention end-to-end ASR for two-party conversations. arXiv preprint arXiv:190710726 (2019)
Kodish-Wachs, J., Agassi, E., Kenny III, J.P.: A systematic comparison of contemporary automatic speech recognition engines for conversational clinical speech. In: AMIA Annual Symposium Proceedings, American Medical Informatics Association, vol. 2018, p. 683 (2018)
Kumar, S.: Burnout and doctors: prevalence, prevention and intervention. Healthcare 4(3), 37 (2016). Multidisciplinary Digital Publishing Institute
Leventhal, R.: Physician burnout addressed: how one medical group is (virtually) progressing. Healthcare Innovation (2018)
Liu, S., Ma, W., Moore, R., Ganesan, V., Nelson, S.: RxNorm: prescription for electronic drug information exchange. IT Prof. 7(5), 17–23 (2005)
Liu, Z., Lim, H., Suhaimi, N.F.A., Tong, S.C., Ong, S., Ng, A., Lee, S., Macdonald, M.R., Ramasamy, S., Krishnaswamy, P., et al.: Fast prototyping a dialogue comprehension system for nurse-patient conversations on symptom monitoring. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, vol. 2 (Industry Papers), pp. 24–31 (2019)
McCann, B., Keskar, N.S., Xiong, C., Socher, R.: The natural language decathlon: multitask learning as question answering. arXiv preprint arXiv:180608730 (2018)
Peters, M.E., Neumann, M., Iyyer, M., Gardner, M., Clark, C., Lee, K., Zettlemoyer, L.: Deep contextualized word representations. In: Proceedings of NAACL-HLT, pp. 2227–2237 (2018)
See, A., Liu, P.J., Manning, C.D.: Get to the point: summarization with pointer-generator networks. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Long Papers), vol. 1, pp. 1073–1083 (2017)
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, Ł., Polosukhin, I.: Attention is all you need. In: Advances in Neural Information Processing Systems, pp. 5998–6008 (2017)
Xiong, C., Zhong, V., Socher, R.: Dynamic coattention networks for question answering. arXiv preprint arXiv:161101604 (2016)
Acknowledgements
We thank: University of Pittsburgh Medical Center (UPMC) and Abridge AI Inc. for providing access to the de-identified data corpus; Dr. Shivdev Rao, CEO, Abridge AI Inc. and a practicing cardiologist in UPMC’s Heart and Vascular Institute, and Prlof. Florian Metze, Associate Research Professor, Carnegie Mellon University for helpful discussions; Ben Schloss, Steven Coleman, and Deborah Osakue for data business development and annotation management.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Switzerland AG
About this chapter
Cite this chapter
Selvaraj, S.P., Konam, S. (2021). Medication Regimen Extraction from Medical Conversations. In: Shaban-Nejad, A., Michalowski, M., Buckeridge, D.L. (eds) Explainable AI in Healthcare and Medicine. Studies in Computational Intelligence, vol 914. Springer, Cham. https://doi.org/10.1007/978-3-030-53352-6_18
Download citation
DOI: https://doi.org/10.1007/978-3-030-53352-6_18
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-53351-9
Online ISBN: 978-3-030-53352-6
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)