Medication Regimen Extraction from Medical Conversations

Selvaraj, Sai P.; Konam, Sandeep

doi:10.1007/978-3-030-53352-6_18

Part of the book series: Studies in Computational Intelligence (SCI, volume 914)

Abstract

Extracting relevant information from medical conversations and providing it to doctors and patients might help address doctor burnout and patient forgetfulness. In this paper, we focus on extracting the Medication Regimen (dosage and frequency for medications) discussed in a medical conversation. We frame the problem as a Question Answering (QA) task and perform a comparative analysis of a QA approach, a new combined QA and Information Extraction approach, and other baselines. We use a small corpus of 6,692 annotated doctor-patient conversations for the task. Clinical conversation corpora are costly to create, difficult to handle (because of data privacy concerns), and thus scarce. We address this data scarcity challenge by applying data augmentation methods, using publicly available embeddings, and pretraining part of the network on a related task (summarization) to improve the model’s performance. Compared to the baseline, our best-performing models improve the ROUGE-1 F1 scores for dosage and frequency extraction from 54.28 and 37.13 to 89.57 and 45.94, respectively. Using our best-performing model, we present the first fully automated system that can extract Medication Regimen tags from spontaneous doctor-patient conversations with about 71% accuracy.
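For readers unfamiliar with the metric named in the abstract, the following is a minimal sketch of a unigram-overlap (ROUGE-1) F1 score. It is not the authors' evaluation code: the function name `rouge1_f1`, the lowercasing, the whitespace tokenization, and the example strings are all assumptions made for illustration. It returns a value between 0 and 1, whereas the scores quoted in the abstract appear to be on a 0 to 100 scale.

```python
from collections import Counter


def rouge1_f1(reference: str, candidate: str) -> float:
    """Unigram-overlap F1 between a reference and a candidate string.

    Assumption: tokens are lowercased, whitespace-separated words.
    """
    ref_counts = Counter(reference.lower().split())
    cand_counts = Counter(candidate.lower().split())

    # Clipped overlap: each word counts at most as often as it
    # appears in both strings.
    overlap = sum((ref_counts & cand_counts).values())
    if overlap == 0:
        return 0.0

    precision = overlap / sum(cand_counts.values())
    recall = overlap / sum(ref_counts.values())
    return 2 * precision * recall / (precision + recall)


# Hypothetical frequency annotation versus a hypothetical model output.
score = rouge1_f1("twice a day", "twice daily")
print(round(score, 2))
```

In this example only one unigram ("twice") is shared, so precision is 1/2, recall is 1/3, and the F1 score is 0.4. A paraphrased but clinically equivalent answer can therefore score low under this metric.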

Notes

1. This prevents overfitting and repetition when converting all the numbers to words.
2. This can happen when a different form of a medication (e.g. abbreviation, generic or brand name) is used in the conversation compared to the annotation.
3. https://cloud.google.com/speech-to-text/.
4. https://www.ibm.com/cloud/watson-speech-to-text.
5. Since we had high quality human written transcripts and our ASR transcripts did not contain spelling mistakes (as long as the word was correctly recognized), string matching worked well during testing.
6. https://spacy.io/api/entityrecognizer.

Acknowledgements

We thank: University of Pittsburgh Medical Center (UPMC) and Abridge AI Inc. for providing access to the de-identified data corpus; Dr. Shivdev Rao, CEO, Abridge AI Inc. and a practicing cardiologist in UPMC’s Heart and Vascular Institute, and Prof. Florian Metze, Associate Research Professor, Carnegie Mellon University for helpful discussions; Ben Schloss, Steven Coleman, and Deborah Osakue for data business development and annotation management.

Author information

Authors and Affiliations

Abridge AI Inc., Pittsburgh, USA
Sai P. Selvaraj & Sandeep Konam

Corresponding author

Correspondence to Sai P. Selvaraj.

Editor information

Editors and Affiliations

Department of Pediatrics, College of Medicine, The University of Tennessee Health Science Center (UTHSC), Oak-Ridge National Lab (ORNL), Memphis, TN, USA
Arash Shaban-Nejad
School of Nursing, University of Minnesota, Minneapolis, MN, USA
Martin Michalowski
McGill Clinical & Health Informatics, Montreal, QC, Canada
David L. Buckeridge

About this chapter

Cite this chapter

Selvaraj, S.P., Konam, S. (2021). Medication Regimen Extraction from Medical Conversations. In: Shaban-Nejad, A., Michalowski, M., Buckeridge, D.L. (eds) Explainable AI in Healthcare and Medicine. Studies in Computational Intelligence, vol 914. Springer, Cham. https://doi.org/10.1007/978-3-030-53352-6_18

DOI: https://doi.org/10.1007/978-3-030-53352-6_18
Published: 03 November 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-53351-9
Online ISBN: 978-3-030-53352-6
eBook Packages: Intelligent Technologies and Robotics (R0)
