Skip to main content

Medication Regimen Extraction from Medical Conversations

  • Chapter
  • First Online:
Explainable AI in Healthcare and Medicine

Part of the book series: Studies in Computational Intelligence ((SCI,volume 914))

Abstract

Extracting relevant information from medical conversations and providing it to doctors and patients might help in addressing doctor burnout and patient forgetfulness. In this paper, we focus on extracting the Medication Regimen (dosage and frequency for medications) discussed in a medical conversation. We frame the problem as a Question Answering (QA) task and perform comparative analysis over: a QA approach, a new combined QA and Information Extraction approach, and other baselines. We use a small corpus of 6,692 annotated doctor-patient conversations for the task. Clinical conversation corpora are costly to create, difficult to handle (because of data privacy concerns), and thus scarce. We address this data scarcity challenge through data augmentation methods, using publicly available embeddings and pretrain part of the network on a related task (summarization) to improve the model’s performance. Compared to the baseline, our best-performing models improve the dosage and frequency extractions’ ROUGE-1 F1 scores from 54.28 and 37.13 to 89.57 and 45.94, respectively. Using our best-performing model, we present the first fully automated system that can extract Medication Regimen tags from spontaneous doctor-patient conversations with about \(\sim \)71% accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 149.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 199.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 199.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    This prevents overfitting and repetition when converting all the numbers to words.

  2. 2.

    This can happen when a different form of a medication (e.g. abbreviation, generic or brand name) is used in the conversation compared to the annotation.

  3. 3.

    https://cloud.google.com/speech-to-text/.

  4. 4.

    https://www.ibm.com/cloud/watson-speech-to-text.

  5. 5.

    Since we had high quality human written transcripts and our ASR transcripts did not contain spelling mistakes (as long as the word was correctly recognized), string matching worked well during testing.

  6. 6.

    https://spacy.io/api/entityrecognizer.

References

  1. Alsentzer, E., Murphy, J., Boag, W., Weng, W.H., Jindi, D., Naumann, T., McDermott, M.: Publicly available clinical BERT embeddings. In: Proceedings of the 2nd Clinical Natural Language Processing Workshop, Association for Computational Linguistics, Minneapolis, Minnesota, USA, pp. 72–78 (2019)

    Google Scholar 

  2. del Carmen, M.G., Herman, J., Rao, S., Hidrue, M.K., Ting, D., Lehrhoff, S.R., Lenz, S., Heffernan, J., Ferris, T.G.: Trends and factors associated with physician burnout at a multispecialty academic faculty practice organization. JAMA Netw. Open 2(3), e190554–e190554 (2019)

    Google Scholar 

  3. Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, vol. 1 (Long and Short Papers), pp. 4171–4186 (2019)

    Google Scholar 

  4. Finley, G., Edwards, E., Robinson, A., Brenndoerfer, M., Sadoughi, N., Fone, J., Axtmann, N., Miller, M., Suendermann-Oeft, D.: An automated medical scribe for documenting clinical encounters. In: Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Demonstrations, pp. 11–15 (2018)

    Google Scholar 

  5. GAO: Medical records: fees and challenges associated with patients’ access. United States Government Accountability Office, Report to Congressional Committees GAO-18-386 (2018)

    Google Scholar 

  6. Jeblee, S., Khattak, F.K., Crampton, N., Mamdani, M., Rudzicz, F.: Extracting relevant information from physician-patient dialogues for automated clinical note taking. In: Proceedings of the Tenth International Workshop on Health Text Mining and Information Analysis (LOUHI 2019), pp. 65–74 (2019)

    Google Scholar 

  7. Kannan, A., Chen, K., Jaunzeikare, D., Rajkomar, A.: Semi-supervised learning for information extraction from dialogue. In: Interspeech, pp. 2077–2081 (2018)

    Google Scholar 

  8. Kessels, R.P.: Patients’ memory for medical information. J. R. Soc. Med. 96(5), 219–222 (2003)

    Google Scholar 

  9. Kim, S., Dalmia, S., Metze, F.: Cross-attention end-to-end ASR for two-party conversations. arXiv preprint arXiv:190710726 (2019)

  10. Kodish-Wachs, J., Agassi, E., Kenny III, J.P.: A systematic comparison of contemporary automatic speech recognition engines for conversational clinical speech. In: AMIA Annual Symposium Proceedings, American Medical Informatics Association, vol. 2018, p. 683 (2018)

    Google Scholar 

  11. Kumar, S.: Burnout and doctors: prevalence, prevention and intervention. Healthcare 4(3), 37 (2016). Multidisciplinary Digital Publishing Institute

    Google Scholar 

  12. Leventhal, R.: Physician burnout addressed: how one medical group is (virtually) progressing. Healthcare Innovation (2018)

    Google Scholar 

  13. Liu, S., Ma, W., Moore, R., Ganesan, V., Nelson, S.: RxNorm: prescription for electronic drug information exchange. IT Prof. 7(5), 17–23 (2005)

    Google Scholar 

  14. Liu, Z., Lim, H., Suhaimi, N.F.A., Tong, S.C., Ong, S., Ng, A., Lee, S., Macdonald, M.R., Ramasamy, S., Krishnaswamy, P., et al.: Fast prototyping a dialogue comprehension system for nurse-patient conversations on symptom monitoring. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, vol. 2 (Industry Papers), pp. 24–31 (2019)

    Google Scholar 

  15. McCann, B., Keskar, N.S., Xiong, C., Socher, R.: The natural language decathlon: multitask learning as question answering. arXiv preprint arXiv:180608730 (2018)

  16. Peters, M.E., Neumann, M., Iyyer, M., Gardner, M., Clark, C., Lee, K., Zettlemoyer, L.: Deep contextualized word representations. In: Proceedings of NAACL-HLT, pp. 2227–2237 (2018)

    Google Scholar 

  17. See, A., Liu, P.J., Manning, C.D.: Get to the point: summarization with pointer-generator networks. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Long Papers), vol. 1, pp. 1073–1083 (2017)

    Google Scholar 

  18. Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, Ł., Polosukhin, I.: Attention is all you need. In: Advances in Neural Information Processing Systems, pp. 5998–6008 (2017)

    Google Scholar 

  19. Xiong, C., Zhong, V., Socher, R.: Dynamic coattention networks for question answering. arXiv preprint arXiv:161101604 (2016)

Download references

Acknowledgements

We thank: University of Pittsburgh Medical Center (UPMC) and Abridge AI Inc. for providing access to the de-identified data corpus; Dr. Shivdev Rao, CEO, Abridge AI Inc. and a practicing cardiologist in UPMC’s Heart and Vascular Institute, and Prlof. Florian Metze, Associate Research Professor, Carnegie Mellon University for helpful discussions; Ben Schloss, Steven Coleman, and Deborah Osakue for data business development and annotation management.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sai P. Selvaraj .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Selvaraj, S.P., Konam, S. (2021). Medication Regimen Extraction from Medical Conversations. In: Shaban-Nejad, A., Michalowski, M., Buckeridge, D.L. (eds) Explainable AI in Healthcare and Medicine. Studies in Computational Intelligence, vol 914. Springer, Cham. https://doi.org/10.1007/978-3-030-53352-6_18

Download citation

Publish with us

Policies and ethics