Labeling of Multilingual Breast MRI Reports

Tsai, Chen-Han; Kiryati, Nahum; Konen, Eli; Sklair-Levy, Miri; Mayer, Arnaldo

doi:10.1007/978-3-030-61166-8_25

Chen-Han Tsai²⁷,
Nahum Kiryati²⁸,
Eli Konen²⁹,
Miri Sklair-Levy²⁹ &
…
Arnaldo Mayer²⁹

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 12446))

Included in the following conference series:

International Workshop on Interpretability of Machine Intelligence in Medical Image Computing
International Workshop on Medical Image Learning with Less Labels and Imperfect Data
International Workshop on Large-scale Annotation of Biomedical data and Expert Label Synthesis

1305 Accesses

Abstract

Medical reports are an essential medium in recording a patient’s condition throughout a clinical trial. They contain valuable information that can be extracted to generate a large labeled dataset needed for the development of clinical tools. However, the majority of medical reports are stored in an unregularized format, and a trained human annotator (typically a doctor) must manually assess and label each case, resulting in an expensive and time consuming procedure. In this work, we present a framework for developing a multilingual breast MRI report classifier using a custom-built language representation called LAMBR. Our proposed method overcomes practical challenges faced in clinical settings, and we demonstrate improved performance in extracting labels from medical reports when compared with conventional approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
Breast Imaging-Reporting and Data System: a score between 0–6 indicating the level of severity of a breast lesion.

References

Bien, N., et al.: Deep-learning-assisted diagnosis for knee magnetic resonance imaging: development and retrospective validation of MRNet. Plos Med. 15(11), e1002699 (2018). https://doi.org/10.1371/journal.pmed.1002699
Article Google Scholar
Chronopoulou, A., Baziotis, C., Potamianos, A.: An embarrassingly simple approach for transfer learning from pretrained language models (2019)
Google Scholar
Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: Bert: pre-training of deep bidirectional transformers for language understanding (2018)
Google Scholar
Gozes, O., et al.: Rapid AI development cycle for the coronavirus (covid-19) pandemic: initial results for automated detection & patient monitoring using deep learning CT image analysis (2020)
Google Scholar
Howard, J., Ruder, S.: Universal language model fine-tuning for text classification (2018)
Google Scholar
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization (2014)
Google Scholar
Lee, J., et al.: BioBERT: a pre-trained biomedical language representation model for biomedical text mining. Bioinformatics 36, 1234–1240 (2019). https://doi.org/10.1093/bioinformatics/btz682
Article Google Scholar
Liu, Y., et al.: Roberta: a robustly optimized BERT pretraining approach (2019)
Google Scholar
Peters, M.E., et al.: Deep contextualized word representations. CoRR abs/1802.05365 (2018)
Google Scholar
Sharir, O., Peleg, B., Shoham, Y.: The cost of training NLP models: a concise overview. ArXiv abs/2004.08900 (2020)
Google Scholar
Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J., Wojna, Z.: Rethinking the inception architecture for computer vision (2015)
Google Scholar
Tsai, C.H., Kiryati, N., Konen, E., Eshed, I., Mayer, A.: Knee injury detection using MRI with efficiently-layered network (ELNet). ArXiv abs/2005.02706 (2020)
Google Scholar
Vaswani, A., et al.: Attention is all you need (2017)
Google Scholar
Wood, D.A., et al.: Automated labelling using an attention model for radiology reports of MRI scans (ALARM) (2020)
Google Scholar
Wu, Y., et al.: Google’s neural machine translation system: bridging the gap between human and machine translation (2016)
Google Scholar
Yang, Z., Dai, Z., Yang, Y., Carbonell, J., Salakhutdinov, R., Le, Q.V.: XLNet: generalized autoregressive pretraining for language understanding (2019)
Google Scholar
Yosinski, J., Clune, J., Bengio, Y., Lipson, H.: How transferable are features in deep neural networks? (2014)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Electrical Engineering, Tel Aviv University, Tel Aviv-Yafo, Israel
Chen-Han Tsai
The Manuel and Raquel Klachky Chair of Image Processing, School of Electrical Engineering, Tel-Aviv University, Tel Aviv-Yafo, Israel
Nahum Kiryati
Diagnostic Imaging, Sheba Medical Center, Affiliated to the Sackler School of Medicine, Tel-Aviv University, Tel Aviv-Yafo, Israel
Eli Konen, Miri Sklair-Levy & Arnaldo Mayer

Authors

Chen-Han Tsai
View author publications
You can also search for this author in PubMed Google Scholar
Nahum Kiryati
View author publications
You can also search for this author in PubMed Google Scholar
Eli Konen
View author publications
You can also search for this author in PubMed Google Scholar
Miri Sklair-Levy
View author publications
You can also search for this author in PubMed Google Scholar
Arnaldo Mayer
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Chen-Han Tsai .

Editor information

Editors and Affiliations

University of Porto, Porto, Portugal
Jaime Cardoso
University of Houston, Houston, TX, USA
Hien Van Nguyen
University of Minnesota, Minneapolis, MN, USA
Nicholas Heller
University of Coimbra, Coimbra, Portugal
Pedro Henriques Abreu
Amsterdam University Medical Center, Amsterdam, The Netherlands
Ivana Isgum
University of Porto, Porto, Portugal
Wilson Silva
University of Porto, Porto, Portugal
Ricardo Cruz
University of Coimbra, Coimbra, Portugal
Jose Pereira Amorim
Johns Hopkins University, Baltimore, MD, USA
Vishal Patel
University of Houston, Houston, TX, USA
Badri Roysam
Chinese Academy of Sciences, Beijing, China
Kevin Zhou
UT Southwestern Medical Center, Dallas, TX, USA
Steve Jiang
University of Arkansas, Fayetteville, AR, USA
Ngan Le
University of Arkansas, Fayetteville, AR, USA
Khoa Luu
University of Bern, Bern, Switzerland
Raphael Sznitman
Eindhoven University of Technology, Eindhoven, The Netherlands
Veronika Cheplygina
Technical University of Munich, Nantes, Germany
Diana Mateus
University of Dundee, Dundee, UK
Emanuele Trucco
Eindhoven University of Technology, Eindhoven, The Netherlands
Samaneh Abbasi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Tsai, CH., Kiryati, N., Konen, E., Sklair-Levy, M., Mayer, A. (2020). Labeling of Multilingual Breast MRI Reports. In: Cardoso, J., et al. Interpretable and Annotation-Efficient Learning for Medical Image Computing. IMIMIC MIL3ID LABELS 2020 2020 2020. Lecture Notes in Computer Science(), vol 12446. Springer, Cham. https://doi.org/10.1007/978-3-030-61166-8_25

Download citation

DOI: https://doi.org/10.1007/978-3-030-61166-8_25
Published: 02 October 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-61165-1
Online ISBN: 978-3-030-61166-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The Medical Image Computing and Computer Assisted Intervention Society (opens in a new tab)