Revisiting 3D Context Modeling with Supervised Pre-training for Universal Lesion Detection in CT Slices

Zhang, Shu; Xu, Jincheng; Chen, Yu-Chun; Ma, Jiechao; Li, Zihao; Wang, Yizhou; Yu, Yizhou

doi:10.1007/978-3-030-59719-1_53

Shu Zhang¹⁶,
Jincheng Xu¹⁶,
Yu-Chun Chen¹⁷,
Jiechao Ma¹⁷,
Zihao Li¹⁷,
Yizhou Wang^16,18,19 &
…
Yizhou Yu^17,20

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 12264))

Included in the following conference series:

International Conference on Medical Image Computing and Computer-Assisted Intervention

8980 Accesses
10 Citations

Abstract

Universal lesion detection from computed tomography (CT) slices is important for comprehensive disease screening. Since each lesion can locate in multiple adjacent slices, 3D context modeling is of great significance for developing automated lesion detection algorithms. In this work, we propose a Modified Pseudo-3D Feature Pyramid Network (MP3D FPN) that leverages depthwise separable convolutional filters and a group transform module (GTM) to efficiently extract 3D context enhanced 2D features for universal lesion detection in CT slices. To facilitate faster convergence, a novel 3D network pre-training method is derived using solely large-scale 2D object detection dataset in the natural image domain. We demonstrate that with the novel pre-training method, the proposed MP3D FPN achieves state-of-the-art detection performance on the DeepLesion dataset (3.48% absolute improvement in the sensitivity of FPs@0.5), significantly surpassing the baseline method by up to 6.06% (in MAP@0.5) which adopts 2D convolution for 3D context modeling. Moreover, the proposed 3D pre-trained weights can potentially be used to boost the performance of other 3D medical image analysis tasks.

This work was done when Jincheng Xu was an intern at Deepwise AI Lab.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 119.00; Price excludes VAT (USA)

Softcover Book: USD 159.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Lesion Detection with Deep Aggregated 3D Contextual Feature and Auxiliary Information

3D Context Enhanced Region-Based Convolutional Neural Network for End-to-End Lesion Detection

Multi-scale Convolutional Neural Network Based on 3D Context Fusion for Lesion Detection

References

Yan, K., Bagheri, M., Summers, R.M.: 3D context enhanced region-based convolutional neural network for end-to-end lesion detection. In: Frangi, A.F., Schnabel, J.A., Davatzikos, C., Alberola-López, C., Fichtinger, G. (eds.) MICCAI 2018. LNCS, vol. 11070, pp. 511–519. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-00928-1_58
Chapter Google Scholar
Shao, Q., Gong, L., Ma, K., Liu, H., Zheng, Y.: Attentive CT lesion detection using deep pyramid inference with multi-scale booster. In: Shen, D., et al. (eds.) MICCAI 2019. LNCS, vol. 11769, pp. 301–309. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-32226-7_34
Chapter Google Scholar
Zlocha, M., Dou, Q., Glocker, B.: Improving RetinaNet for CT lesion detection with dense masks from weak RECIST labels. In: Shen, D., et al. (eds.) MICCAI 2019. LNCS, vol. 11769, pp. 402–410. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-32226-7_45
Chapter Google Scholar
Li, Z., Zhang, S., Zhang, J., Huang, K., Wang, Y., Yu, Y.: MVP-Net: multi-view FPN with position-aware attention for deep universal lesion detection. In: Shen, D., et al. (eds.) MICCAI 2019. LNCS, vol. 11769, pp. 13–21. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-32226-7_2
Chapter Google Scholar
Yan, K., et al.: MULAN: multitask universal lesion analysis network for joint lesion detection, tagging, and segmentation. In: Shen, D., et al. (eds.) MICCAI 2019. LNCS, vol. 11769, pp. 194–202. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-32226-7_22
Chapter Google Scholar
Zhou, Z., et al.: Models genesis: generic autodidactic models for 3D medical image analysis. In: Shen, D., et al. (eds.) MICCAI 2019. LNCS, vol. 11767, pp. 384–393. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-32251-9_42
Chapter Google Scholar
Chen, S., Ma, K., Zheng, Y.: Med3d: transfer learning for 3D medical image analysis. arXiv preprint arXiv:1904.00625 (2019)
Qiu, Z., Yao, T., Mei, T.: Learning spatio-temporal representation with pseudo-3D residual networks. In: proceedings of the IEEE International Conference on Computer Vision, pp. 5533–5541 (2017)
Google Scholar
Yang, J., Huang, X., Ni, B., Xu, J., Yang, C., Xu, G.: Reinventing 2D convolutions for 3D medical images. arXiv preprint arXiv:1911.10477 (2019)
Lakhani, P., Sundaram, B.: Deep learning at chest radiography: automated classification of pulmonary tuberculosis by using convolutional neural networks. Radiology 284(2), 574–582 (2017)
Article Google Scholar
He, K., Girshick, R., Dollár, P.: Rethinking imagenet pre-training. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 4918–4927 (2019)
Google Scholar
Lin, T.Y., Dollár, P., Girshick, R., He, K., Hariharan, B., Belongie, S.: Feature pyramid networks for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2117–2125 (2017)
Google Scholar
Fang, C., Li, G., Pan, C., Li, Y., Yu, Y.: Globally guided progressive fusion network for 3D pancreas segmentation. MICCAI 2019. LNCS, vol. 11765, pp. 210–218. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-32245-8_24
Chapter Google Scholar
Wu, Y., He, K.: Group normalization. In: Proceedings of the European Conference on Computer Vision (ECCV), pp. 3–19 (2018)
Google Scholar
Lin, T.-Y., et al.: Microsoft COCO: common objects in context. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8693, pp. 740–755. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10602-1_48
Chapter Google Scholar

Download references

Acknowledgements

This work is funded by National Key Research and Development Program of China (No. 2019YFC0118101), MOST-2018AAA0102004 and NSFC-61625201. We would like to thank Yemin Shi for valuable discussions.

Author information

Authors and Affiliations

Department of Computer Science, Peking University, Beijing, China
Shu Zhang, Jincheng Xu & Yizhou Wang
Deepwise AI Lab, Beijing, China
Yu-Chun Chen, Jiechao Ma, Zihao Li & Yizhou Yu
Advanced Institute of Information Technology, Peking University, Hangzhou, China
Yizhou Wang
Center on Frontiers of Computing Studies, Peking University, Beijing, China
Yizhou Wang
The University of Hong Kong, Pokfulam, Hong Kong
Yizhou Yu

Authors

Shu Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Jincheng Xu
View author publications
You can also search for this author in PubMed Google Scholar
Yu-Chun Chen
View author publications
You can also search for this author in PubMed Google Scholar
Jiechao Ma
View author publications
You can also search for this author in PubMed Google Scholar
Zihao Li
View author publications
You can also search for this author in PubMed Google Scholar
Yizhou Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yizhou Yu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yizhou Yu .

Editor information

Editors and Affiliations

University of Toronto, Toronto, ON, Canada
Anne L. Martel
The University of British Columbia, Vancouver, BC, Canada
Purang Abolmaesumi
University College London, London, UK
Danail Stoyanov
École Centrale de Nantes, Nantes, France
Diana Mateus
EURECOM, Biot, France
Maria A. Zuluaga
Chinese Academy of Sciences, Beijing, China
S. Kevin Zhou
Sorbonne University, Paris, France
Daniel Racoceanu
The Hebrew University of Jerusalem, Jerusalem, Israel
Leo Joskowicz

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhang, S. et al. (2020). Revisiting 3D Context Modeling with Supervised Pre-training for Universal Lesion Detection in CT Slices. In: Martel, A.L., et al. Medical Image Computing and Computer Assisted Intervention – MICCAI 2020. MICCAI 2020. Lecture Notes in Computer Science(), vol 12264. Springer, Cham. https://doi.org/10.1007/978-3-030-59719-1_53

Download citation

DOI: https://doi.org/10.1007/978-3-030-59719-1_53
Published: 29 September 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-59718-4
Online ISBN: 978-3-030-59719-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The Medical Image Computing and Computer Assisted Intervention Society (opens in a new tab)

Revisiting 3D Context Modeling with Supervised Pre-training for Universal Lesion Detection in CT Slices

Abstract

Access this chapter

Similar content being viewed by others

Lesion Detection with Deep Aggregated 3D Contextual Feature and Auxiliary Information

3D Context Enhanced Region-Based Convolutional Neural Network for End-to-End Lesion Detection

Multi-scale Convolutional Neural Network Based on 3D Context Fusion for Lesion Detection

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Societies and partnerships

Navigation

Revisiting 3D Context Modeling with Supervised Pre-training for Universal Lesion Detection in CT Slices

Abstract

Access this chapter

Similar content being viewed by others

Lesion Detection with Deep Aggregated 3D Contextual Feature and Auxiliary Information

3D Context Enhanced Region-Based Convolutional Neural Network for End-to-End Lesion Detection

Multi-scale Convolutional Neural Network Based on 3D Context Fusion for Lesion Detection

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Societies and partnerships

Search

Navigation