Skip to main content
Log in

Plane invariant segmentation of computed tomography images through weighted cross entropy optimized conditional GANs in compressed formats

  • Original Article
  • Published:
Medical & Biological Engineering & Computing Aims and scope Submit manuscript

Abstract

Computed tomography (CT) scan provides first-hand knowledge to doctors to identify an ailment. Deep neural networks help enhance image understanding through segmentation and labeling. In this work, we implement two variants of Pix2Pix generative adversarial networks (GANs) with varying complexities of generator and discriminator networks for plane invariant segmentation of CT scan images and subsequently propose an effective generative adversarial network with a suitably weighted binary cross-entropy loss function followed by image processing layer necessary for getting high-quality output segmentation. Our conditional GAN is powered by a unique set of an encoder-decoder network that coupled with the image processing layer produces enhanced segmentation. The network can be extended to the complete set of Hounsfield units and can also be implemented on smartphones. Furthermore, we also demonstrate effects on accuracy, F-1 score, and Jaccard index by using the conditional GAN networks on the spine vertebrae dataset, thus achieving an average of 86.28 % accuracy, 90.5 % Jaccard index score, and 89.9 % F-1 score in predicting segmented maps for validation input images. In addition, an overall lifting of accuracy, F-1 score, and Jaccard index graph for validation images with better continuity has also been highlighted.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18
Fig. 19

Similar content being viewed by others

References

  1. Tan Y (2021) GPU-based parallel implementation of swarm intelligence algorithms, 1st ed. 2016. Accessed 14 May 2021

  2. Goodfellow et al (2021) Generative adversarial networks, Communications of the ACM, vol 63 no. 11, pp. 139-144, 2020. Available: https://doi.org/10.1145/3422622 Accessed 15 Mar 2021

  3. Kass M, Witkin A, Terzopoulos D (1988) Snakes: active contour models, International Journal of Computer Vision, vo. 1, no 4, pp 321-331 Available: https://doi.org/10.1007/bf00133570 Accessed 20 Jun 2021

  4. Comaniciu D, Meer P (2002) Mean shift: a robust approach toward feature space analysis. IEEE Transactions on pattern analysis and machine intelligence, vol 24, no 5, pp 603-619, Available: https://doi.org/10.1109/34.1000236 Accessed 18 Mar 2021

  5. Osher S, Sethian J (1988) Fronts propagating with curvature-dependent speed: algorithms based on Hamilton-Jacobi formulations. Journal of Computational Physics, vol 79, no 1, pp 12-49, Available: https://doi.org/10.1016/0021-9991(88)900022 Accessed 24 Mar 2021

  6. Lecun Y, Bengio Y (1995) Convolutional networks for images, speech, and time series. The handbook of brain theory and neural networks, pp 255-258 Accessed: 25 Mar 2021

  7. Stehling M, Turner R, Mansfield P (1991) Echo-planar imaging: magnetic resonance imaging in a fraction of a second. Science, vol 254, no 5028, pp 43-50 Available: https://doi.org/10.1126/science.1925560Accessed: 8 Apr 2021

  8. McGibney G, Smith M, Nichols S, Crawley A (1993) Quantitative evaluation of several partial Fourier reconstruction algorithms used in MRI. Magnetic Resonance in Medicine, vol 30, no 1, pp 51-59 Available: https://doi.org/10.1002/mrm.1910300109 Accessed 10 Apr 2021

  9. Yang G et al (2018) DAGAN: deep de-aliasing generative adversarial networks for fast compressed sensing MRI reconstruction. IEEE Transactions on Medical Imaging, vol. 37, no. 6, pp 1310-1321 Available: https://doi.org/10.1109/tmi.2017.2785879 Accessed 18 Apr 2021

  10. VerSe‘19 - Grand Challenge, grand-challenge.org, 2021. [Online]. Available: https://verse2019.grand-challenge.org Accessed: 20 Apr 2021

  11. https://www.socr.umich.edu Brain Viewer Webapp, Socr.umich.edu, 2021. [Online]. Available: https://socr.umich.edu/HTML5/BrainViewer Accessed: 20 Apr 2021

  12. Wang X et al (2018) ESRGAN: enhanced super-resolution generative adversarial networks European conference on computer vision vol 2018 Accessed 23 Apr 2021

  13. Ledig C, Theis L, Huszar F, Caballero J, Cunningham A, Acosta A, Shi W (2017) Photo-realistic single image super-resolution using a generative adversarial network. IEEE Conference on Computer Vision and Pattern Recognition (CVPR). https://doi.org/10.1109/cvpr.2017.19

    Article  Google Scholar 

  14. Zhao Y et al (2020) Fine-grained facial image-to-image translation with an attention-based pipeline generative adversarial framework. Multimedia Tools and Applications, vol. 79, no. 21-22 pp. 14981-15000 Available: https://doi.org/10.1007/s11042-019-08346-x Accessed 23 Apr 2021

  15. Singh N, Raza K (2021) Medical image generation using generative adversarial networks: a review health informatics: a computational perspective in healthcare, pp. 77-96 Available: https://doi.org/10.1007/978-981-15-9735-0-5 Accessed 24 Apr 2021

  16. Radford A, Metz L (2016) Unsupervised representation learning with deep convolutional generative adversarial networks. International Conference on Learning Representations, vol 2 Accessed 24 Apr 2021

  17. Denton E, Chintala S, Szlam A, Fergus R (2015) Deep generative image models using a laplacian pyramid of adversarial networks. Advances in Neural Information Processing Systems 28 (NIPS 2015)

  18. Sheng M, Ma Z, Jia H, Mao Q, Dong M (2020) Face aging with conditional generative adversarial network guided by ranking-CNN. 2020 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR). https://doi.org/10.1109/mipr49039.2020.00071

  19. Zhu J, Park T, Isola P, Efros A (2017) Unpaired image-to-image translation using cycle-consistent adversarial networks 2017 IEEE International Conference on Computer Vision (ICCV) Available: https://doi.org/10.1109/iccv.2017.244 Accessed 25 Apr 2021

  20. Kang T, Lee KH (2020) Unsupervised image-to-image translation with self-attention networks. IEEE International Conference on Big Data and Smart Computing (BigComp). https://doi.org/10.1109/bigcomp48618.2020.00-92

  21. Brehm S, Scherer S, Lienhart R (2022) Semantically consistent image-to-image translation for unsupervised domain adaptation. Proceedings of the 14th International Conference on Agents and Artificial Intelligence. https://doi.org/10.5220/0010786000003116

  22. Zhou Y, Yang Z, Zhang H, Chang EI, Fan Y, Xu Y (2022) 3D segmentation guided style-based generative adversarial networks for pet synthesis. IEEE Transactions on Medical Imaging 41(8):2092–2104. https://doi.org/10.1109/tmi.2022.3156614

    Article  PubMed  Google Scholar 

  23. Liao Y, Huang Y (2022) Deep learning-based application of image style transfer. Mathematical Problems in Engineering 1–10. https://doi.org/10.1155/2022/1693892

  24. Wang Y (2020) A mathematical introduction to generative adversarial nets (GAN) arXiv e-prints Accessed 29 Apr 2021

  25. Grover P (2021) 5 regression loss functions all machine learners should know medium [online] Available: https://heartbeat.fritz.ai/5-regression-loss-functions-all-machine-learners-should-know-4fb140e9d4b0 Accessed 28 Apr 2021

  26. Löffler M et al (2020) A vertebral segmentation dataset with fracture grading, Radiology: Artificial Intelligence, vol 2 no 4, p e190138 Available: https://doi.org/10.1148/ryai.2020190138 Accessed 30 Apr 2021

  27. Sekuboyina A (2020) VerSe: a vertebrae labelling and segmentation benchmark for multi-detector CT images, arXiv e-prints. Accessed 10 Mar 2021

  28. Gorgolewski K et al The brain imaging data structure, a format for organizing and describing outputs of neuroimaging experiments", Scientific Data, vol 3 no 1, 2016. Available: https://doi.org/10.1038/sdata.2016.44 Accessed 20 Jul 2021

  29. Nazem F, Ghasemi F, Fassihi A, Dehnavi A (2021) 3D U-Net: A voxel-based method in binding site prediction of protein structure. Journal Of Bioinformatics And Computational Biology 19(02):2150006. https://doi.org/10.1142/s0219720021500062

    Article  CAS  PubMed  Google Scholar 

  30. Dutta A, Singh K, Anand A (2021) SpliceViNCI: Visualizing the splicing of non-canonical introns through recurrent neural networks. Journal Of Bioinformatics And Computational Biology, 19(04) 2150014. https://doi.org/10.1142/s0219720021500141

  31. Maas A, Hannun A, Ng A (2013) Rectifier nonlinearities improve neural network, acoustic models. ICML, 30

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Usman Khan.

Ethics declarations

Conflict of interest

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Khan, U., Yasin, A. Plane invariant segmentation of computed tomography images through weighted cross entropy optimized conditional GANs in compressed formats. Med Biol Eng Comput 61, 2677–2697 (2023). https://doi.org/10.1007/s11517-023-02846-7

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11517-023-02846-7

Keywords

Navigation