Abstract
Computed tomography (CT) scan provides first-hand knowledge to doctors to identify an ailment. Deep neural networks help enhance image understanding through segmentation and labeling. In this work, we implement two variants of Pix2Pix generative adversarial networks (GANs) with varying complexities of generator and discriminator networks for plane invariant segmentation of CT scan images and subsequently propose an effective generative adversarial network with a suitably weighted binary cross-entropy loss function followed by image processing layer necessary for getting high-quality output segmentation. Our conditional GAN is powered by a unique set of an encoder-decoder network that coupled with the image processing layer produces enhanced segmentation. The network can be extended to the complete set of Hounsfield units and can also be implemented on smartphones. Furthermore, we also demonstrate effects on accuracy, F-1 score, and Jaccard index by using the conditional GAN networks on the spine vertebrae dataset, thus achieving an average of 86.28 % accuracy, 90.5 % Jaccard index score, and 89.9 % F-1 score in predicting segmented maps for validation input images. In addition, an overall lifting of accuracy, F-1 score, and Jaccard index graph for validation images with better continuity has also been highlighted.
Similar content being viewed by others
References
Tan Y (2021) GPU-based parallel implementation of swarm intelligence algorithms, 1st ed. 2016. Accessed 14 May 2021
Goodfellow et al (2021) Generative adversarial networks, Communications of the ACM, vol 63 no. 11, pp. 139-144, 2020. Available: https://doi.org/10.1145/3422622 Accessed 15 Mar 2021
Kass M, Witkin A, Terzopoulos D (1988) Snakes: active contour models, International Journal of Computer Vision, vo. 1, no 4, pp 321-331 Available: https://doi.org/10.1007/bf00133570 Accessed 20 Jun 2021
Comaniciu D, Meer P (2002) Mean shift: a robust approach toward feature space analysis. IEEE Transactions on pattern analysis and machine intelligence, vol 24, no 5, pp 603-619, Available: https://doi.org/10.1109/34.1000236 Accessed 18 Mar 2021
Osher S, Sethian J (1988) Fronts propagating with curvature-dependent speed: algorithms based on Hamilton-Jacobi formulations. Journal of Computational Physics, vol 79, no 1, pp 12-49, Available: https://doi.org/10.1016/0021-9991(88)900022 Accessed 24 Mar 2021
Lecun Y, Bengio Y (1995) Convolutional networks for images, speech, and time series. The handbook of brain theory and neural networks, pp 255-258 Accessed: 25 Mar 2021
Stehling M, Turner R, Mansfield P (1991) Echo-planar imaging: magnetic resonance imaging in a fraction of a second. Science, vol 254, no 5028, pp 43-50 Available: https://doi.org/10.1126/science.1925560Accessed: 8 Apr 2021
McGibney G, Smith M, Nichols S, Crawley A (1993) Quantitative evaluation of several partial Fourier reconstruction algorithms used in MRI. Magnetic Resonance in Medicine, vol 30, no 1, pp 51-59 Available: https://doi.org/10.1002/mrm.1910300109 Accessed 10 Apr 2021
Yang G et al (2018) DAGAN: deep de-aliasing generative adversarial networks for fast compressed sensing MRI reconstruction. IEEE Transactions on Medical Imaging, vol. 37, no. 6, pp 1310-1321 Available: https://doi.org/10.1109/tmi.2017.2785879 Accessed 18 Apr 2021
VerSe‘19 - Grand Challenge, grand-challenge.org, 2021. [Online]. Available: https://verse2019.grand-challenge.org Accessed: 20 Apr 2021
https://www.socr.umich.edu Brain Viewer Webapp, Socr.umich.edu, 2021. [Online]. Available: https://socr.umich.edu/HTML5/BrainViewer Accessed: 20 Apr 2021
Wang X et al (2018) ESRGAN: enhanced super-resolution generative adversarial networks European conference on computer vision vol 2018 Accessed 23 Apr 2021
Ledig C, Theis L, Huszar F, Caballero J, Cunningham A, Acosta A, Shi W (2017) Photo-realistic single image super-resolution using a generative adversarial network. IEEE Conference on Computer Vision and Pattern Recognition (CVPR). https://doi.org/10.1109/cvpr.2017.19
Zhao Y et al (2020) Fine-grained facial image-to-image translation with an attention-based pipeline generative adversarial framework. Multimedia Tools and Applications, vol. 79, no. 21-22 pp. 14981-15000 Available: https://doi.org/10.1007/s11042-019-08346-x Accessed 23 Apr 2021
Singh N, Raza K (2021) Medical image generation using generative adversarial networks: a review health informatics: a computational perspective in healthcare, pp. 77-96 Available: https://doi.org/10.1007/978-981-15-9735-0-5 Accessed 24 Apr 2021
Radford A, Metz L (2016) Unsupervised representation learning with deep convolutional generative adversarial networks. International Conference on Learning Representations, vol 2 Accessed 24 Apr 2021
Denton E, Chintala S, Szlam A, Fergus R (2015) Deep generative image models using a laplacian pyramid of adversarial networks. Advances in Neural Information Processing Systems 28 (NIPS 2015)
Sheng M, Ma Z, Jia H, Mao Q, Dong M (2020) Face aging with conditional generative adversarial network guided by ranking-CNN. 2020 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR). https://doi.org/10.1109/mipr49039.2020.00071
Zhu J, Park T, Isola P, Efros A (2017) Unpaired image-to-image translation using cycle-consistent adversarial networks 2017 IEEE International Conference on Computer Vision (ICCV) Available: https://doi.org/10.1109/iccv.2017.244 Accessed 25 Apr 2021
Kang T, Lee KH (2020) Unsupervised image-to-image translation with self-attention networks. IEEE International Conference on Big Data and Smart Computing (BigComp). https://doi.org/10.1109/bigcomp48618.2020.00-92
Brehm S, Scherer S, Lienhart R (2022) Semantically consistent image-to-image translation for unsupervised domain adaptation. Proceedings of the 14th International Conference on Agents and Artificial Intelligence. https://doi.org/10.5220/0010786000003116
Zhou Y, Yang Z, Zhang H, Chang EI, Fan Y, Xu Y (2022) 3D segmentation guided style-based generative adversarial networks for pet synthesis. IEEE Transactions on Medical Imaging 41(8):2092–2104. https://doi.org/10.1109/tmi.2022.3156614
Liao Y, Huang Y (2022) Deep learning-based application of image style transfer. Mathematical Problems in Engineering 1–10. https://doi.org/10.1155/2022/1693892
Wang Y (2020) A mathematical introduction to generative adversarial nets (GAN) arXiv e-prints Accessed 29 Apr 2021
Grover P (2021) 5 regression loss functions all machine learners should know medium [online] Available: https://heartbeat.fritz.ai/5-regression-loss-functions-all-machine-learners-should-know-4fb140e9d4b0 Accessed 28 Apr 2021
Löffler M et al (2020) A vertebral segmentation dataset with fracture grading, Radiology: Artificial Intelligence, vol 2 no 4, p e190138 Available: https://doi.org/10.1148/ryai.2020190138 Accessed 30 Apr 2021
Sekuboyina A (2020) VerSe: a vertebrae labelling and segmentation benchmark for multi-detector CT images, arXiv e-prints. Accessed 10 Mar 2021
Gorgolewski K et al The brain imaging data structure, a format for organizing and describing outputs of neuroimaging experiments", Scientific Data, vol 3 no 1, 2016. Available: https://doi.org/10.1038/sdata.2016.44 Accessed 20 Jul 2021
Nazem F, Ghasemi F, Fassihi A, Dehnavi A (2021) 3D U-Net: A voxel-based method in binding site prediction of protein structure. Journal Of Bioinformatics And Computational Biology 19(02):2150006. https://doi.org/10.1142/s0219720021500062
Dutta A, Singh K, Anand A (2021) SpliceViNCI: Visualizing the splicing of non-canonical introns through recurrent neural networks. Journal Of Bioinformatics And Computational Biology, 19(04) 2150014. https://doi.org/10.1142/s0219720021500141
Maas A, Hannun A, Ng A (2013) Rectifier nonlinearities improve neural network, acoustic models. ICML, 30
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Khan, U., Yasin, A. Plane invariant segmentation of computed tomography images through weighted cross entropy optimized conditional GANs in compressed formats. Med Biol Eng Comput 61, 2677–2697 (2023). https://doi.org/10.1007/s11517-023-02846-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11517-023-02846-7