Skip to main content
Log in

Utilization of big data classification models in digitally enhanced optical coherence tomography for medical diagnostics

  • S.I.: Improving Healthcare outcomes using Multimedia Big Data Analytics
  • Published:
Neural Computing and Applications Aims and scope Submit manuscript

Abstract

With the advancement in modern imaging techniques like CT scan, MRI, PET scan etc., a vast amount of data is generated every day in the field of healthcare. Big data contains hidden information, which necessitates the development of intelligent systems to analyze it and extract relevant information, allowing for accurate and cost-effective decisions in the medical field. By utilizing the untapped potential of the big data available in the medical field, very precise models can be developed for the medical diagnosis of retinal diseases. Optical coherence tomography (OCT) is a non-invasive imaging test that captures different, distinctive layers of the retina and optic nerve in a living eye to map and measure their thickness, that helps diagnose various retinal disorders. With the advancement of the application of deep learning-based techniques in the field of medical sciences, the use of convolutional neural network (CNN) based approaches for disease detection is gaining popularity. While the manual examination of 3D OCT images for the diagnosis of retinal disorders requires extensive time and expert intervention, the use of CNNs provides an effective automated option that provides results with higher accuracy while also reducing the time involved in the overall process. In this paper, we have implemented the aforementioned idea by proposing OCT-CNN, a CNN architecture, that automatically classifies retinal OCT images and identifies potential disorders in a living eye. Several techniques have been employed to enhance the performance of the proposed approach, including digital enhancement of the images, dropout regularization, adaptive learning rates, and early stopping of training to attain optimal performance. The performance of the proposed OCT-CNN is evaluated on the UCSD dataset against several popular deep CNN architectures and existing state-of-the-art approaches to automatic retinal OCT classification. The proposed OCT-CNN attains the best performance on all evaluated metrics, pushing the classification accuracies to 99.28% on CNV, 99.9% on DME, 99.38% on DRUSEN, and 100% on NORMAL images, indicating its superiority over existing state-of-the-art techniques.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

Data availability

The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.

References

  1. Sun H, Liu Z, Wang G, Lian W, Ma J (2019) Intelligent analysis of medical big data based on deep learning. IEEE Access 7:142022–142037. https://doi.org/10.1109/ACCESS.2019.2942937

    Article  Google Scholar 

  2. Davenport T, Kalakota R (2019) The potential for artificial intelligence in healthcare. Future Healthc J 6(2):94–98. https://doi.org/10.7861/futurehosp.6-2-94

    Article  Google Scholar 

  3. Shafqat S, Fayyaz M, Khattak HA et al (2021) Leveraging deep learning for designing healthcare analytics heuristic for diagnostics. Neural Process Lett. https://doi.org/10.1007/s11063-021-10425-w

    Article  Google Scholar 

  4. Najafabadi MM, Villanustre F, Khoshgoftaar TM et al (2015) Deep learning applications and challenges in big data analytics. J Big Data 2:1. https://doi.org/10.1186/s40537-014-0007-7

    Article  Google Scholar 

  5. Marmor MF (2000) Mechanisms of fluid accumulation in retinal edema. In: 2nd International symposium on macular edema. Springer, Dordrecht, pp 35–45. https://doi.org/10.1007/978-94-011-4152-9_4

  6. Rasti R, Rabbani H, Mehridehnavi A, Hajizadeh F (2018) Macular OCT classification using a multi-scale convolutional neural network ensemble. IEEE Trans Med Imaging 37(4):1024–1034. https://doi.org/10.1109/TMI.2017.2780115

    Article  Google Scholar 

  7. Mathenge W (2014) Age-related macular degeneration. Community Eye Health 27(87):49–50

    Google Scholar 

  8. Romero-Aroca P (2013) Current status in diabetic macular edema treatments. World J Diabetes 4(5):165–169. https://doi.org/10.4239/wjd.v4.i5.165

    Article  Google Scholar 

  9. Huang D, Swanson EA, Lin CP, Schuman JS, Stinson WG, Chang W, Hee MR, Flotte T, Gregory K, Puliafito CA (1991) Optical coherence tomography. Science 254(5035):1178–1181

    Article  Google Scholar 

  10. Wu H, Gu X (2015) Towards dropout training for convolutional neural networks. Neural Netw 71:1–10

    Article  Google Scholar 

  11. Liu Y-Y, Chen M, Ishikawa H, Wollstein G, Schuman JS, Rehg JM (2011) Automated macular pathology diagnosis in retinal OCT images using multi-scale spatial pyramid and local binary patterns in texture and shape encoding. Med Image Anal 15(5):748–759. https://doi.org/10.1016/j.media.2011.06.005

    Article  Google Scholar 

  12. Global Burden of Disease Study (2013) Collaborators (2015) Global, regional, and national incidence, prevalence, and years lived with disability for 301 acute and chronic diseases and injuries in 188 countries, 1990–2013: a systematic analysis for the Global Burden of Disease Study 2013. Lancet 386:743–800

    Google Scholar 

  13. Litjens G, Kooi T, Bejnordi BE et al (2017) A survey on deep learning in medical image analysis. Med Image Anal 42:60–88

    Article  Google Scholar 

  14. Kermany D, Zhang K, Goldbaum M (2018) Large dataset of labeled optical coherence tomography (oct) and chest X-ray images. Mendeley Data, v3. https://doi.org/10.17632/rscbjbr9sj

  15. Liu YY, Chen M, Ishikawa H, Wollstein G, Schuman JS, Rehg JM (2011) Automated macular pathology diagnosis in retinal OCT images using multi-scale spatial pyramid and local binary patterns in texture and shape encoding. Med Image Anal 15(5):748–759

    Article  Google Scholar 

  16. Farsiu S, Chiu SJ, O’Connell RV, Folgar FA, Yuan E, Izatt JA, Toth CA (2014) Age-related eye disease study 2 ancillary spectral domain optical coherence tomography study group. Quantitative classification of eyes with and without intermediate age-related macular degeneration using optical coherence tomography. Ophthalmology 121(1):162–172. https://doi.org/10.1016/j.ophtha.2013.07.013

    Article  Google Scholar 

  17. Shi F, Cai N, Gu Y, Hu D, Ma Y, Chen Y, Chen X (2019) DeSpecNet: a CNN-based method for speckle reduction in retinal optical coherence tomography images. Phys Med Biol 64(17):175010

    Article  Google Scholar 

  18. Zhang K, Zuo W, Chen Y, Meng D, Zhang L (2017) Beyond a Gaussian denoiser: residual learning of deep CNN for image denoising. IEEE Trans Image Process 26(7):3142–3155

    Article  MathSciNet  Google Scholar 

  19. Srinivasan PP, Kim LA, Mettu PS, Cousins SW, Comer GM, Izatt JA, Farsiu S (2014) Fully automated detection of diabetic macular edema and dry age-related macular degeneration from optical coherence tomography images. Biomed Opt Express 5(10):3568–3577

    Article  Google Scholar 

  20. Fang L, Jin Y, Huang L, Guo S, Zhao G, Chen X (2019) Iterative fusion convolutional neural networks for classification of optical coherence tomography images. J Vis Commun Image Represent 59:327–333

    Article  Google Scholar 

  21. Fang L, Wang C, Li S, Rabbani H, Chen X, Liu Z (2019) Attention to lesion: lesion-aware convolutional neural network for retinal optical coherence tomography image classification. IEEE Trans Med Imaging 38(8):1959–1970

    Article  Google Scholar 

  22. Huang L, He X, Fang L, Rabbani H, Chen X (2019) Automatic classification of retinal optical coherence tomography images with layer guided convolutional neural network. IEEE Signal Process Lett 26(7):1026–1030

    Article  Google Scholar 

  23. Abirami MS, Vennila B, Suganthi K, Kawatra S, Vaishnava A (2021) Detection of choroidal neovascularization (CNV) in retina OCT images using VGG16 and DenseNet CNN. Wirel Pers Commun. https://doi.org/10.1007/s11277-021-09086-8

    Article  Google Scholar 

  24. Nandy Pal M, Roy S, Banerjee M (2021) Content based retrieval of retinal OCT scans using twin CNN. Sādhanā 46:174. https://doi.org/10.1007/s12046-021-01701-5

    Article  Google Scholar 

  25. Mishra SS, Mandal B, Puhan NB (2022) MacularNet: towards fully automated attention-based deep CNN for macular disease classification. SN Comput Sci 3:142. https://doi.org/10.1007/s42979-022-01024-0

    Article  Google Scholar 

  26. Yoo TK, Choi JY, Kim HK (2021) Feasibility study to improve deep learning in OCT diagnosis of rare retinal diseases with few-shot classification. Med Biol Eng Comput 59:401–415. https://doi.org/10.1007/s11517-021-02321-1

    Article  Google Scholar 

  27. Virdi G, Virdi T, Elkeeb A (2021) Optical coherence tomography analysis using deep learning: a synthetic data approach. Investig Ophthalmol Vis Sci 62(11):22

    Google Scholar 

  28. Liu X, Ali TK, Singh P, Shah A, McKinney SM, Ruamviboonsuk P, Turner AW, Keane PA, Chotcomwongse P, Nganthavee V, Chia M et al (2022) Deep learning to detect OCT-derived diabetic macular edema from color retinal photographs: a multicenter validation study. Ophthalmol Retina 6(5):398–410. https://doi.org/10.1016/j.oret.2021.12.021

    Article  Google Scholar 

  29. Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. Adv Neural Inf Process Syst 25:1097–1105

    Google Scholar 

  30. Scarpa G, Gargiulo M, Mazza A, Gaetano R (2018) A CNN-based fusion method for feature extraction from sentinel data. Remote Sens 10(2):236

    Article  Google Scholar 

  31. Khalajzadeh H, Mansouri M, Teshnehlab M (2014) Face recognition using convolutional neural network and simple logistic classifier. In: Soft computing in industrial applications, pp 197–207

  32. Arel I, Rose DC, Karnowski TP (2010) Deep machine learning-a new frontier in artificial intelligence research [research frontier]. IEEE Comput Intell Mag 5(4):13–18

    Article  Google Scholar 

  33. Zuiderveld K (1994) Contrast limited adaptive histogram equalization. In: Graphics gems, pp 474–485

  34. Zeiler MD, Fergus R (2014) Visualizing and understanding convolutional networks. In: European conference on computer vision, pp 818–833

  35. Agarap AF (2018) Deep learning using rectified linear units (relu). arXiv preprint. arXiv:1803.08375

  36. Rawat W, Wang Z (2017) Deep convolutional neural networks for image classification: a comprehensive review. Neural Comput 29(9):2352–2449

    Article  MathSciNet  Google Scholar 

  37. Kingma DP, Ba J (2014) Adam: a method for stochastic optimization. arXiv preprint. arXiv:1412.6980

  38. Prechelt L (2012). Early stopping—but when? In: Montavon G, Orr GB, Müller KR (eds) Neural networks: tricks of the trade. Lecture notes in computer science, vol 7700. Springer, Berlin. https://doi.org/10.1007/978-3-642-35289-8_5

  39. Yu XH, Chen GA, Cheng SX (1995) Dynamic learning rate optimization of the backpropagation algorithm. IEEE Trans Neural Netw 6(3):669–677

    Article  Google Scholar 

  40. Howard AG, Zhu M, Chen B, Kalenichenko D, Wang W, Weyand T, Andreetto M, Adam H (2017) Mobilenets: efficient convolutional neural networks for mobile vision applications. arXiv preprint. arXiv:1704.04861

  41. He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778

  42. Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. arXiv preprint. arXiv:1409.1556

  43. Szegedy C, Vanhoucke V, Ioffe S, Shlens J, Wojna Z (2016) Rethinking the inception architecture for computer vision. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2818–2826

  44. Powers DM (2020) Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation. arXiv preprint. arXiv:2010.16061

  45. Bengio Y, Grandvalet Y (2004) No unbiased estimator of the variance of k-fold cross-validation. J Mach Learn Res 5:1089–1105

    MathSciNet  Google Scholar 

Download references

Funding

Not applicable.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Priti Bansal.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Bansal, P., Harjai, N., Saif, M. et al. Utilization of big data classification models in digitally enhanced optical coherence tomography for medical diagnostics. Neural Comput & Applic 36, 225–239 (2024). https://doi.org/10.1007/s00521-022-07973-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00521-022-07973-0

Keywords

Navigation