Skip to main content

Open-Source Datasets for Colonoscopy Polyps and Its AI-Enabled Techniques

  • Conference paper
  • First Online:
Communication and Intelligent Systems (ICCIS 2022)

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 686))

Included in the following conference series:

  • 329 Accesses

Abstract

High-quality, open access, and free colonoscopy data act as a catalyst for the ongoing state-of-the-art (SOTA) artificial intelligence (AI) research works in the detection, localization, segmentation, and classification of various colorectal cancer findings in the colon and rectum such as polyp growth, abnormal tissues, cancerous and non-cancerous lesions, etc. This paper presents and summarizes widely used, open-source, and downloadable polyp colonoscopy datasets along with its source links. A comparative analysis of twenty-one colonoscopy datasets has been done. Copy-right-free, raw colonoscopy data (videos and images) from various medical Websites and YouTube videos are also mentioned for potential data development. Such colonoscopy datasets will further aid in the development of AI-powered software and hardware medical systems in this field and eventually help in reducing the burden of experienced gastroenterologists.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 189.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 249.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Ali S, Jha D, Ghatwary N, Realdon S, Cannizzaro R, Salem OE, Lamarque D, Daul C, Anonsen KV, Riegler MA et al (2021) PolypGen: a multi-center polyp detection and segmentation dataset for generalisability assessment. arXiv preprint arXiv:2106.04463

  2. An NS, Lan PN, Hang DV, Long DV, Trung TQ, Thuy NT, Sang DV (2022) BlazeNeo: blazing fast polyp segmentation and neoplasm detection. IEEE Access 10:43669–43684

    Article  Google Scholar 

  3. Bernal J, Real A (2021) Polyp segmentation in colonoscopy images. In: Computer-aided analysis of gastrointestinal videos. Springer, pp 171–175

    Google Scholar 

  4. Bernal J, Sánchez FJ, Fernández-Esparrach G, Gil D, Rodríguez C, Vilariño F (2015) WM-DOVA maps for accurate polyp highlighting in colonoscopy: validation vs. saliency maps from physicians. Comput Med Imaging Graph 43:99–111

    Google Scholar 

  5. Bernal J, Sánchez J, Vilarino F (2012) Towards automatic polyp detection with a polyp appearance model. Pattern Recognit 45(9):3166–3182

    Google Scholar 

  6. Bernal J, Tajkbaksh N, Sánchez FJ, Matuszewski BJ, Chen H, Yu L, Angermann Q, Romain O, Rustad B, Balasingham I, Pogorelov K, Choi S, Debard Q, Maier-Hein L, Speidel S, Stoyanov D, Brandao P, Córdova H, Sánchez-Montes C, Gurudu SR, Fernández-Esparrach G, Dray X, Liang J, Histace A (2017) Comparative validation of polyp detection methods in video colonoscopy: results from the MICCAI 2015 endoscopic vision challenge. IEEE Trans Med Imaging 36(6):1231–1249. https://doi.org/10.1109/TMI.2017.2664042

  7. Borgli H, Thambawita V, Smedsrud PH, Hicks S, Jha D, Eskeland SL, Randel KR, Pogorelov K, Lux M, Nguyen DTD, Johansen D, Griwodz C, Stensland HK, Garcia-Ceja E, Schmidt PT, Hammer HL, Riegler MA, Halvorsen P, de Lange T (2020) HyperKvasir, a comprehensive multi-class image and video dataset for gastrointestinal endoscopy. Sci Data 7(1):283. https://doi.org/10.1038/s41597-020-00622-y

  8. Fitting D, Krenzer A, Troya J, Banck M, Sudarevic B, Brand M, Böck W, Zoller WG, Rösch T, Puppe F et al (2022) A video based benchmark data set (ENDOTEST) to evaluate computer-aided polyp detection systems. Scand J Gastroenterol 1–7

    Google Scholar 

  9. Goel N, Kaur S, Gunjan D, Mahapatra S (2022) Investigating the significance of color space for abnormality detection in wireless capsule endoscopy images. Biomed Signal Process Control 75:103624

    Article  Google Scholar 

  10. Goel N, Kaur S, Gunjan D, Mahapatra S (2022) Dilated CNN for abnormality detection in wireless capsule endoscopy images. Soft Comput 26(3):1231–1247

    Article  Google Scholar 

  11. Handa P, Goel N, Indu S (2022) Datasets of wireless capsule endoscopy for AI-enabled techniques. In: Raman B, Murala S, Chowdhury A, Dhall A, Goyal P (eds) Computer vision and image processing. Springer International Publishing, Cham, pp 439–446

    Chapter  Google Scholar 

  12. Jha D, Smedsrud PH, Riegler MA, Halvorsen P, Lange TD, Johansen D, Johansen HD (2020) Kvasir-SEG: a segmented polyp dataset. In: International conference on multimedia modeling. Springer, pp 451–462

    Google Scholar 

  13. Ji GP, Chou YC, Fan DP, Chen G, Fu H, Jha D, Shao L (2021) Progressively normalized self-attention network for video polyp segmentation. In: International conference on medical image computing and computer-assisted intervention. Springer, pp 142–152

    Google Scholar 

  14. Kaur S, Goel N (2020) A dilated convolutional approach for inflammatory lesion detection using multi-scale input feature fusion (workshop paper). In: 2020 IEEE sixth international conference on multimedia big data (BigMM). IEEE, pp 386–393

    Google Scholar 

  15. Li K, Fathan MI, Patel K, Zhang T, Zhong C, Bansal A, Rastogi A, Wang JS, Wang G (2021) Colonoscopy polyp detection and classification: dataset creation and comparative evaluations. PLoS ONE 16(8):e0255809

    Article  Google Scholar 

  16. Ma Y, Chen X, Cheng K, Li Y, Sun B (2021) LDPolypVideo benchmark: a large-scale colonoscopy video dataset of diverse polyps. In: International conference on medical image computing and computer-assisted intervention. Springer, pp 387–396

    Google Scholar 

  17. Mesejo P, Pizarro D, Abergel A, Rouquette O, Beorchia S, Poincloux L, Bartoli A (2016) Computer-aided classification of gastrointestinal lesions in regular colonoscopy. IEEE Trans Med Imaging 35(9):2051–2063

    Article  Google Scholar 

  18. Misawa M, Kudo SE, Mori Y, Hotta K, Ohtsuka K, Matsuda T, Saito S, Kudo T, Baba T, Ishida F et al (2021) Development of a computer-aided detection system for colonoscopy and a publicly accessible large colonoscopy video database (with video). Gastrointest Endosc 93(4):960–967

    Google Scholar 

  19. Ngoc Lan P, An NS, Hang DV, Long DV, Trung TQ, Thuy NT, Sang DV (2021) NeoUNet: towards accurate colon polyp segmentation and neoplasm detection. In: International symposium on visual computing. Springer, pp 15–28

    Google Scholar 

  20. Nogueira-Rodríguez A, Domínguez-Carbajales R, López-Fernández H, Iglesias Á, Cubiella J, Fdez-Riverola F, Reboiro-Jato M, Glez-Peña D (2021) Deep neural networks approaches for detecting and classifying colorectal polyps. Neurocomputing 423:721–734

    Article  Google Scholar 

  21. Peña DG, Jato MR et al (2022) Dataset, polyp. https://www.iisgaliciasur.es/home/biobanco/colorectal-polyp-image-cohort-pibadb/?lang=en

  22. Pogorelov K, Randel KR, Griwodz C, Eskeland SL, de Lange T, Johansen D, Spampinato C, Dang-Nguyen DT, Lux M, Schmidt PT, Riegler M, Halvorsen P (2017) Kvasir: a multi-class image dataset for computer aided gastrointestinal disease detection. In: Proceedings of the 8th ACM on multimedia systems conference. MMSys’17. ACM, New York, NY, pp 164–169. https://doi.org/10.1145/3083187.3083212

  23. Sánchez-Peralta LF, Pagador JB, Picón A, Calderón ÁJ, Polo F, Andraka N, Bilbao R, Glover B, Saratxaga CL, Sánchez-Margallo FM (2020) Piccolo white-light and narrow-band imaging colonoscopic dataset: a performance comparative of models and datasets. Appl Sci 10(23):8501

    Article  Google Scholar 

  24. Silva J, Histace A, Romain O, Dray X, Granado B (2014) Toward embedded detection of polyps in WCE images for early diagnosis of colorectal cancer. Int J Comput Assist Radiol Surg 9(2):283–293

    Article  Google Scholar 

  25. Tajbakhsh N, Gurudu SR, Liang J (2015) Automated polyp detection in colonoscopy videos using shape and context information. IEEE Trans Med Imaging 35(2):630–644

    Article  Google Scholar 

  26. Vázquez D, Bernal J, Sánchez FJ, Fernández-Esparrach G, López AM, Romero A, Drozdzal M, Courville A (2017) A benchmark for endoluminal scene segmentation of colonoscopy images. J Healthcare Eng 2017

    Google Scholar 

  27. Wang W, Tian J, Zhang C, Luo Y, Wang X, Li J (2020) An improved deep learning approach and its applications on colonic polyp images detection. BMC Med Imaging 20(1):1–14

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Nidhi Gooel .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Mangotra, H., Handa, P., Gooel, N. (2023). Open-Source Datasets for Colonoscopy Polyps and Its AI-Enabled Techniques. In: Sharma, H., Shrivastava, V., Bharti, K.K., Wang, L. (eds) Communication and Intelligent Systems. ICCIS 2022. Lecture Notes in Networks and Systems, vol 686. Springer, Singapore. https://doi.org/10.1007/978-981-99-2100-3_6

Download citation

Publish with us

Policies and ethics