Abstract
High-quality, open access, and free colonoscopy data act as a catalyst for the ongoing state-of-the-art (SOTA) artificial intelligence (AI) research works in the detection, localization, segmentation, and classification of various colorectal cancer findings in the colon and rectum such as polyp growth, abnormal tissues, cancerous and non-cancerous lesions, etc. This paper presents and summarizes widely used, open-source, and downloadable polyp colonoscopy datasets along with its source links. A comparative analysis of twenty-one colonoscopy datasets has been done. Copy-right-free, raw colonoscopy data (videos and images) from various medical Websites and YouTube videos are also mentioned for potential data development. Such colonoscopy datasets will further aid in the development of AI-powered software and hardware medical systems in this field and eventually help in reducing the burden of experienced gastroenterologists.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Ali S, Jha D, Ghatwary N, Realdon S, Cannizzaro R, Salem OE, Lamarque D, Daul C, Anonsen KV, Riegler MA et al (2021) PolypGen: a multi-center polyp detection and segmentation dataset for generalisability assessment. arXiv preprint arXiv:2106.04463
An NS, Lan PN, Hang DV, Long DV, Trung TQ, Thuy NT, Sang DV (2022) BlazeNeo: blazing fast polyp segmentation and neoplasm detection. IEEE Access 10:43669–43684
Bernal J, Real A (2021) Polyp segmentation in colonoscopy images. In: Computer-aided analysis of gastrointestinal videos. Springer, pp 171–175
Bernal J, Sánchez FJ, Fernández-Esparrach G, Gil D, Rodríguez C, Vilariño F (2015) WM-DOVA maps for accurate polyp highlighting in colonoscopy: validation vs. saliency maps from physicians. Comput Med Imaging Graph 43:99–111
Bernal J, Sánchez J, Vilarino F (2012) Towards automatic polyp detection with a polyp appearance model. Pattern Recognit 45(9):3166–3182
Bernal J, Tajkbaksh N, Sánchez FJ, Matuszewski BJ, Chen H, Yu L, Angermann Q, Romain O, Rustad B, Balasingham I, Pogorelov K, Choi S, Debard Q, Maier-Hein L, Speidel S, Stoyanov D, Brandao P, Córdova H, Sánchez-Montes C, Gurudu SR, Fernández-Esparrach G, Dray X, Liang J, Histace A (2017) Comparative validation of polyp detection methods in video colonoscopy: results from the MICCAI 2015 endoscopic vision challenge. IEEE Trans Med Imaging 36(6):1231–1249. https://doi.org/10.1109/TMI.2017.2664042
Borgli H, Thambawita V, Smedsrud PH, Hicks S, Jha D, Eskeland SL, Randel KR, Pogorelov K, Lux M, Nguyen DTD, Johansen D, Griwodz C, Stensland HK, Garcia-Ceja E, Schmidt PT, Hammer HL, Riegler MA, Halvorsen P, de Lange T (2020) HyperKvasir, a comprehensive multi-class image and video dataset for gastrointestinal endoscopy. Sci Data 7(1):283. https://doi.org/10.1038/s41597-020-00622-y
Fitting D, Krenzer A, Troya J, Banck M, Sudarevic B, Brand M, Böck W, Zoller WG, Rösch T, Puppe F et al (2022) A video based benchmark data set (ENDOTEST) to evaluate computer-aided polyp detection systems. Scand J Gastroenterol 1–7
Goel N, Kaur S, Gunjan D, Mahapatra S (2022) Investigating the significance of color space for abnormality detection in wireless capsule endoscopy images. Biomed Signal Process Control 75:103624
Goel N, Kaur S, Gunjan D, Mahapatra S (2022) Dilated CNN for abnormality detection in wireless capsule endoscopy images. Soft Comput 26(3):1231–1247
Handa P, Goel N, Indu S (2022) Datasets of wireless capsule endoscopy for AI-enabled techniques. In: Raman B, Murala S, Chowdhury A, Dhall A, Goyal P (eds) Computer vision and image processing. Springer International Publishing, Cham, pp 439–446
Jha D, Smedsrud PH, Riegler MA, Halvorsen P, Lange TD, Johansen D, Johansen HD (2020) Kvasir-SEG: a segmented polyp dataset. In: International conference on multimedia modeling. Springer, pp 451–462
Ji GP, Chou YC, Fan DP, Chen G, Fu H, Jha D, Shao L (2021) Progressively normalized self-attention network for video polyp segmentation. In: International conference on medical image computing and computer-assisted intervention. Springer, pp 142–152
Kaur S, Goel N (2020) A dilated convolutional approach for inflammatory lesion detection using multi-scale input feature fusion (workshop paper). In: 2020 IEEE sixth international conference on multimedia big data (BigMM). IEEE, pp 386–393
Li K, Fathan MI, Patel K, Zhang T, Zhong C, Bansal A, Rastogi A, Wang JS, Wang G (2021) Colonoscopy polyp detection and classification: dataset creation and comparative evaluations. PLoS ONE 16(8):e0255809
Ma Y, Chen X, Cheng K, Li Y, Sun B (2021) LDPolypVideo benchmark: a large-scale colonoscopy video dataset of diverse polyps. In: International conference on medical image computing and computer-assisted intervention. Springer, pp 387–396
Mesejo P, Pizarro D, Abergel A, Rouquette O, Beorchia S, Poincloux L, Bartoli A (2016) Computer-aided classification of gastrointestinal lesions in regular colonoscopy. IEEE Trans Med Imaging 35(9):2051–2063
Misawa M, Kudo SE, Mori Y, Hotta K, Ohtsuka K, Matsuda T, Saito S, Kudo T, Baba T, Ishida F et al (2021) Development of a computer-aided detection system for colonoscopy and a publicly accessible large colonoscopy video database (with video). Gastrointest Endosc 93(4):960–967
Ngoc Lan P, An NS, Hang DV, Long DV, Trung TQ, Thuy NT, Sang DV (2021) NeoUNet: towards accurate colon polyp segmentation and neoplasm detection. In: International symposium on visual computing. Springer, pp 15–28
Nogueira-Rodríguez A, Domínguez-Carbajales R, López-Fernández H, Iglesias Á, Cubiella J, Fdez-Riverola F, Reboiro-Jato M, Glez-Peña D (2021) Deep neural networks approaches for detecting and classifying colorectal polyps. Neurocomputing 423:721–734
Peña DG, Jato MR et al (2022) Dataset, polyp. https://www.iisgaliciasur.es/home/biobanco/colorectal-polyp-image-cohort-pibadb/?lang=en
Pogorelov K, Randel KR, Griwodz C, Eskeland SL, de Lange T, Johansen D, Spampinato C, Dang-Nguyen DT, Lux M, Schmidt PT, Riegler M, Halvorsen P (2017) Kvasir: a multi-class image dataset for computer aided gastrointestinal disease detection. In: Proceedings of the 8th ACM on multimedia systems conference. MMSys’17. ACM, New York, NY, pp 164–169. https://doi.org/10.1145/3083187.3083212
Sánchez-Peralta LF, Pagador JB, Picón A, Calderón ÁJ, Polo F, Andraka N, Bilbao R, Glover B, Saratxaga CL, Sánchez-Margallo FM (2020) Piccolo white-light and narrow-band imaging colonoscopic dataset: a performance comparative of models and datasets. Appl Sci 10(23):8501
Silva J, Histace A, Romain O, Dray X, Granado B (2014) Toward embedded detection of polyps in WCE images for early diagnosis of colorectal cancer. Int J Comput Assist Radiol Surg 9(2):283–293
Tajbakhsh N, Gurudu SR, Liang J (2015) Automated polyp detection in colonoscopy videos using shape and context information. IEEE Trans Med Imaging 35(2):630–644
Vázquez D, Bernal J, Sánchez FJ, Fernández-Esparrach G, López AM, Romero A, Drozdzal M, Courville A (2017) A benchmark for endoluminal scene segmentation of colonoscopy images. J Healthcare Eng 2017
Wang W, Tian J, Zhang C, Luo Y, Wang X, Li J (2020) An improved deep learning approach and its applications on colonic polyp images detection. BMC Med Imaging 20(1):1–14
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Mangotra, H., Handa, P., Gooel, N. (2023). Open-Source Datasets for Colonoscopy Polyps and Its AI-Enabled Techniques. In: Sharma, H., Shrivastava, V., Bharti, K.K., Wang, L. (eds) Communication and Intelligent Systems. ICCIS 2022. Lecture Notes in Networks and Systems, vol 686. Springer, Singapore. https://doi.org/10.1007/978-981-99-2100-3_6
Download citation
DOI: https://doi.org/10.1007/978-981-99-2100-3_6
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-2099-0
Online ISBN: 978-981-99-2100-3
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)