Assessing generalisability of deep learning-based polyp detection and segmentation methods through a computer vision challenge

Ali S; Ghatwary N; Jha D; Isik-Polat E; Polat G; Yang C; Li W; Galdran A; Ballester MG; Thambawita V; Hicks S; Poudel S; Lee SW; Jin Z; Gan T; Yu C; Yan J; Yeo D; Lee H; Tomar NK; Haithami M; Ahmed A; Riegler MA; Daul C; Halvorsen P; Rittscher J; Salem OE; Lamarque D; Cannizzaro R; Realdon S; de Lange T; East JE

doi:10.1038/s41598-024-52063-x

Fulltext

Assessing generalisability of deep learning-based polyp detection and segmentation methods through a computer vision challenge

Ali S ¹ , Ghatwary N ² , Jha D ³ , Isik-Polat E ⁴ , Polat G ⁴ , Yang C ⁵ Show all authors , Li W ⁵ , Galdran A ⁶ , Ballester MG ⁶ , Thambawita V ³ , Hicks S ³ , Poudel S ⁷ , Lee SW ⁷ , Jin Z ⁸ , Gan T ⁸ , Yu C ⁹ , Yan J ¹⁰ , Yeo D ¹¹ , Lee H ¹² , Tomar NK ¹³ , Haithami M ¹⁴ , Ahmed A ¹⁵ , Riegler MA ³ , Daul C ¹⁶ , Halvorsen P ³ , Rittscher J ¹⁷ , Salem OE ¹⁸ , Lamarque D ¹⁹ , Cannizzaro R ²⁰ , Realdon S ²⁰ , de Lange T ²¹ , East JE ²²

Affiliations

¹ School of Computing, Faculty of Engineering and Physical Sciences, University of Leeds, Leeds, LS2 9JT, UK. s.s.ali@leeds.ac.uk
² Computer Engineering Department, Arab Academy for Science and Technology, Smart Village, Giza, Egypt
³ SimulaMet, 0167, Oslo, Norway
⁴ Graduate School of Informatics, Middle East Technical University, 06800, Ankara, Turkey
⁵ City University of Hong Kong, Kowloon, Hong Kong
⁶ BCN MedTech, Department of Information and Communication Technologies, Universitat Pompeu Fabra, 08018, Barcelona, Spain
⁷ Department of IT Convergence Engineering, Gachon University, Seongnam, 13120, Republic of Korea
⁸ College of Biomedical Engineering and Instrument Science, Zhejiang University, Hangzhou, 310027, China
⁹ Tsinghua Shenzhen International Graduate School, Tsinghua University, Shenzhen, 518055, China
¹⁰ Department of Automation, Tsinghua University, Beijing, 100084, China
¹¹ Smart Sensing and Diagnosis Research Division, Korea Atomic Energy Research Institute, Taejon, 34057, Republic of Korea
¹² Daegu-Gyeongbuk Medical Innovation Foundation, Medical Device Development Center, Taegu, 427724, Republic of Korea
¹³ NepAL Applied Mathematics and Informatics Institute for Research (NAAMII), Kathmandu, Nepal
¹⁴ Computer Science Department, University of Nottingham, Malaysia Campus, 43500, Semenyih, Malaysia
¹⁵ Computer Science, Edge Hill University, Lancashire, United Kingdom
¹⁶ CRAN UMR 7039, Université de Lorraine and CNRS, 54500, Vandœuvre-Lès-Nancy, France
¹⁷ Department of Engineering Science, Institute of Biomedical Engineering, University of Oxford, Oxford, OX3 7DQ, UK
¹⁸ Faculty of Medicine, University of Alexandria, Alexandria, 21131, Egypt
¹⁹ Université de Versailles St-Quentin en Yvelines, Hôpital Ambroise Paré, 9 Av. Charles de Gaulle, 92100, Boulogne-Billancourt, France
²⁰ CRO Centro Riferimento Oncologico IRCCS Aviano Italy, Via Franco Gallini, 2, 33081, Aviano, PN, Italy
²¹ Medical Department, Sahlgrenska University Hospital-Mölndal, Blå stråket 5, 413 45, Göteborg, Sweden
²² Oxford National Institute for Health Research Biomedical Research Centre, Oxford, OX4 2PG, UK

Sci Rep, 2024 Jan 23;14(1):2032.

PMID: 38263232 DOI: 10.1038/s41598-024-52063-x

Abstract

Polyps are well-known cancer precursors identified by colonoscopy. However, variability in their size, appearance, and location makes the detection of polyps challenging. Moreover, colonoscopy surveillance and removal of polyps are highly operator-dependent procedures and occur in a highly complex organ topology. There exists a high missed detection rate and incomplete removal of colonic polyps. To assist in clinical procedures and reduce missed rates, automated methods for detecting and segmenting polyps using machine learning have been achieved in past years. However, the major drawback in most of these methods is their ability to generalise to out-of-sample unseen datasets from different centres, populations, modalities, and acquisition systems. To test this hypothesis rigorously, we, together with expert gastroenterologists, curated a multi-centre and multi-population dataset acquired from six different colonoscopy systems and challenged the computational expert teams to develop robust automated detection and segmentation methods in a crowd-sourcing Endoscopic computer vision challenge. This work put forward rigorous generalisability tests and assesses the usability of devised deep learning methods in dynamic and actual clinical colonoscopy procedures. We analyse the results of four top performing teams for the detection task and five top performing teams for the segmentation task. Our analyses demonstrate that the top-ranking teams concentrated mainly on accuracy over the real-time performance required for clinical applicability. We further dissect the devised methods and provide an experiment-based hypothesis that reveals the need for improved generalisability to tackle diversity present in multi-centre datasets and routine clinical procedures.

* Title and MeSH Headings from MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine.

MeSH terms

Similar publications