PolypDB: A Curated Multi-Center Dataset for Development of AI Algorithms in Colonoscopy

Read original: arXiv:2409.00045 - Published 9/4/2024 by Debesh Jha, Nikhil Kumar Tomar, Vanshali Sharma, Quoc-Huy Trinh, Koushik Biswas, Hongyi Pan, Ritika K. Jha, Gorkem Durak, Alexander Hann, Jonas Varkey and 14 others
Total Score

0

PolypDB: A Curated Multi-Center Dataset for Development of AI Algorithms in Colonoscopy

Sign in to get full access

or

If you already have an account, we'll log you in

Overview

  • PolypDB is a curated multi-center dataset for developing AI algorithms in colonoscopy
  • It contains over 14,000 colonoscopy images from multiple healthcare centers
  • The dataset aims to support the development and evaluation of AI models for polyp detection and segmentation

Plain English Explanation

PolypDB is a collection of colonoscopy images that researchers can use to train and test artificial intelligence (AI) algorithms. These algorithms are designed to help doctors identify and analyze polyps, which are growths in the colon that can potentially turn into cancer if not removed.

The dataset contains over 14,000 images from multiple healthcare centers around the world. This diversity helps ensure the AI models work well in different medical settings. The researchers who created PolypDB hope it will accelerate the development of more accurate and reliable AI tools for colonoscopy, which can assist doctors in detecting and treating polyps early on.

Technical Explanation

The PolypDB dataset includes 14,225 colonoscopy images from 10 different healthcare centers in 5 countries. Each image is annotated with polyp outlines and classifications. The dataset covers a wide range of polyp types, sizes, and locations within the colon.

To create this dataset, the researchers curated and consolidated colonoscopy images from multiple medical institutions. They developed automated and manual processes to clean the data, remove low-quality images, and annotate the polyps. The resulting dataset is intended to support the development and evaluation of AI algorithms for polyp detection, segmentation, and classification.

Critical Analysis

The PolypDB dataset provides a valuable resource for advancing AI research in colonoscopy. By aggregating data from multiple centers, it captures real-world diversity that can help ensure AI models perform well in clinical practice.

However, the paper acknowledges some limitations. For example, the dataset may not fully represent the distribution of polyp characteristics seen in the general population. Additionally, the annotation process, while rigorous, could still contain some errors or inconsistencies.

Further research is needed to assess the suitability of PolypDB for specific AI development tasks, such as generalization to new clinical settings or robustness to rare polyp types. Continued collaboration and data sharing between the medical and AI research communities will be crucial for advancing the field.

Conclusion

The PolypDB dataset represents an important step forward in the development of AI algorithms for colonoscopy. By providing a large, diverse, and well-annotated dataset, it enables researchers to train and evaluate more accurate and reliable models for polyp detection and analysis.

The widespread adoption and use of PolypDB has the potential to improve the efficiency and effectiveness of colonoscopy procedures, ultimately leading to better patient outcomes and reduced healthcare costs. As the field of AI in medical imaging continues to evolve, datasets like PolypDB will play a crucial role in driving progress and improving patient care.



This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

PolypDB: A Curated Multi-Center Dataset for Development of AI Algorithms in Colonoscopy
Total Score

0

PolypDB: A Curated Multi-Center Dataset for Development of AI Algorithms in Colonoscopy

Debesh Jha, Nikhil Kumar Tomar, Vanshali Sharma, Quoc-Huy Trinh, Koushik Biswas, Hongyi Pan, Ritika K. Jha, Gorkem Durak, Alexander Hann, Jonas Varkey, Hang Viet Dao, Long Van Dao, Binh Phuc Nguyen, Khanh Cong Pham, Quang Trung Tran, Nikolaos Papachrysos, Brandon Rieders, Peter Thelin Schmidt, Enrik Geissler, Tyler Berzin, P{aa}l Halvorsen, Michael A. Riegler, Thomas de Lange, Ulas Bagci

Colonoscopy is the primary method for examination, detection, and removal of polyps. Regular screening helps detect and prevent colorectal cancer at an early curable stage. However, challenges such as variation among the endoscopists' skills, bowel quality preparation, and complex nature of the large intestine which cause large number of polyp miss-rate. These missed polyps can develop into cancer later on, which underscores the importance of improving the detection methods. A computer-aided diagnosis system can support physicians by assisting in detecting overlooked polyps. However, one of the important challenges for developing novel deep learning models for automatic polyp detection and segmentation is the lack of publicly available, multi-center large and diverse datasets. To address this gap, we introduce PolypDB, a large scale publicly available dataset that contains 3934 still polyp images and their corresponding ground truth from real colonoscopy videos to design efficient polyp detection and segmentation architectures. The dataset has been developed and verified by a team of 10 gastroenterologists. PolypDB comprises of images from five modalities: Blue Light Imaging (BLI), Flexible Imaging Color Enhancement (FICE), Linked Color Imaging (LCI), Narrow Band Imaging (NBI), and White Light Imaging (WLI) and three medical centers from Norway, Sweden and Vietnam. Thus, we split the dataset based on modality and medical center for modality-wise and center-wise analysis. We provide a benchmark on each modality using eight popular segmentation methods and six standard benchmark polyp detection methods. Furthermore, we also provide benchmark on center-wise under federated learning settings. Our dataset is public and can be downloaded at url{https://osf.io/pr7ms/}.

Read more

9/4/2024

👀

Total Score

0

Validating polyp and instrument segmentation methods in colonoscopy through Medico 2020 and MedAI 2021 Challenges

Debesh Jha, Vanshali Sharma, Debapriya Banik, Debayan Bhattacharya, Kaushiki Roy, Steven A. Hicks, Nikhil Kumar Tomar, Vajira Thambawita, Adrian Krenzer, Ge-Peng Ji, Sahadev Poudel, George Batchkala, Saruar Alam, Awadelrahman M. A. Ahmed, Quoc-Huy Trinh, Zeshan Khan, Tien-Phat Nguyen, Shruti Shrestha, Sabari Nathan, Jeonghwan Gwak, Ritika K. Jha, Zheyuan Zhang, Alexander Schlaefer, Debotosh Bhattacharjee, M. K. Bhuyan, Pradip K. Das, Deng-Ping Fan, Sravanthi Parsa, Sharib Ali, Michael A. Riegler, P{aa}l Halvorsen, Thomas De Lange, Ulas Bagci

Automatic analysis of colonoscopy images has been an active field of research motivated by the importance of early detection of precancerous polyps. However, detecting polyps during the live examination can be challenging due to various factors such as variation of skills and experience among the endoscopists, lack of attentiveness, and fatigue leading to a high polyp miss-rate. Deep learning has emerged as a promising solution to this challenge as it can assist endoscopists in detecting and classifying overlooked polyps and abnormalities in real time. In addition to the algorithm's accuracy, transparency and interpretability are crucial to explaining the whys and hows of the algorithm's prediction. Further, most algorithms are developed in private data, closed source, or proprietary software, and methods lack reproducibility. Therefore, to promote the development of efficient and transparent methods, we have organized the Medico automatic polyp segmentation (Medico 2020) and MedAI: Transparency in Medical Image Segmentation (MedAI 2021) competitions. We present a comprehensive summary and analyze each contribution, highlight the strength of the best-performing methods, and discuss the possibility of clinical translations of such methods into the clinic. For the transparency task, a multi-disciplinary team, including expert gastroenterologists, accessed each submission and evaluated the team based on open-source practices, failure case analysis, ablation studies, usability and understandability of evaluations to gain a deeper understanding of the models' credibility for clinical deployment. Through the comprehensive analysis of the challenge, we not only highlight the advancements in polyp and surgical instrument segmentation but also encourage qualitative evaluation for building more transparent and understandable AI-based colonoscopy systems.

Read more

5/8/2024

📶

Total Score

0

Polyp segmentation in colonoscopy images using DeepLabV3++

Al Mohimanul Islam, Sadia Shakiba Bhuiyan, Mysun Mashira, Md. Rayhan Ahmed, Salekul Islam, Swakkhar Shatabda

Segmenting polyps in colonoscopy images is essential for the early identification and diagnosis of colorectal cancer, a significant cause of worldwide cancer deaths. Prior deep learning based models such as Attention based variation, UNet variations and Transformer-derived networks have had notable success in capturing intricate features and complex polyp shapes. In this study, we have introduced the DeepLabv3++ model which is an enhanced version of the DeepLabv3+ architecture. It is designed to improve the precision and robustness of polyp segmentation in colonoscopy images. We have utilized The proposed model incorporates diverse separable convolutional layers and attention mechanisms within the MSPP block, enhancing its capacity to capture multi-scale and directional features. Additionally, the redesigned decoder further transforms the extracted features from the encoder into a more meaningful segmentation map. Our model was evaluated on three public datasets (CVC-ColonDB, CVC-ClinicDB, Kvasir-SEG) achieving Dice coefficient scores of 96.20%, 96.54%, and 96.08%, respectively. The experimental analysis shows that DeepLabV3++ outperforms several state-of-the-art models in polyp segmentation tasks. Furthermore, compared to the baseline DeepLabV3+ model, our DeepLabV3++ with its MSPP module and redesigned decoder architecture, significantly reduced segmentation errors (e.g., false positives/negatives) across small, medium, and large polyps. This improvement in polyp delineation is crucial for accurate clinical decision-making in colonoscopy.

Read more

7/30/2024

Deep Bayesian segmentation for colon polyps: Well-calibrated predictions in medical imaging
Total Score

0

Deep Bayesian segmentation for colon polyps: Well-calibrated predictions in medical imaging

Daniela L. Ramos, Hector J. Hortua

Colorectal polyps are generally benign alterations that, if not identified promptly and managed successfully, can progress to cancer and cause affectations on the colon mucosa, known as adenocarcinoma. Today advances in Deep Learning have demonstrated the ability to achieve significant performance in image classification and detection in medical diagnosis applications. Nevertheless, these models are prone to overfitting, and making decisions based only on point estimations may provide incorrect predictions. Thus, to obtain a more informed decision, we must consider point estimations along with their reliable uncertainty quantification. In this paper, we built different Bayesian neural network approaches based on the flexibility of posterior distribution to develop semantic segmentation of colorectal polyp images. We found that these models not only provide state-of-the-art performance on the segmentation of this medical dataset but also, yield accurate uncertainty estimates. We applied multiplicative normalized flows(MNF) and reparameterization trick on the UNET, FPN, and LINKNET architectures tested with multiple backbones in deterministic and Bayesian versions. We report that the FPN + EfficientnetB7 architecture with MNF is the most promising option given its IOU of 0.94 and Expected Calibration Error (ECE) of 0.004, combined with its superiority in identifying difficult-to-detect colorectal polyps, which is effective in clinical areas where early detection prevents the development of colon cancer.

Read more

7/24/2024