Using Diffusion Models to Generate Synthetic Labelled Data for Medical Image Segmentation

2310.16794

Published 5/13/2024 by Daniel Saragih, Atsuhiro Hibi, Pascal Tyrrell

Abstract

Medical image analysis has become a prominent area where machine learning has been applied. However, high quality, publicly available data is limited either due to patient privacy laws or the time and cost required for experts to annotate images. In this retrospective study, we designed and evaluated a pipeline to generate synthetic labeled polyp images for augmenting medical image segmentation models with the aim of reducing this data scarcity. In particular, we trained diffusion models on the HyperKvasir dataset, comprising 1000 images of polyps in the human GI tract from 2008 to 2016. Qualitative expert review, Fréchet Inception Distance (FID), and Multi-Scale Structural Similarity (MS-SSIM) were tested for evaluation. Additionally, various segmentation models were trained with the generated data and evaluated using Dice score and Intersection over Union. We found that our pipeline produced images more akin to real polyp images based on FID scores, and segmentation performance also showed improvements over GAN methods when trained entirely, or partially, with synthetic data, despite requiring less compute for training. Moreover, the improvement persists when tested on different datasets, showcasing the transferability of the generated images.

Overview

Researchers developed a pipeline to generate synthetic labeled polyp images for improving medical image segmentation models.
They trained diffusion models on the HyperKvasir dataset of 1000 polyp images and evaluated the generated images using qualitative expert review, Fréchet Inception Distance (FID), and Multi-Scale Structural Similarity (MS-SSIM).
Segmentation models were trained on the synthetic data and evaluated using Dice score and Intersection over Union, showing improvements over GAN-based methods.
The generated images demonstrated transferability, improving segmentation performance on different datasets.

Plain English Explanation

Medical image analysis is an important area where machine learning has been applied. However, there is often a lack of high-quality, publicly available data for training these models, either due to patient privacy laws or the time and cost required for experts to annotate the images.

To address this data scarcity, the researchers in this study designed a pipeline to generate synthetic labeled polyp images. Polyps are growths that can form in the gastrointestinal tract and are an important target for medical imaging and analysis.

The researchers trained diffusion models on a dataset of 1000 polyp images, called HyperKvasir, collected between 2008 and 2016. Diffusion models are a type of machine learning model that can generate new images that resemble the training data.

To evaluate the generated images, the researchers used several approaches. They had expert reviewers qualitatively assess the images, and they also used two objective metrics: Fréchet Inception Distance (FID) and Multi-Scale Structural Similarity (MS-SSIM). These metrics compare the generated images to real images and can indicate how similar they are.

Additionally, the researchers trained various segmentation models on the synthetic data and evaluated their performance using Dice score and Intersection over Union. These metrics measure how well the models can identify and outline the polyps in the images.

The researchers found that their pipeline produced images that were more similar to real polyp images based on the FID scores. Furthermore, the segmentation models trained on the synthetic data showed improvements over models trained on GAN-generated images, despite requiring less computing power to train.

Importantly, the improvements in segmentation performance persisted when the models were tested on different datasets, demonstrating the transferability of the generated images.

Technical Explanation

The researchers in this study designed a pipeline to generate synthetic labeled polyp images for augmenting medical image segmentation models. They trained diffusion models on the HyperKvasir dataset, which contains 1000 images of polyps in the human gastrointestinal tract collected between 2008 and 2016.
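The summary does not say which diffusion formulation the authors used. As a rough illustration of how diffusion models are trained in general, the sketch below implements the forward (noising) process of a standard denoising diffusion probabilistic model (DDPM) in plain Python. The linear schedule, its endpoints, the 1000-step horizon, and every function name here are assumptions chosen for illustration, not details taken from the paper.

```python
import math
import random


def linear_beta_schedule(num_steps, beta_start=1e-4, beta_end=0.02):
    """Noise variances beta_1..beta_T, linearly spaced (assumed schedule)."""
    if num_steps == 1:
        return [beta_start]
    step = (beta_end - beta_start) / (num_steps - 1)
    return [beta_start + i * step for i in range(num_steps)]


def cumulative_alphas(betas):
    """alpha_bar_t = product over s <= t of (1 - beta_s)."""
    alpha_bars, running = [], 1.0
    for beta in betas:
        running *= 1.0 - beta
        alpha_bars.append(running)
    return alpha_bars


def add_noise(pixels, t, alpha_bars, rng):
    """Sample x_t = sqrt(alpha_bar_t) * x_0 + sqrt(1 - alpha_bar_t) * eps."""
    signal_scale = math.sqrt(alpha_bars[t])
    noise_scale = math.sqrt(1.0 - alpha_bars[t])
    noise = [rng.gauss(0.0, 1.0) for _ in pixels]
    noisy = [signal_scale * x + noise_scale * e for x, e in zip(pixels, noise)]
    return noisy, noise


betas = linear_beta_schedule(1000)
alpha_bars = cumulative_alphas(betas)
rng = random.Random(0)

# Toy "image": four pixel values scaled to [-1, 1] (hypothetical data).
image = [0.5, -0.25, 1.0, 0.0]
noisy_early, _ = add_noise(image, 10, alpha_bars, rng)    # still close to the image
noisy_late, _ = add_noise(image, 999, alpha_bars, rng)    # almost pure noise
```

In this formulation a denoising network is trained to predict `noise` from `noisy` and the step index; generation then runs the learned process in reverse, starting from pure noise.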

To evaluate the generated images, the researchers used qualitative expert review, Fréchet Inception Distance (FID), and Multi-Scale Structural Similarity (MS-SSIM). FID compares the feature statistics of generated images with those of real images, with lower values indicating a closer match; MS-SSIM measures the structural similarity between pairs of images at several resolutions.
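To make the idea behind FID concrete, the following sketch computes a Fréchet distance between two sets of feature vectors, each modelled as a Gaussian. It is a deliberate simplification: the real FID uses features from a pretrained Inception-v3 network and full covariance matrices, whereas this version assumes diagonal covariances so that it runs with the Python standard library alone. The function names and toy feature values are hypothetical.

```python
import statistics


def feature_stats(features):
    """Per-dimension mean and (population) standard deviation."""
    columns = list(zip(*features))
    means = [statistics.fmean(col) for col in columns]
    stds = [statistics.pstdev(col) for col in columns]
    return means, stds


def frechet_distance_diagonal(real_features, generated_features):
    """FID-style score between two Gaussians with diagonal covariance.

    The general formula is
        ||mu_r - mu_g||^2 + Tr(S_r + S_g - 2 (S_r S_g)^(1/2)).
    With diagonal covariances the trace term reduces to the sum over
    dimensions of (sigma_r - sigma_g)^2.
    """
    mu_r, sd_r = feature_stats(real_features)
    mu_g, sd_g = feature_stats(generated_features)
    mean_term = sum((a - b) ** 2 for a, b in zip(mu_r, mu_g))
    cov_term = sum((a - b) ** 2 for a, b in zip(sd_r, sd_g))
    return mean_term + cov_term


# Toy 2-D "features" standing in for Inception activations (hypothetical).
real = [[0.0, 1.0], [2.0, 3.0]]
generated = [[1.0, 1.0], [3.0, 5.0]]
score = frechet_distance_diagonal(real, generated)
print(score)
```

A score of zero means the two feature sets have identical per-dimension means and spreads; larger values indicate that the generated features drift further from the real ones.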

The researchers also trained various segmentation models using the synthetic data and evaluated their performance using Dice score and Intersection over Union (IoU). These metrics measure how well the models can identify and outline the polyps in the images.
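Both segmentation metrics can be written in a few lines. The sketch below computes them for binary masks stored as nested lists of 0s and 1s. The function names, the toy masks, and the small smoothing constant `eps` (used here so that two empty masks score 1 rather than dividing by zero) are illustrative assumptions and not the authors' evaluation code.

```python
def overlap_counts(predicted, target):
    """Return (intersection, predicted foreground, target foreground) pixel counts."""
    intersection = pred_total = target_total = 0
    for pred_row, target_row in zip(predicted, target):
        for p, t in zip(pred_row, target_row):
            intersection += p * t
            pred_total += p
            target_total += t
    return intersection, pred_total, target_total


def dice_score(predicted, target, eps=1e-7):
    """Dice = 2 * |A and B| / (|A| + |B|)."""
    inter, pred_total, target_total = overlap_counts(predicted, target)
    return (2 * inter + eps) / (pred_total + target_total + eps)


def iou_score(predicted, target, eps=1e-7):
    """IoU = |A and B| / |A or B|."""
    inter, pred_total, target_total = overlap_counts(predicted, target)
    union = pred_total + target_total - inter
    return (inter + eps) / (union + eps)


# Toy 2x3 masks: 1 marks a polyp pixel, 0 marks background (hypothetical data).
predicted_mask = [[1, 1, 0],
                  [0, 1, 0]]
target_mask = [[1, 0, 0],
               [0, 1, 1]]

print(dice_score(predicted_mask, target_mask))
print(iou_score(predicted_mask, target_mask))
```

In the toy masks, each mask has three foreground pixels and two of them overlap, so Dice is roughly 0.67 and IoU roughly 0.5. The two metrics are monotonically related (Dice = 2·IoU / (1 + IoU)), which is why they usually rank models the same way even though their values differ.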

The researchers found that their pipeline produced images that were more akin to real polyp images based on the FID scores. Additionally, the segmentation models trained on the synthetic data showed improvements over models trained on GAN-generated images, despite requiring less compute for training.

Furthermore, the improvement in segmentation performance persisted when the models were tested on different datasets, demonstrating the transferability of the generated images.

Critical Analysis

The researchers acknowledge several limitations and areas for further research in their paper. One key limitation is that the study was focused on a specific type of medical image (polyp images) and may not generalize to other types of medical images or tasks.

Additionally, the researchers note that the qualitative expert review of the generated images was limited to a small number of reviewers. A more extensive evaluation with a larger panel of experts could provide more robust insights into the visual quality of the synthetic images.

While the researchers demonstrated the transferability of the generated images to different datasets, it would be valuable to further explore the limits of this transferability. Applying the pipeline to a wider range of medical imaging tasks and datasets could shed light on the broader applicability of the approach.

Furthermore, the researchers did not provide a detailed comparison of the computational efficiency of their diffusion-based approach versus other generative methods, such as GANs. A more comprehensive analysis of the tradeoffs between performance and training complexity could help researchers and practitioners make more informed choices when selecting generative models for medical image augmentation.

Conclusion

This study presents a promising pipeline for generating synthetic labeled polyp images to augment medical image segmentation models. The researchers demonstrated that their diffusion-based approach can produce images that are more realistic and more useful for training segmentation models than those from GAN-based methods, while requiring fewer computational resources.

The transferability of the generated images to different datasets suggests that this approach could have broad applications in medical image analysis, helping to address the challenge of data scarcity. Further research is needed to explore the generalizability of the pipeline to other types of medical images and tasks, as well as to provide a more detailed comparison with alternative generative techniques.

Overall, this work represents an important step forward in leveraging synthetic data to improve the performance and accessibility of medical image analysis models, with the potential to benefit both researchers and clinicians in the field.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

3D MRI Synthesis with Slice-Based Latent Diffusion Models: Improving Tumor Segmentation Tasks in Data-Scarce Regimes

Aghiles Kebaili, Jérôme Lapuyade-Lahorgue, Pierre Vera, Su Ruan

Despite the increasing use of deep learning in medical image segmentation, the limited availability of annotated training data remains a major challenge due to the time-consuming data acquisition and privacy regulations. In the context of segmentation tasks, providing both medical images and their corresponding target masks is essential. However, conventional data augmentation approaches mainly focus on image synthesis. In this study, we propose a novel slice-based latent diffusion architecture designed to address the complexities of volumetric data generation in a slice-by-slice fashion. This approach extends the joint distribution modeling of medical images and their associated masks, allowing a simultaneous generation of both under data-scarce regimes. Our approach mitigates the computational complexity and memory expensiveness typically associated with diffusion models. Furthermore, our architecture can be conditioned by tumor characteristics, including size, shape, and relative position, thereby providing a diverse range of tumor variations. Experiments on a segmentation task using the BRATS2022 confirm the effectiveness of the synthesized volumes and masks for data augmentation.

6/11/2024

eess.IV cs.CV

Generalize Polyp Segmentation via Inpainting across Diverse Backgrounds and Pseudo-Mask Refinement

Jiajian Ma, Fangqi Lu, Silin Huang, Song Wu, Zhen Li

Inpainting lesions within different normal backgrounds is a potential method of addressing the generalization problem, which is crucial for polyp segmentation models. However, seamlessly introducing polyps into complex endoscopic environments while simultaneously generating accurate pseudo-masks remains a challenge for current inpainting methods. To address these issues, we first leverage the pre-trained Stable Diffusion Inpaint and ControlNet, to introduce a robust generative model capable of inpainting polyps across different backgrounds. Secondly, we utilize the prior that synthetic polyps are confined to the inpainted region, to establish an inpainted region-guided pseudo-mask refinement network. We also propose a sample selection strategy that prioritizes well-aligned and hard synthetic cases for further model fine-tuning. Experiments demonstrate that our inpainting model outperformed baseline methods both qualitatively and quantitatively in inpainting quality. Moreover, our data augmentation strategy significantly enhances the performance of polyp segmentation models on external datasets, achieving or surpassing the level of fully supervised training benchmarks in that domain. Our code is available at https://github.com/497662892/PolypInpainter.

5/22/2024

cs.CV

An expert-driven data generation pipeline for histological images

Roberto Basla, Loris Giulivi, Luca Magri, Giacomo Boracchi

Deep Learning (DL) models have been successfully applied to many applications including biomedical cell segmentation and classification in histological images. These models require large amounts of annotated data which might not always be available, especially in the medical field where annotations are scarce and expensive. To overcome this limitation, we propose a novel pipeline for generating synthetic datasets for cell segmentation. Given only a handful of annotated images, our method generates a large dataset of images which can be used to effectively train DL instance segmentation models. Our solution is designed to generate cells of realistic shapes and placement by allowing experts to incorporate domain knowledge during the generation of the dataset.

6/4/2024

eess.IV cs.CV

Exploring Limits of Diffusion-Synthetic Training with Weakly Supervised Semantic Segmentation

Ryota Yoshihashi, Yuya Otsuka, Kenji Doi, Tomohiro Tanaka, Hirokatsu Kataoka

The advance of generative models for images has inspired various training techniques for image recognition utilizing synthetic images. In semantic segmentation, one promising approach is extracting pseudo-masks from attention maps in text-to-image diffusion models, which enables real-image-and-annotation-free training. However, the pioneering training method using the diffusion-synthetic images and pseudo-masks, i.e., DiffuMask, has limitations in terms of mask quality, scalability, and ranges of applicable domains. To overcome these limitations, this work introduces three techniques for diffusion-synthetic semantic segmentation training. First, reliability-aware robust training, originally used in weakly supervised learning, helps segmentation with insufficient synthetic mask quality. Second, we introduce prompt augmentation, data augmentation to the prompt text set to scale up and diversify training images with limited text resources. Finally, LoRA-based adaptation of Stable Diffusion enables the transfer to a distant domain, e.g., auto-driving images. Experiments in PASCAL VOC, ImageNet-S, and Cityscapes show that our method effectively closes the gap between real and synthetic training in semantic segmentation.

4/16/2024

cs.CV