SAM-Med3D: Towards General-purpose Segmentation Models for Volumetric Medical Images

Read original: arXiv:2310.15161 - Published 9/17/2024 by Haoyu Wang, Sizheng Guo, Jin Ye, Zhongying Deng, Junlong Cheng, Tianbin Li, Jianpin Chen, Yanzhou Su, Ziyan Huang, Yiqing Shen and 4 others

👁️

Overview

Existing medical image segmentation models are often specialized for specific tasks, making it difficult for them to generalize across different anatomical structures or imaging modalities.
This paper introduces SAM-Med3D, a general-purpose segmentation model for volumetric medical images that can accurately segment diverse anatomical structures and lesions using only a few 3D prompt points.
The researchers gathered a large-scale dataset, SA-Med3D-140K, from public and private sources, and used it to train SAM-Med3D using a two-stage procedure.
SAM-Med3D exhibits impressive performance on a wide range of medical segmentation tasks, including seen and unseen targets, across different anatomical structures, modalities, and zero-shot transferability.

Plain English Explanation

Medical imaging is a critical tool for diagnosing and treating various health conditions, but the task of accurately segmenting or outlining different structures in these images can be challenging. Existing segmentation models are often designed for specific medical tasks, such as identifying a particular organ or type of lesion. This means they may struggle to generalize and perform well on a diverse range of medical imaging scenarios.

To address this limitation, the researchers developed a new segmentation model called SAM-Med3D. Unlike specialized models, SAM-Med3D is designed to be a general-purpose tool that can accurately segment a wide variety of anatomical structures and abnormalities across different types of medical images, such as MRI, CT, or ultrasound scans.

The key to SAM-Med3D's versatility is that it only requires a few simple 3D "prompt" points from the user to know what to segment, rather than being pre-trained on a specific task. The researchers also compiled a large dataset of 3D medical images and their corresponding segmentation masks, which they used to train SAM-Med3D using a two-stage process.

When tested on a diverse range of medical segmentation tasks, SAM-Med3D demonstrated impressive performance, accurately outlining both familiar and unfamiliar structures and lesions. This suggests that the model could be a valuable tool for clinicians and researchers working with medical imaging, as it could potentially be applied to a wide variety of applications without the need to develop specialized models for each one.

Technical Explanation

The researchers developed SAM-Med3D, a general-purpose segmentation model for volumetric medical images that can accurately segment diverse anatomical structures and lesions using only a few 3D prompt points. To train SAM-Med3D, the researchers gathered a large-scale dataset, SA-Med3D-140K, which includes 22,000 3D medical images and 143,000 corresponding 3D segmentation masks from a blend of public and licensed private sources.

SAM-Med3D is characterized by its fully learnable 3D structure, which allows it to effectively process and segment volumetric medical images. The researchers trained the model using a two-stage procedure: first, they pre-trained the model on the SA-Med3D-140K dataset, and then they fine-tuned it using a smaller dataset of specific medical segmentation tasks.

The researchers comprehensively evaluated SAM-Med3D on 16 different medical segmentation datasets, covering a wide range of anatomical structures, imaging modalities, and target segmentation tasks, including both familiar and unseen scenarios. The results demonstrate the efficiency and efficacy of SAM-Med3D, as well as its promising potential for application to diverse downstream medical imaging tasks as a pre-trained model.

Critical Analysis

The researchers acknowledge several limitations and areas for further research in their paper. First, while SAM-Med3D has shown impressive performance on a wide range of medical segmentation tasks, the model's generalization capabilities may be constrained by the diversity of the training data. The researchers note that the SA-Med3D-140K dataset, while large, may not capture the full range of anatomical variations and imaging characteristics encountered in real-world clinical settings.

Additionally, the researchers indicate that further work is needed to improve the model's performance on smaller or more challenging segmentation targets, as well as to explore the integration of SAM-Med3D with other medical imaging analysis tools and workflows. The paper also suggests that investigating the model's interpretability and robustness to various types of noise or artifact in medical images could be valuable areas for future research.

Overall, the researchers have made a significant contribution to the field of medical image analysis by developing a general-purpose segmentation model that can potentially be applied to a wide range of clinical scenarios. However, as with any new technology, there are still opportunities for refinement and further validation before SAM-Med3D can be widely adopted in real-world medical practice.

Conclusion

This paper introduces SAM-Med3D, a general-purpose segmentation model for volumetric medical images that can accurately segment diverse anatomical structures and lesions using only a few 3D prompt points. The researchers developed a large-scale dataset, SA-Med3D-140K, and used it to train SAM-Med3D using a two-stage procedure.

The comprehensive evaluation of SAM-Med3D on a wide range of medical segmentation tasks demonstrates the model's efficiency, efficacy, and potential for application to diverse downstream medical imaging applications. This research represents a significant step towards the development of more versatile and clinically relevant medical image analysis tools, which could ultimately improve the diagnosis, treatment, and monitoring of various health conditions.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

👁️

New!SAM-Med3D: Towards General-purpose Segmentation Models for Volumetric Medical Images

Haoyu Wang, Sizheng Guo, Jin Ye, Zhongying Deng, Junlong Cheng, Tianbin Li, Jianpin Chen, Yanzhou Su, Ziyan Huang, Yiqing Shen, Bin Fu, Shaoting Zhang, Junjun He, Yu Qiao

Existing volumetric medical image segmentation models are typically task-specific, excelling at specific target but struggling to generalize across anatomical structures or modalities. This limitation restricts their broader clinical use. In this paper, we introduce SAM-Med3D for general-purpose segmentation on volumetric medical images. Given only a few 3D prompt points, SAM-Med3D can accurately segment diverse anatomical structures and lesions across various modalities. To achieve this, we gather and process a large-scale 3D medical image dataset, SA-Med3D-140K, from a blend of public sources and licensed private datasets. This dataset includes 22K 3D images and 143K corresponding 3D masks. Then SAM-Med3D, a promptable segmentation model characterized by the fully learnable 3D structure, is trained on this dataset using a two-stage procedure and exhibits impressive performance on both seen and unseen segmentation targets. We comprehensively evaluate SAM-Med3D on 16 datasets covering diverse medical scenarios, including different anatomical structures, modalities, targets, and zero-shot transferability to new/unseen tasks. The evaluation shows the efficiency and efficacy of SAM-Med3D, as well as its promising application to diverse downstream tasks as a pre-trained model. Our approach demonstrates that substantial medical resources can be utilized to develop a general-purpose medical AI for various potential applications. Our dataset, code, and models are available at https://github.com/uni-medical/SAM-Med3D.

9/17/2024

SAM-Med3D-MoE: Towards a Non-Forgetting Segment Anything Model via Mixture of Experts for 3D Medical Image Segmentation

Guoan Wang, Jin Ye, Junlong Cheng, Tianbin Li, Zhaolin Chen, Jianfei Cai, Junjun He, Bohan Zhuang

Volumetric medical image segmentation is pivotal in enhancing disease diagnosis, treatment planning, and advancing medical research. While existing volumetric foundation models for medical image segmentation, such as SAM-Med3D and SegVol, have shown remarkable performance on general organs and tumors, their ability to segment certain categories in clinical downstream tasks remains limited. Supervised Finetuning (SFT) serves as an effective way to adapt such foundation models for task-specific downstream tasks but at the cost of degrading the general knowledge previously stored in the original foundation model.To address this, we propose SAM-Med3D-MoE, a novel framework that seamlessly integrates task-specific finetuned models with the foundational model, creating a unified model at minimal additional training expense for an extra gating network. This gating network, in conjunction with a selection strategy, allows the unified model to achieve comparable performance of the original models in their respective tasks both general and specialized without updating any parameters of them.Our comprehensive experiments demonstrate the efficacy of SAM-Med3D-MoE, with an average Dice performance increase from 53 to 56.4 on 15 specific classes. It especially gets remarkable gains of 29.6, 8.5, 11.2 on the spinal cord, esophagus, and right hip, respectively. Additionally, it achieves 48.9 Dice on the challenging SPPIN2023 Challenge, significantly surpassing the general expert's performance of 32.3. We anticipate that SAM-Med3D-MoE can serve as a new framework for adapting the foundation model to specific areas in medical image analysis. Codes and datasets will be publicly available.

7/9/2024

FastSAM-3DSlicer: A 3D-Slicer Extension for 3D Volumetric Segment Anything Model with Uncertainty Quantification

Yiqing Shen, Xinyuan Shao, Blanca Inigo Romillo, David Dreizin, Mathias Unberath

Accurate segmentation of anatomical structures and pathological regions in medical images is crucial for diagnosis, treatment planning, and disease monitoring. While the Segment Anything Model (SAM) and its variants have demonstrated impressive interactive segmentation capabilities on image types not seen during training without the need for domain adaptation or retraining, their practical application in volumetric 3D medical imaging workflows has been hindered by the lack of a user-friendly interface. To address this challenge, we introduce FastSAM-3DSlicer, a 3D Slicer extension that integrates both 2D and 3D SAM models, including SAM-Med2D, MedSAM, SAM-Med3D, and FastSAM-3D. Building on the well-established open-source 3D Slicer platform, our extension enables efficient, real-time segmentation of 3D volumetric medical images, with seamless interaction and visualization. By automating the handling of raw image data, user prompts, and segmented masks, FastSAM-3DSlicer provides a streamlined, user-friendly interface that can be easily incorporated into medical image analysis workflows. Performance evaluations reveal that the FastSAM-3DSlicer extension running FastSAM-3D achieves low inference times of only 1.09 seconds per volume on CPU and 0.73 seconds per volume on GPU, making it well-suited for real-time interactive segmentation. Moreover, we introduce an uncertainty quantification scheme that leverages the rapid inference capabilities of FastSAM-3D for practical implementation, further enhancing its reliability and applicability in medical settings. FastSAM-3DSlicer offers an interactive platform and user interface for 2D and 3D interactive volumetric medical image segmentation, offering a powerful combination of efficiency, precision, and ease of use with SAMs. The source code and a video demonstration are publicly available at https://github.com/arcadelab/FastSAM3D_slicer.

7/18/2024

SAM3D: Zero-Shot Semi-Automatic Segmentation in 3D Medical Images with the Segment Anything Model

Trevor J. Chan, Aarush Sahni, Yijin Fang, Jie Li, Alisha Luthra, Alison Pouch, Chamith S. Rajapakse

We introduce SAM3D, a new approach to semi-automatic zero-shot segmentation of 3D images building on the existing Segment Anything Model. We achieve fast and accurate segmentations in 3D images with a four-step strategy involving: user prompting with 3D polylines, volume slicing along multiple axes, slice-wide inference with a pretrained model, and recomposition and refinement in 3D. We evaluated SAM3D performance qualitatively on an array of imaging modalities and anatomical structures and quantify performance for specific structures in abdominal pelvic CT and brain MRI. Notably, our method achieves good performance with zero model training or finetuning, making it particularly useful for tasks with a scarcity of preexisting labeled data. By enabling users to create 3D segmentations of unseen data quickly and with dramatically reduced manual input, these methods have the potential to aid surgical planning and education, diagnostic imaging, and scientific research.

8/9/2024