SAM3D: Zero-Shot Semi-Automatic Segmentation in 3D Medical Images with the Segment Anything Model

Read original: arXiv:2405.06786 - Published 8/9/2024 by Trevor J. Chan, Aarush Sahni, Yijin Fang, Jie Li, Alisha Luthra, Alison Pouch, Chamith S. Rajapakse

SAM3D: Zero-Shot Semi-Automatic Segmentation in 3D Medical Images with the Segment Anything Model

Overview

Presents a novel method called SAM3D for zero-shot semi-automatic segmentation of 3D medical images using the Segment Anything Model (SAM)
Demonstrates the capability of SAM to perform zero-shot segmentation of diverse anatomical structures in 3D medical scans without any task-specific training
Introduces interactive tools to allow users to refine the initial SAM segmentations with minimal effort

Plain English Explanation

The paper introduces a new method called SAM3D that uses the Segment Anything Model (SAM) to perform zero-shot segmentation of various anatomical structures in 3D medical images. This means the model can segment these structures without any prior training on the specific task.

The key idea is to leverage the impressive generalization capabilities of SAM, which was trained on a large and diverse set of 2D images. By applying SAM directly to 3D medical scans, the researchers show it can accurately segment a wide range of anatomical features, from organs to tissues to pathologies, without needing to retrain the model for the medical domain.

To make the segmentation process more interactive and user-friendly, the authors also introduce tools that allow clinicians and researchers to refine the initial SAM-generated segmentations with minimal effort. This "semi-automatic" approach combines the power of the zero-shot SAM model with the ability for humans to correct any mistakes or add additional details.

Technical Explanation

The SAM3D method first applies the pre-trained SAM model to 3D medical scans in a sliding-window fashion, generating segmentation predictions for each local 2D slice. These 2D segmentations are then aggregated into a final 3D segmentation using a novel fusion mechanism.

To improve the usability and performance of the zero-shot SAM segmentations, the authors introduce two key innovations:

Interactive Refinement: The system provides tools for users to easily refine the initial SAM segmentations by adding, removing, or modifying the predicted regions. This "semi-automatic" approach leverages human expertise to correct any mistakes or add additional details.
Semantic Boosting: The researchers propose a "semantic boosting" technique that fine-tunes the SAM model on a small set of manually segmented examples from the target medical domain. This step helps adapt the model to the specific characteristics of the 3D medical data, further improving the zero-shot segmentation performance.

Experiments on a variety of 3D medical datasets, including MedVista and ocular features, demonstrate the effectiveness of the SAM3D approach. The method achieves strong segmentation results without any task-specific training, and the interactive refinement tools allow users to easily correct any errors.

Critical Analysis

The paper presents a compelling approach for leveraging the power of the Segment Anything Model (SAM) to perform zero-shot segmentation of 3D medical images. The ability to accurately segment diverse anatomical structures without any prior training is a significant advancement, as it can greatly simplify the development of segmentation tools for medical applications.

That said, the authors acknowledge several limitations and areas for future work. For example, the current SAM3D method operates on a slice-by-slice basis, which may not fully capture the 3D context of the medical scans. Exploring more holistic 3D segmentation approaches could further improve the accuracy and consistency of the results.

Additionally, while the interactive refinement tools are a valuable addition, the paper does not provide a detailed evaluation of their usability and effectiveness from the perspective of end-users (e.g., clinicians). Understanding the real-world practicality and workflow integration of such tools would be an important next step.

Overall, the SAM3D method represents an exciting step forward in the application of powerful zero-shot segmentation models to the medical imaging domain. Continued research and development in this area could lead to significant improvements in the efficiency and accessibility of 3D medical image analysis.

Conclusion

The SAM3D method presented in this paper demonstrates the potential of the Segment Anything Model (SAM) to enable zero-shot segmentation of diverse anatomical structures in 3D medical images. By leveraging the impressive generalization capabilities of SAM, the researchers have developed a novel approach that can accurately segment a wide range of features without any task-specific training.

The introduction of interactive refinement tools and semantic boosting techniques further enhance the usability and performance of the zero-shot segmentations, making the SAM3D system a promising step towards more accessible and efficient 3D medical image analysis. As the field of medical imaging continues to evolve, innovations like SAM3D could have significant implications for a variety of clinical and research applications.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

SAM3D: Zero-Shot Semi-Automatic Segmentation in 3D Medical Images with the Segment Anything Model

Trevor J. Chan, Aarush Sahni, Yijin Fang, Jie Li, Alisha Luthra, Alison Pouch, Chamith S. Rajapakse

We introduce SAM3D, a new approach to semi-automatic zero-shot segmentation of 3D images building on the existing Segment Anything Model. We achieve fast and accurate segmentations in 3D images with a four-step strategy involving: user prompting with 3D polylines, volume slicing along multiple axes, slice-wide inference with a pretrained model, and recomposition and refinement in 3D. We evaluated SAM3D performance qualitatively on an array of imaging modalities and anatomical structures and quantify performance for specific structures in abdominal pelvic CT and brain MRI. Notably, our method achieves good performance with zero model training or finetuning, making it particularly useful for tasks with a scarcity of preexisting labeled data. By enabling users to create 3D segmentations of unseen data quickly and with dramatically reduced manual input, these methods have the potential to aid surgical planning and education, diagnostic imaging, and scientific research.

8/9/2024

Segment anything model 2: an application to 2D and 3D medical images

Haoyu Dong, Hanxue Gu, Yaqian Chen, Jichen Yang, Yuwen Chen, Maciej A. Mazurowski

Segment Anything Model (SAM) has gained significant attention because of its ability to segment various objects in images given a prompt. The recently developed SAM 2 has extended this ability to video inputs. This opens an opportunity to apply SAM to 3D images, one of the fundamental tasks in the medical imaging field. In this paper, we extensively evaluate SAM 2's ability to segment both 2D and 3D medical images by first collecting 21 medical imaging datasets, including surgical videos, common 3D modalities such as computed tomography (CT), magnetic resonance imaging (MRI), and positron emission tomography (PET) as well as 2D modalities such as X-ray and ultrasound. Two evaluation settings of SAM 2 are considered: (1) multi-frame 3D segmentation, where prompts are provided to one or multiple slice(s) selected from the volume, and (2) single-frame 2D segmentation, where prompts are provided to each slice. The former only applies to videos and 3D modalities, while the latter applies to all datasets. Our results show that SAM 2 exhibits similar performance as SAM under single-frame 2D segmentation, and has variable performance under multi-frame 3D segmentation depending on the choices of slices to annotate, the direction of the propagation, the predictions utilized during the propagation, etc. We believe our work enhances the understanding of SAM 2's behavior in the medical field and provides directions for future work in adapting SAM 2 to this domain. Our code is available at: https://github.com/mazurowski-lab/segment-anything2-medical-evaluation.

8/23/2024

FastSAM-3DSlicer: A 3D-Slicer Extension for 3D Volumetric Segment Anything Model with Uncertainty Quantification

Yiqing Shen, Xinyuan Shao, Blanca Inigo Romillo, David Dreizin, Mathias Unberath

Accurate segmentation of anatomical structures and pathological regions in medical images is crucial for diagnosis, treatment planning, and disease monitoring. While the Segment Anything Model (SAM) and its variants have demonstrated impressive interactive segmentation capabilities on image types not seen during training without the need for domain adaptation or retraining, their practical application in volumetric 3D medical imaging workflows has been hindered by the lack of a user-friendly interface. To address this challenge, we introduce FastSAM-3DSlicer, a 3D Slicer extension that integrates both 2D and 3D SAM models, including SAM-Med2D, MedSAM, SAM-Med3D, and FastSAM-3D. Building on the well-established open-source 3D Slicer platform, our extension enables efficient, real-time segmentation of 3D volumetric medical images, with seamless interaction and visualization. By automating the handling of raw image data, user prompts, and segmented masks, FastSAM-3DSlicer provides a streamlined, user-friendly interface that can be easily incorporated into medical image analysis workflows. Performance evaluations reveal that the FastSAM-3DSlicer extension running FastSAM-3D achieves low inference times of only 1.09 seconds per volume on CPU and 0.73 seconds per volume on GPU, making it well-suited for real-time interactive segmentation. Moreover, we introduce an uncertainty quantification scheme that leverages the rapid inference capabilities of FastSAM-3D for practical implementation, further enhancing its reliability and applicability in medical settings. FastSAM-3DSlicer offers an interactive platform and user interface for 2D and 3D interactive volumetric medical image segmentation, offering a powerful combination of efficiency, precision, and ease of use with SAMs. The source code and a video demonstration are publicly available at https://github.com/arcadelab/FastSAM3D_slicer.

7/18/2024

📈

Zero-shot 3D Segmentation of Abdominal Organs in CT Scans Using Segment Anything Model 2: Adapting Video Tracking Capabilities for 3D Medical Imaging

Yosuke Yamagishi, Shouhei Hanaoka, Tomohiro Kikuchi, Takahiro Nakao, Yuta Nakamura, Yukihiro Nomura, Soichiro Miki, Takeharu Yoshikawa, Osamu Abe

Objectives: To evaluate the zero-shot performance of Segment Anything Model 2 (SAM 2) in 3D segmentation of abdominal organs in CT scans, and to investigate the effects of prompt settings on segmentation results. Materials and Methods: In this retrospective study, we used a subset of the TotalSegmentator CT dataset from eight institutions to assess SAM 2's ability to segment eight abdominal organs. Segmentation was initiated from three different z-coordinate levels (caudal, mid, and cranial levels) of each organ. Performance was measured using the Dice similarity coefficient (DSC). We also analyzed the impact of negative prompts, which explicitly exclude certain regions from the segmentation process, on accuracy. Results: 123 patients (mean age, 60.7 pm 15.5 years; 63 men, 60 women) were evaluated. As a zero-shot approach, larger organs with clear boundaries demonstrated high segmentation performance, with mean DSCs as follows: liver 0.821 pm 0.192, right kidney 0.862 pm 0.212, left kidney 0.870 pm 0.154, and spleen 0.891 pm 0.131. Smaller organs showed lower performance: gallbladder 0.531 pm 0.291, pancreas 0.361 pm 0.197, and adrenal glands, right 0.203 pm 0.222, left 0.308 pm 0.234. The initial slice for segmentation and the use of negative prompts significantly influenced the results. By removing negative prompts from the input, the DSCs significantly decreased for six organs. Conclusion: SAM 2 demonstrated promising zero-shot performance in segmenting certain abdominal organs in CT scans, particularly larger organs. Performance was significantly influenced by input negative prompts and initial slice selection, highlighting the importance of optimizing these factors.

9/25/2024