Zero-shot 3D Segmentation of Abdominal Organs in CT Scans Using Segment Anything Model 2: Adapting Video Tracking Capabilities for 3D Medical Imaging

Read original: arXiv:2408.06170 - Published 9/25/2024 by Yosuke Yamagishi, Shouhei Hanaoka, Tomohiro Kikuchi, Takahiro Nakao, Yuta Nakamura, Yukihiro Nomura, Soichiro Miki, Takeharu Yoshikawa, Osamu Abe
Total Score

0

📈

Sign in to get full access

or

If you already have an account, we'll log you in

Overview

  • This study evaluated the zero-shot performance of the Segment Anything Model 2 (SAM 2) in segmenting abdominal organs from CT scans.
  • The researchers used a subset of the TotalSegmentator CT dataset to assess SAM 2's ability to segment 8 different abdominal organs.
  • Segmentation was initiated from three different Z-coordinate levels of each organ, and performance was measured using the Dice similarity coefficient (DSC).
  • The researchers also analyzed organ volumes to provide context for the results.

Plain English Explanation

The researchers wanted to see how well the Segment Anything Model 2 (SAM 2) could segment, or outline, different organs in CT scans of the abdomen without any additional training. CT scans are 3D images of the body that doctors use to diagnose and monitor medical conditions.

The researchers used a dataset of 123 CT scans from 8 different hospitals to test SAM 2's performance on 8 abdominal organs: the liver, left and right kidneys, spleen, gallbladder, pancreas, and adrenal glands. They started the segmentation process at three different heights (or Z-coordinates) within the scans to see how that affected the results.

The researchers measured the accuracy of the segmentation using a metric called the Dice similarity coefficient (DSC). DSC values range from 0 to 1, with 1 indicating a perfect match between the model's segmentation and the ground truth.

The results showed that SAM 2 performed well on larger organs with clear boundaries, like the liver, kidneys, and spleen, achieving high DSC scores. However, it struggled more with smaller or less defined structures, like the gallbladder, pancreas, and adrenal glands.

The researchers also found that the starting point of the segmentation (the initial slice) made a significant difference in the results for different organs. Additionally, there was a lot of variability in the model's performance, with DSC values ranging from near 0 to almost 1 for the same organs across different scans.

Overall, the study demonstrates that SAM 2 has promising zero-shot capabilities for segmenting certain abdominal organs in CT scans, but it still has room for improvement, especially when it comes to smaller and less distinct structures.

Technical Explanation

The researchers used a zero-shot approach to evaluate the performance of the Segment Anything Model 2 (SAM 2) in segmenting 8 different abdominal organs from CT scans.

They leveraged a subset of the TotalSegmentator CT dataset (n=123) from 8 different institutions to assess SAM 2's ability to segment the liver, left kidney, right kidney, spleen, gallbladder, pancreas, and adrenal glands.

Segmentation was initiated from three different Z-coordinate levels (caudal, mid, and cranial) of each organ. Performance was measured using the Dice similarity coefficient (DSC), which quantifies the overlap between the model's segmentation and the ground truth. The researchers also analyzed organ volumes to provide context for the results.

The findings showed that larger organs with clear boundaries, such as the liver (mean(median) DSC: 0.821(0.898)), left kidney (0.870(0.921)), right kidney (0.862(0.935)), and spleen (0.891(0.932)), demonstrated high segmentation performance in the zero-shot setting.

However, smaller or less defined structures, like the gallbladder (0.531(0.590)), pancreas (0.361(0.359)), and adrenal glands (0.203-0.308(0.109-0.231)), exhibited lower performance. Significant differences in DSC were observed depending on the starting initial slice of segmentation for different organs.

A moderate positive correlation was found between organ volume and DSC (Spearman's rs = 0.731, P <.001 at the caudal level), indicating that larger organs were easier for the model to segment. The results also showed high variability in DSC values within organs, ranging from near 0 to almost 1.0, suggesting substantial inconsistency in segmentation performance between scans.

Critical Analysis

The study demonstrates the promising zero-shot capabilities of SAM 2 in segmenting certain abdominal organs from CT scans, particularly larger structures with clear boundaries. This highlights the model's potential for cross-domain generalization in medical imaging, where it can be applied to new tasks without the need for additional training.

However, the researchers also acknowledge the limitations of the model's performance, particularly for smaller and less defined organs. This suggests that further improvements are needed to enhance the model's ability to accurately segment a wider range of anatomical structures in medical imaging.

Additionally, the study's finding of significant differences in segmentation performance based on the starting slice of the segmentation process raises questions about the model's robustness and consistency across different scan regions. This variability in results may be an important consideration when deploying such models in clinical settings.

While the study provides valuable insights into the zero-shot capabilities of SAM 2, further research is needed to explore the model's applicability in more diverse medical imaging scenarios, as well as to investigate potential strategies for improving its performance on challenging anatomical structures.

Conclusion

This study evaluated the zero-shot performance of the Segment Anything Model 2 (SAM 2) in segmenting abdominal organs from CT scans. The results demonstrate that SAM 2 can effectively segment larger organs with clear boundaries, such as the liver, kidneys, and spleen, without any additional training.

However, the model struggled more with smaller or less defined structures, like the gallbladder, pancreas, and adrenal glands. The study also highlighted the importance of the starting point of the segmentation process and the substantial variability in the model's performance across different scans.

Overall, the findings suggest that SAM 2 has promising zero-shot capabilities for certain medical imaging tasks, but further improvements are needed to enhance its performance on a wider range of anatomical structures. This study provides valuable insights into the potential and limitations of this advanced segmentation model in the context of 3D medical imaging.



This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

📈

Total Score

0

Zero-shot 3D Segmentation of Abdominal Organs in CT Scans Using Segment Anything Model 2: Adapting Video Tracking Capabilities for 3D Medical Imaging

Yosuke Yamagishi, Shouhei Hanaoka, Tomohiro Kikuchi, Takahiro Nakao, Yuta Nakamura, Yukihiro Nomura, Soichiro Miki, Takeharu Yoshikawa, Osamu Abe

Objectives: To evaluate the zero-shot performance of Segment Anything Model 2 (SAM 2) in 3D segmentation of abdominal organs in CT scans, and to investigate the effects of prompt settings on segmentation results. Materials and Methods: In this retrospective study, we used a subset of the TotalSegmentator CT dataset from eight institutions to assess SAM 2's ability to segment eight abdominal organs. Segmentation was initiated from three different z-coordinate levels (caudal, mid, and cranial levels) of each organ. Performance was measured using the Dice similarity coefficient (DSC). We also analyzed the impact of negative prompts, which explicitly exclude certain regions from the segmentation process, on accuracy. Results: 123 patients (mean age, 60.7 pm 15.5 years; 63 men, 60 women) were evaluated. As a zero-shot approach, larger organs with clear boundaries demonstrated high segmentation performance, with mean DSCs as follows: liver 0.821 pm 0.192, right kidney 0.862 pm 0.212, left kidney 0.870 pm 0.154, and spleen 0.891 pm 0.131. Smaller organs showed lower performance: gallbladder 0.531 pm 0.291, pancreas 0.361 pm 0.197, and adrenal glands, right 0.203 pm 0.222, left 0.308 pm 0.234. The initial slice for segmentation and the use of negative prompts significantly influenced the results. By removing negative prompts from the input, the DSCs significantly decreased for six organs. Conclusion: SAM 2 demonstrated promising zero-shot performance in segmenting certain abdominal organs in CT scans, particularly larger organs. Performance was significantly influenced by input negative prompts and initial slice selection, highlighting the importance of optimizing these factors.

Read more

9/25/2024

SAM3D: Zero-Shot Semi-Automatic Segmentation in 3D Medical Images with the Segment Anything Model
Total Score

0

SAM3D: Zero-Shot Semi-Automatic Segmentation in 3D Medical Images with the Segment Anything Model

Trevor J. Chan, Aarush Sahni, Yijin Fang, Jie Li, Alisha Luthra, Alison Pouch, Chamith S. Rajapakse

We introduce SAM3D, a new approach to semi-automatic zero-shot segmentation of 3D images building on the existing Segment Anything Model. We achieve fast and accurate segmentations in 3D images with a four-step strategy involving: user prompting with 3D polylines, volume slicing along multiple axes, slice-wide inference with a pretrained model, and recomposition and refinement in 3D. We evaluated SAM3D performance qualitatively on an array of imaging modalities and anatomical structures and quantify performance for specific structures in abdominal pelvic CT and brain MRI. Notably, our method achieves good performance with zero model training or finetuning, making it particularly useful for tasks with a scarcity of preexisting labeled data. By enabling users to create 3D segmentations of unseen data quickly and with dramatically reduced manual input, these methods have the potential to aid surgical planning and education, diagnostic imaging, and scientific research.

Read more

8/9/2024

Novel adaptation of video segmentation to 3D MRI: efficient zero-shot knee segmentation with SAM2
Total Score

0

Novel adaptation of video segmentation to 3D MRI: efficient zero-shot knee segmentation with SAM2

Andrew Seohwan Yu, Mohsen Hariri, Xuecen Zhang, Mingrui Yang, Vipin Chaudhary, Xiaojuan Li

Intelligent medical image segmentation methods are rapidly evolving and being increasingly applied, yet they face the challenge of domain transfer, where algorithm performance degrades due to different data distributions between source and target domains. To address this, we introduce a method for zero-shot, single-prompt segmentation of 3D knee MRI by adapting Segment Anything Model 2 (SAM2), a general-purpose segmentation model designed to accept prompts and retain memory across frames of a video. By treating slices from 3D medical volumes as individual video frames, we leverage SAM2's advanced capabilities to generate motion- and spatially-aware predictions. We demonstrate that SAM2 can efficiently perform segmentation tasks in a zero-shot manner with no additional training or fine-tuning, accurately delineating structures in knee MRI scans using only a single prompt. Our experiments on the Osteoarthritis Initiative Zuse Institute Berlin (OAI-ZIB) dataset reveal that SAM2 achieves high accuracy on 3D knee bone segmentation, with a testing Dice similarity coefficient of 0.9643 on tibia. We also present results generated using different SAM2 model sizes, different prompt schemes, as well as comparative results from the SAM1 model deployed on the same dataset. This breakthrough has the potential to revolutionize medical image analysis by providing a scalable, cost-effective solution for automated segmentation, paving the way for broader clinical applications and streamlined workflows.

Read more

8/12/2024

📈

Total Score

0

Zero-Shot Surgical Tool Segmentation in Monocular Video Using Segment Anything Model 2

Ange Lou, Yamin Li, Yike Zhang, Robert F. Labadie, Jack Noble

The Segment Anything Model 2 (SAM 2) is the latest generation foundation model for image and video segmentation. Trained on the expansive Segment Anything Video (SA-V) dataset, which comprises 35.5 million masks across 50.9K videos, SAM 2 advances its predecessor's capabilities by supporting zero-shot segmentation through various prompts (e.g., points, boxes, and masks). Its robust zero-shot performance and efficient memory usage make SAM 2 particularly appealing for surgical tool segmentation in videos, especially given the scarcity of labeled data and the diversity of surgical procedures. In this study, we evaluate the zero-shot video segmentation performance of the SAM 2 model across different types of surgeries, including endoscopy and microscopy. We also assess its performance on videos featuring single and multiple tools of varying lengths to demonstrate SAM 2's applicability and effectiveness in the surgical domain. We found that: 1) SAM 2 demonstrates a strong capability for segmenting various surgical videos; 2) When new tools enter the scene, additional prompts are necessary to maintain segmentation accuracy; and 3) Specific challenges inherent to surgical videos can impact the robustness of SAM 2.

Read more

8/6/2024