Iris-SAM: Iris Segmentation Using a Foundation Model

Read original: arXiv:2402.06497 - Published 6/3/2024 by Parisa Farmanifard, Arun Ross

📈

Overview

Iris segmentation is a crucial step in iris biometric systems, which involves extracting the annular iris region from an eye image.
This work presents a pixel-level iris segmentation model built upon the Segment Anything Model (SAM), a foundational model for segmenting arbitrary objects.
The key contribution is the integration of different loss functions, particularly the Focal Loss, during the fine-tuning of SAM on ocular images to address the class imbalance problem (iris vs. non-iris pixels).
The model is evaluated on multiple datasets, demonstrating its effectiveness in iris segmentation, achieving high accuracy compared to baseline approaches.

Plain English Explanation

Iris segmentation is an important step in iris biometric systems, which are used to identify people based on the unique patterns in their eyes. This work develops a new model for automatically separating the iris (the colored part of the eye) from the rest of the eye in an image.

The researchers started with a powerful model called the Segment Anything Model (SAM), which can segment any object in an image. They then fine-tuned this model specifically for iris segmentation by using a special type of loss function called Focal Loss. This loss function helps the model better handle the imbalance between the number of iris and non-iris pixels in the images.

The researchers tested their fine-tuned model on several datasets of eye images and found that it achieved very high accuracy in separating the iris from the rest of the eye, outperforming other existing approaches. For example, on one dataset, the model achieved an average segmentation accuracy of 99.58%, compared to the previous best of 89.75%.

This improved iris segmentation model can be a valuable tool for iris biometric systems, making them more reliable and accurate. It could also potentially be adapted to work with other types of images and segmentation tasks.

Technical Explanation

The researchers in this work developed a pixel-level iris segmentation model by fine-tuning the Segment Anything Model (SAM), a powerful model for segmenting arbitrary objects in images. The key contribution lies in the integration of different loss functions during the fine-tuning process, particularly the Focal Loss, which helps address the class imbalance problem (i.e., the disparity between the number of iris and non-iris pixels in the images).

The model was evaluated on three datasets: ND-IRIS-0405, CASIA-Iris-Interval-v3, and IIT-Delhi-Iris. On the ND-IRIS-0405 dataset, the model achieved an average segmentation accuracy of 99.58%, outperforming the previous best baseline performance of 89.75%.

Critical Analysis

The paper provides a comprehensive evaluation of the proposed iris segmentation model, demonstrating its effectiveness across multiple datasets. However, the authors do not discuss any potential limitations or caveats of their approach.

One area for further research could be exploring the model's performance on more challenging or diverse eye image datasets, particularly those with varying lighting conditions, eye angles, or image quality. Additionally, the researchers could investigate the model's robustness to different types of noise or occlusions that may occur in real-world iris biometric systems.

While the reported results are impressive, it would be valuable to understand the computational complexity and inference time of the model, as these factors can be crucial in practical applications.

Conclusion

This work presents a novel iris segmentation model that builds upon the powerful Segment Anything Model (SAM) and leverages specialized loss functions to achieve state-of-the-art performance on multiple datasets. The integration of Focal Loss during the fine-tuning process effectively addresses the class imbalance problem inherent in iris segmentation tasks.

The high accuracy demonstrated by the model, particularly on the ND-IRIS-0405 dataset, suggests that it could be a valuable tool for improving the reliability and robustness of iris biometric systems. This research also highlights the potential for adapting foundational vision models to tackle specific task-oriented challenges in computer vision.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

📈

Iris-SAM: Iris Segmentation Using a Foundation Model

Parisa Farmanifard, Arun Ross

Iris segmentation is a critical component of an iris biometric system and it involves extracting the annular iris region from an ocular image. In this work, we develop a pixel-level iris segmentation model from a foundational model, viz., Segment Anything Model (SAM), that has been successfully used for segmenting arbitrary objects. The primary contribution of this work lies in the integration of different loss functions during the fine-tuning of SAM on ocular images. In particular, the importance of Focal Loss is borne out in the fine-tuning process since it strategically addresses the class imbalance problem (i.e., iris versus non-iris pixels). Experiments on ND-IRIS-0405, CASIA-Iris-Interval-v3, and IIT-Delhi-Iris datasets convey the efficacy of the trained model for the task of iris segmentation. For instance, on the ND-IRIS-0405 dataset, an average segmentation accuracy of 99.58% was achieved, compared to the best baseline performance of 89.75%.

6/3/2024

📈

Zero-Shot Segmentation of Eye Features Using the Segment Anything Model (SAM)

Virmarie Maquiling, Sean Anthony Byrne, Diederick C. Niehorster, Marcus Nystrom, Enkelejda Kasneci

The advent of foundation models signals a new era in artificial intelligence. The Segment Anything Model (SAM) is the first foundation model for image segmentation. In this study, we evaluate SAM's ability to segment features from eye images recorded in virtual reality setups. The increasing requirement for annotated eye-image datasets presents a significant opportunity for SAM to redefine the landscape of data annotation in gaze estimation. Our investigation centers on SAM's zero-shot learning abilities and the effectiveness of prompts like bounding boxes or point clicks. Our results are consistent with studies in other domains, demonstrating that SAM's segmentation effectiveness can be on-par with specialized models depending on the feature, with prompts improving its performance, evidenced by an IoU of 93.34% for pupil segmentation in one dataset. Foundation models like SAM could revolutionize gaze estimation by enabling quick and easy image segmentation, reducing reliance on specialized models and extensive manual annotation.

4/9/2024

How to build the best medical image segmentation algorithm using foundation models: a comprehensive empirical study with Segment Anything Model

Hanxue Gu, Haoyu Dong, Jichen Yang, Maciej A. Mazurowski

Automated segmentation is a fundamental medical image analysis task, which enjoys significant advances due to the advent of deep learning. While foundation models have been useful in natural language processing and some vision tasks for some time, the foundation model developed with image segmentation in mind - Segment Anything Model (SAM) - has been developed only recently and has shown similar promise. However, there are still no systematic analyses or best-practice guidelines for optimal fine-tuning of SAM for medical image segmentation. This work summarizes existing fine-tuning strategies with various backbone architectures, model components, and fine-tuning algorithms across 18 combinations, and evaluates them on 17 datasets covering all common radiology modalities. Our study reveals that (1) fine-tuning SAM leads to slightly better performance than previous segmentation methods, (2) fine-tuning strategies that use parameter-efficient learning in both the encoder and decoder are superior to other strategies, (3) network architecture has a small impact on final performance, (4) further training SAM with self-supervised learning can improve final model performance. We also demonstrate the ineffectiveness of some methods popular in the literature and further expand our experiments into few-shot and prompt-based settings. Lastly, we released our code and MRI-specific fine-tuned weights, which consistently obtained superior performance over the original SAM, at https://github.com/mazurowski-lab/finetune-SAM.

5/14/2024

Boosting Medical Image Classification with Segmentation Foundation Model

Pengfei Gu, Zihan Zhao, Hongxiao Wang, Yaopeng Peng, Yizhe Zhang, Nishchal Sapkota, Chaoli Wang, Danny Z. Chen

The Segment Anything Model (SAM) exhibits impressive capabilities in zero-shot segmentation for natural images. Recently, SAM has gained a great deal of attention for its applications in medical image segmentation. However, to our best knowledge, no studies have shown how to harness the power of SAM for medical image classification. To fill this gap and make SAM a true ``foundation model'' for medical image analysis, it is highly desirable to customize SAM specifically for medical image classification. In this paper, we introduce SAMAug-C, an innovative augmentation method based on SAM for augmenting classification datasets by generating variants of the original images. The augmented datasets can be used to train a deep learning classification model, thereby boosting the classification performance. Furthermore, we propose a novel framework that simultaneously processes raw and SAMAug-C augmented image input, capitalizing on the complementary information that is offered by both. Experiments on three public datasets validate the effectiveness of our new approach.

6/18/2024