RetinaRegNet: A Versatile Approach for Retinal Image Registration

Read original: arXiv:2404.16017 - Published 9/12/2024 by Vishal Balaji Sivaraman, Muhammad Imran, Qingyue Wei, Preethika Muralidharan, Michelle R. Tamplin, Isabella M . Grumbach, Randy H. Kardon, Jui-Kai Wang, Yuyin Zhou, Wei Shao

RetinaRegNet: A Versatile Approach for Retinal Image Registration

Overview

This paper introduces RetinaRegNet, a new approach for registering retinal images.
Retinal image registration is an important task in ophthalmology, enabling doctors to compare images over time and detect changes.
RetinaRegNet uses a deep learning-based architecture to align retinal images, potentially improving on previous registration methods.

Plain English Explanation

RetinaRegNet is a new computer system designed to help doctors compare retinal images over time. Retinal images are pictures of the back of the eye, which doctors use to check for signs of eye diseases like diabetes or glaucoma. When a patient visits the doctor multiple times, the doctor needs to compare the new retinal image to previous ones to see if anything has changed.

RetinaRegNet is a type of artificial intelligence that can automatically align, or "register", these retinal images so the doctor can easily compare them side-by-side. This is helpful because small changes in the eye can be difficult for a human to spot without the images being perfectly aligned.

The key idea behind RetinaRegNet is to use a deep learning approach, which means the system learns how to register images by analyzing many examples. This allows RetinaRegNet to be more flexible and accurate than previous registration methods that relied on predefined mathematical rules.

Technical Explanation

The core of RetinaRegNet is a convolutional neural network that takes a pair of retinal images as input and outputs the optimal transformation (e.g. translation, rotation, scaling) to align them. The network is trained on a large dataset of retinal image pairs, with the goal of learning the complex geometric relationships between corresponding features in the images.

A key innovation is the use of a multi-resolution approach, where the network operates at different scales to capture both local and global alignment. This helps the system handle challenges like varying fields of view and anatomical distortions in the retinal images.

The authors also incorporate a mask-guided component, which focuses the alignment on the relevant anatomical structures in the retina rather than irrelevant background regions.

Critical Analysis

The authors provide a thorough evaluation of RetinaRegNet, demonstrating state-of-the-art performance on standard retinal image registration benchmarks. However, the paper does not address certain limitations, such as the generalization of the system to diverse patient populations or its robustness to low-quality or atypical images.

Additionally, while the explainability of the system's decision-making is not a focus of this work, further insights into how RetinaRegNet arrives at its alignments could help build trust and understanding among clinicians.

Conclusion

Overall, RetinaRegNet represents a promising advance in retinal image registration, with the potential to streamline disease monitoring and improve patient care in ophthalmology. As the authors note, future work could explore applying the multi-resolution, mask-guided approach to other medical imaging tasks beyond the retina. Continued development and validation of such AI-powered tools can help unlock the full diagnostic and prognostic value of medical imaging data.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

RetinaRegNet: A Versatile Approach for Retinal Image Registration

Vishal Balaji Sivaraman, Muhammad Imran, Qingyue Wei, Preethika Muralidharan, Michelle R. Tamplin, Isabella M . Grumbach, Randy H. Kardon, Jui-Kai Wang, Yuyin Zhou, Wei Shao

We introduce RetinaRegNet, a zero-shot image registration model designed to register retinal images with minimal overlap, large deformations, and varying image quality. RetinaRegNet addresses these challenges and achieves robust and accurate registration through the following steps. First, we extract features from the moving and fixed images using latent diffusion models. We then sample feature points from the fixed image using a combination of the SIFT algorithm and random point sampling. For each sampled point, we identify its corresponding point in the moving image using a 2D correlation map, which computes the cosine similarity between the diffusion feature vectors of the point in the fixed image and all pixels in the moving image. Second, we eliminate most incorrectly detected point correspondences (outliers) by enforcing an inverse consistency constraint, ensuring that correspondences are consistent in both forward and backward directions. We further remove outliers with large distances between corresponding points using a global transformation based outlier detector. Finally, we implement a two-stage registration framework to handle large deformations. The first stage estimates a homography transformation to achieve global alignment between the images, while the second stage uses a third-order polynomial transformation to estimate local deformations. We evaluated RetinaRegNet on three retinal image registration datasets: color fundus images, fluorescein angiography images, and laser speckle flowgraphy images. Our model consistently outperformed state-of-the-art methods across all datasets. The accurate registration achieved by RetinaRegNet enables the tracking of eye disease progression, enhances surgical planning, and facilitates the evaluation of treatment efficacy. Our code is publicly available at: https://github.com/mirthAI/RetinaRegNet.

9/12/2024

Progressive Retinal Image Registration via Global and Local Deformable Transformations

Yepeng Liu, Baosheng Yu, Tian Chen, Yuliang Gu, Bo Du, Yongchao Xu, Jun Cheng

Retinal image registration plays an important role in the ophthalmological diagnosis process. Since there exist variances in viewing angles and anatomical structures across different retinal images, keypoint-based approaches become the mainstream methods for retinal image registration thanks to their robustness and low latency. These methods typically assume the retinal surfaces are planar, and adopt feature matching to obtain the homography matrix that represents the global transformation between images. Yet, such a planar hypothesis inevitably introduces registration errors since retinal surface is approximately curved. This limitation is more prominent when registering image pairs with significant differences in viewing angles. To address this problem, we propose a hybrid registration framework called HybridRetina, which progressively registers retinal images with global and local deformable transformations. For that, we use a keypoint detector and a deformation network called GAMorph to estimate the global transformation and local deformable transformation, respectively. Specifically, we integrate multi-level pixel relation knowledge to guide the training of GAMorph. Additionally, we utilize an edge attention module that includes the geometric priors of the images, ensuring the deformation field focuses more on the vascular regions of clinical interest. Experiments on two widely-used datasets, FIRE and FLoRI21, show that our proposed HybridRetina significantly outperforms some state-of-the-art methods. The code is available at https://github.com/lyp-deeplearning/awesome-retinal-registration.

9/4/2024

Retinal IPA: Iterative KeyPoints Alignment for Multimodal Retinal Imaging

Jiacheng Wang, Hao Li, Dewei Hu, Rui Xu, Xing Yao, Yuankai K. Tao, Ipek Oguz

We propose a novel framework for retinal feature point alignment, designed for learning cross-modality features to enhance matching and registration across multi-modality retinal images. Our model draws on the success of previous learning-based feature detection and description methods. To better leverage unlabeled data and constrain the model to reproduce relevant keypoints, we integrate a keypoint-based segmentation task. It is trained in a self-supervised manner by enforcing segmentation consistency between different augmentations of the same image. By incorporating a keypoint augmented self-supervised layer, we achieve robust feature extraction across modalities. Extensive evaluation on two public datasets and one in-house dataset demonstrates significant improvements in performance for modality-agnostic retinal feature alignment. Our code and model weights are publicly available at url{https://github.com/MedICL-VU/RetinaIPA}.

7/29/2024

ConKeD: Multiview contrastive descriptor learning for keypoint-based retinal image registration

David Rivas-Villar, 'Alvaro S. Hervella, Jos'e Rouco, Jorge Novo

Retinal image registration is of utmost importance due to its wide applications in medical practice. In this context, we propose ConKeD, a novel deep learning approach to learn descriptors for retinal image registration. In contrast to current registration methods, our approach employs a novel multi-positive multi-negative contrastive learning strategy that enables the utilization of additional information from the available training samples. This makes it possible to learn high quality descriptors from limited training data. To train and evaluate ConKeD, we combine these descriptors with domain-specific keypoints, particularly blood vessel bifurcations and crossovers, that are detected using a deep neural network. Our experimental results demonstrate the benefits of the novel multi-positive multi-negative strategy, as it outperforms the widely used triplet loss technique (single-positive and single-negative) as well as the single-positive multi-negative alternative. Additionally, the combination of ConKeD with the domain-specific keypoints produces comparable results to the state-of-the-art methods for retinal image registration, while offering important advantages such as avoiding pre-processing, utilizing fewer training samples, and requiring fewer detected keypoints, among others. Therefore, ConKeD shows a promising potential towards facilitating the development and application of deep learning-based methods for retinal image registration.

7/9/2024