Perception and Localization of Macular Degeneration Applying Convolutional Neural Network, ResNet and Grad-CAM

Read original: arXiv:2404.15918 - Published 5/3/2024 by Tahmim Hossain, Sagor Chandro Bakchy

Perception and Localization of Macular Degeneration Applying Convolutional Neural Network, ResNet and Grad-CAM

Overview

This paper explores the use of deep learning techniques, specifically Convolutional Neural Networks (CNNs) and ResNet, to detect and localize macular degeneration in fundus images.
The researchers apply Gradient-weighted Class Activation Mapping (Grad-CAM) to visualize the regions of the fundus image that are most important for the model's prediction.
The goal is to develop a system that can accurately identify and localize macular degeneration, a leading cause of vision loss, to support early diagnosis and treatment.

Plain English Explanation

Macular degeneration is a common eye condition that can lead to vision loss, particularly in older adults. This paper looks at using advanced artificial intelligence (AI) techniques to help detect and locate signs of macular degeneration in digital images of the back of the eye, called fundus images.

The researchers trained a deep learning model, which is a type of AI inspired by the human brain, to analyze fundus images and identify whether they contain signs of macular degeneration. They used two popular deep learning architectures, called Convolutional Neural Networks (CNNs) and ResNet, to develop their model.

To help explain how the model works, the researchers also used a technique called Grad-CAM. This allowed them to visualize the specific areas of the fundus image that the model was focusing on to make its prediction about macular degeneration. [This could be useful for <a href="https://aimodels.fyi/papers/arxiv/diagnosis-multiple-fundus-disorders-amidst-scarcity-medical">understanding macular degeneration and other eye conditions</a> from medical images.]

The goal of this research is to create a tool that can quickly and accurately detect signs of macular degeneration in fundus images. This could help doctors diagnose the condition earlier and provide treatment to prevent or slow down vision loss. [It builds on previous work in <a href="https://aimodels.fyi/papers/arxiv/classification-diabetic-retinopathy-using-pre-trained-deep">using deep learning for eye disease detection</a> and <a href="https://aimodels.fyi/papers/arxiv/nnmobilenet-rethinking-cnn-retinopathy-research">improving the efficiency of these models</a>.]

Technical Explanation

The researchers used a dataset of fundus images labeled as either containing signs of macular degeneration or not. They split this dataset into training, validation, and test sets to develop and evaluate their deep learning models.

They experimented with two popular CNN architectures: a custom CNN and the ResNet model. The custom CNN had several convolutional, pooling, and fully connected layers, while ResNet is a deeper, more complex model that has been shown to perform well on image recognition tasks.

To help understand how the models were making their predictions, the researchers applied Grad-CAM. This technique generates a heat map that highlights the regions of the fundus image that were most important for the model's classification of macular degeneration.

The results showed that both the custom CNN and ResNet models were able to achieve high accuracy in detecting macular degeneration, with the ResNet model performing slightly better. The Grad-CAM visualizations also provided insights into the specific areas of the fundus image that the models were focusing on, such as the macula and optic disc.

Critical Analysis

The paper provides a thorough and well-designed study on the use of deep learning for macular degeneration detection and localization. The researchers have carefully selected their deep learning models and evaluation techniques, and the results demonstrate the potential of this approach.

However, the paper does not address some potential limitations or areas for further research. For example, the dataset size and diversity could be expanded to further test the generalizability of the models. [Additional research may be needed to <a href="https://aimodels.fyi/papers/arxiv/classification-prostate-cancer-3d-magnetic-resonance-imaging">apply these techniques to other medical imaging domains</a>.]

Additionally, the paper does not discuss the clinical implications or practical deployment of such a system. Further research would be needed to understand how this technology could be integrated into the clinical workflow and its potential impact on patient outcomes.

Overall, this paper provides a solid foundation for using deep learning and Grad-CAM to detect and localize macular degeneration, but there is still room for continued <a href="https://aimodels.fyi/papers/arxiv/leveraging-pre-trained-cnns-efficient-feature-extraction">optimization and real-world application</a> of these techniques.

Conclusion

This paper demonstrates the potential of using deep learning, specifically CNNs and ResNet, to accurately detect and localize macular degeneration in fundus images. The application of Grad-CAM provides valuable insights into the model's decision-making process, which could help improve the interpretability and trust in these AI-powered diagnostic tools.

The results of this research could contribute to the development of more effective early detection and monitoring systems for macular degeneration, potentially leading to improved patient outcomes and quality of life. Further research is needed to address the limitations and explore the practical implementation of this technology in clinical settings.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Perception and Localization of Macular Degeneration Applying Convolutional Neural Network, ResNet and Grad-CAM

Tahmim Hossain, Sagor Chandro Bakchy

A well-known retinal disease that sends blurry visions to the affected patients is Macular Degeneration. This research is based on classifying the healthy and macular degeneration fundus by localizing the affected region of the fundus. A CNN architecture and CNN with ResNet architecture (ResNet50, ResNet50v2, ResNet101, ResNet101v2, ResNet152, ResNet152v2) as the backbone are used to classify the two types of fundus. The data are split into three categories including (a) Training set is 90% and Testing set is 10% (b) Training set is 80% and Testing set is 20%, (c) Training set is 50% and Testing set is 50%. After the training, the best model has been selected from the evaluation metrics. Among the models, CNN with a backbone of ResNet50 performs best which gives the training accuracy of 98.7% for 90% train and 10% test data split. With this model, we have performed the Grad-CAM visualization to get the region of the affected area of the fundus.

5/3/2024

🧠

Explainable Convolutional Neural Networks for Retinal Fundus Classification and Cutting-Edge Segmentation Models for Retinal Blood Vessels from Fundus Images

Fatema Tuj Johora Faria, Mukaffi Bin Moin, Pronay Debnath, Asif Iftekher Fahim, Faisal Muhammad Shah

Our research focuses on the critical field of early diagnosis of disease by examining retinal blood vessels in fundus images. While automatic segmentation of retinal blood vessels holds promise for early detection, accurate analysis remains challenging due to the limitations of existing methods, which often lack discrimination power and are susceptible to influences from pathological regions. Our research in fundus image analysis advances deep learning-based classification using eight pre-trained CNN models. To enhance interpretability, we utilize Explainable AI techniques such as Grad-CAM, Grad-CAM++, Score-CAM, Faster Score-CAM, and Layer CAM. These techniques illuminate the decision-making processes of the models, fostering transparency and trust in their predictions. Expanding our exploration, we investigate ten models, including TransUNet with ResNet backbones, Attention U-Net with DenseNet and ResNet backbones, and Swin-UNET. Incorporating diverse architectures such as ResNet50V2, ResNet101V2, ResNet152V2, and DenseNet121 among others, this comprehensive study deepens our insights into attention mechanisms for enhanced fundus image analysis. Among the evaluated models for fundus image classification, ResNet101 emerged with the highest accuracy, achieving an impressive 94.17%. On the other end of the spectrum, EfficientNetB0 exhibited the lowest accuracy among the models, achieving a score of 88.33%. Furthermore, in the domain of fundus image segmentation, Swin-Unet demonstrated a Mean Pixel Accuracy of 86.19%, showcasing its effectiveness in accurately delineating regions of interest within fundus images. Conversely, Attention U-Net with DenseNet201 backbone exhibited the lowest Mean Pixel Accuracy among the evaluated models, achieving a score of 75.87%.

5/14/2024

🌐

Lesion-aware network for diabetic retinopathy diagnosis

Xue Xia, Kun Zhan, Yuming Fang, Wenhui Jiang, Fei Shen

Deep learning brought boosts to auto diabetic retinopathy (DR) diagnosis, thus, greatly helping ophthalmologists for early disease detection, which contributes to preventing disease deterioration that may eventually lead to blindness. It has been proved that convolutional neural network (CNN)-aided lesion identifying or segmentation benefits auto DR screening. The key to fine-grained lesion tasks mainly lies in: (1) extracting features being both sensitive to tiny lesions and robust against DR-irrelevant interference, and (2) exploiting and re-using encoded information to restore lesion locations under extremely imbalanced data distribution. To this end, we propose a CNN-based DR diagnosis network with attention mechanism involved, termed lesion-aware network, to better capture lesion information from imbalanced data. Specifically, we design the lesion-aware module (LAM) to capture noise-like lesion areas across deeper layers, and the feature-preserve module (FPM) to assist shallow-to-deep feature fusion. Afterward, the proposed lesion-aware network (LANet) is constructed by embedding the LAM and FPM into the CNN decoders for DR-related information utilization. The proposed LANet is then further extended to a DR screening network by adding a classification layer. Through experiments on three public fundus datasets with pixel-level annotations, our method outperforms the mainstream methods with an area under curve of 0.967 in DR screening, and increases the overall average precision by 7.6%, 2.1%, and 1.2% in lesion segmentation on three datasets. Besides, the ablation study validates the effectiveness of the proposed sub-modules.

8/15/2024

Classification of Diabetic Retinopathy using Pre-Trained Deep Learning Models

Inas Al-Kamachy (Karlstad University, Sweden), Prof. Dr. Reza Hassanpour (Rotterdam University, Netherlands), Prof. Roya Choupani (Angelo State University, USA)

Diabetic Retinopathy (DR) stands as the leading cause of blindness globally, particularly affecting individuals between the ages of 20 and 70. This paper presents a Computer-Aided Diagnosis (CAD) system designed for the automatic classification of retinal images into five distinct classes: Normal, Mild, Moderate, Severe, and Proliferative Diabetic Retinopathy (PDR). The proposed system leverages Convolutional Neural Networks (CNNs) employing pre-trained deep learning models. Through the application of fine-tuning techniques, our model is trained on fundus images of diabetic retinopathy with resolutions of 350x350x3 and 224x224x3. Experimental results obtained on the Kaggle platform, utilizing resources comprising 4 CPUs, 17 GB RAM, and 1 GB Disk, demonstrate the efficacy of our approach. The achieved Area Under the Curve (AUC) values for CNN, MobileNet, VGG-16, InceptionV3, and InceptionResNetV2 models are 0.50, 0.70, 0.53, 0.63, and 0.69, respectively.

4/1/2024