Comparison of fine-tuning strategies for transfer learning in medical image classification

Read original: arXiv:2406.10050 - Published 6/17/2024 by Ana Davila, Jacinto Colan, Yasuhisa Hasegawa

Comparison of fine-tuning strategies for transfer learning in medical image classification

Overview

This paper compares different fine-tuning strategies for transfer learning in medical image classification tasks.
Fine-tuning is a common technique in deep learning where a model trained on a large dataset is further trained on a smaller, domain-specific dataset to improve performance.
The researchers explored different approaches to fine-tuning, such as freezing layers or selectively training layers, to see which strategies work best for medical imaging applications.

Plain English Explanation

Deep learning models, such as those used for image classification, are often trained on large, general datasets like ImageNet. While these models can perform well on a wide range of tasks, they may not be optimized for specific domains like medical imaging. To improve performance on medical image classification, researchers often use a technique called fine-tuning.

Fine-tuning involves taking a model pre-trained on a large dataset and further training it on a smaller, more specialized dataset from the domain of interest. This allows the model to learn features that are particularly relevant to the new task, rather than relying solely on the general features learned during the initial training.

In this paper, the researchers explored different strategies for fine-tuning deep learning models for medical image classification. Some of the approaches they investigated include freezing the lower layers of the model and selectively training only certain layers during the fine-tuning process. The goal was to determine which fine-tuning strategies work best for improving the performance of deep learning models on medical imaging tasks.

Technical Explanation

The researchers conducted experiments on several medical image classification datasets, including chest X-rays, mammograms, and skin lesion images. They used pre-trained models, such as ResNet and Inception, as the starting point for fine-tuning.

The researchers explored various fine-tuning strategies, including:

Freezing the lower layers of the model and only fine-tuning the upper layers
Selectively fine-tuning only certain layers of the model
[Applying contrastive adversarial training or fusion strategies during fine-tuning

The researchers evaluated the performance of the fine-tuned models on the medical image classification tasks and compared the results across the different fine-tuning strategies. They found that the selective fine-tuning approach, where only certain layers are updated during fine-tuning, generally performed the best, outperforming the other strategies.

Critical Analysis

The researchers acknowledged that the optimal fine-tuning strategy may depend on the specific dataset and task at hand. They also noted that the performance gains from fine-tuning can be limited by the size and quality of the medical image datasets available for training.

One potential limitation of the study is that it only considered a few pre-trained models and fine-tuning strategies. There may be other approaches, such as parameter-efficient fine-tuning, that could also be effective for medical image classification tasks.

Additionally, the researchers did not explore the potential trade-offs between model performance and computational efficiency, which could be an important consideration in real-world medical imaging applications.

Conclusion

This paper provides a valuable comparison of different fine-tuning strategies for medical image classification tasks. The researchers found that selectively fine-tuning certain layers of a pre-trained model generally outperformed other approaches, such as freezing layers or applying contrastive adversarial training.

These findings could be useful for researchers and practitioners working on developing deep learning models for medical imaging applications, where the ability to effectively leverage transfer learning is often crucial for achieving good performance on limited datasets. The insights from this study can help guide the selection of appropriate fine-tuning strategies for various medical image classification problems.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Comparison of fine-tuning strategies for transfer learning in medical image classification

Ana Davila, Jacinto Colan, Yasuhisa Hasegawa

In the context of medical imaging and machine learning, one of the most pressing challenges is the effective adaptation of pre-trained models to specialized medical contexts. Despite the availability of advanced pre-trained models, their direct application to the highly specialized and diverse field of medical imaging often falls short due to the unique characteristics of medical data. This study provides a comprehensive analysis on the performance of various fine-tuning methods applied to pre-trained models across a spectrum of medical imaging domains, including X-ray, MRI, Histology, Dermoscopy, and Endoscopic surgery. We evaluated eight fine-tuning strategies, including standard techniques such as fine-tuning all layers or fine-tuning only the classifier layers, alongside methods such as gradually unfreezing layers, regularization based fine-tuning and adaptive learning rates. We selected three well-established CNN architectures (ResNet-50, DenseNet-121, and VGG-19) to cover a range of learning and feature extraction scenarios. Although our results indicate that the efficacy of these fine-tuning methods significantly varies depending on both the architecture and the medical imaging type, strategies such as combining Linear Probing with Full Fine-tuning resulted in notable improvements in over 50% of the evaluated cases, demonstrating general effectiveness across medical domains. Moreover, Auto-RGN, which dynamically adjusts learning rates, led to performance enhancements of up to 11% for specific modalities. Additionally, the DenseNet architecture showed more pronounced benefits from alternative fine-tuning approaches compared to traditional full fine-tuning. This work not only provides valuable insights for optimizing pre-trained models in medical image analysis but also suggests the potential for future research into more advanced architectures and fine-tuning methods.

6/17/2024

Disease Classification and Impact of Pretrained Deep Convolution Neural Networks on Diverse Medical Imaging Datasets across Imaging Modalities

Jutika Borah, Kumaresh Sarmah, Hidam Kumarjit Singh

Imaging techniques such as Chest X-rays, whole slide images, and optical coherence tomography serve as the initial screening and detection for a wide variety of medical pulmonary and ophthalmic conditions respectively. This paper investigates the intricacies of using pretrained deep convolutional neural networks with transfer learning across diverse medical imaging datasets with varying modalities for binary and multiclass classification. We conducted a comprehensive performance analysis with ten network architectures and model families each with pretraining and random initialization. Our finding showed that the use of pretrained models as fixed feature extractors yields poor performance irrespective of the datasets. Contrary, histopathology microscopy whole slide images have better performance. It is also found that deeper and more complex architectures did not necessarily result in the best performance. This observation implies that the improvements in ImageNet are not parallel to the medical imaging tasks. Within a medical domain, the performance of the network architectures varies within model families with shifts in datasets. This indicates that the performance of models within a specific modality may not be conclusive for another modality within the same domain. This study provides a deeper understanding of the applications of deep learning techniques in medical imaging and highlights the impact of pretrained networks across different medical imaging datasets under five different experimental settings.

9/4/2024

Contrastive-Adversarial and Diffusion: Exploring pre-training and fine-tuning strategies for sulcal identification

Michail Mamalakis, H'eloise de Vareilles, Shun-Chin Jim Wu, Ingrid Agartz, Lynn Egeland M{o}rch-Johnsen, Jane Garrison, Jon Simons, Pietro Lio, John Suckling, Graham Murray

In the last decade, computer vision has witnessed the establishment of various training and learning approaches. Techniques like adversarial learning, contrastive learning, diffusion denoising learning, and ordinary reconstruction learning have become standard, representing state-of-the-art methods extensively employed for fully training or pre-training networks across various vision tasks. The exploration of fine-tuning approaches has emerged as a current focal point, addressing the need for efficient model tuning with reduced GPU memory usage and time costs while enhancing overall performance, as exemplified by methodologies like low-rank adaptation (LoRA). Key questions arise: which pre-training technique yields optimal results - adversarial, contrastive, reconstruction, or diffusion denoising? How does the performance of these approaches vary as the complexity of fine-tuning is adjusted? This study aims to elucidate the advantages of pre-training techniques and fine-tuning strategies to enhance the learning process of neural networks in independent identical distribution (IID) cohorts. We underscore the significance of fine-tuning by examining various cases, including full tuning, decoder tuning, top-level tuning, and fine-tuning of linear parameters using LoRA. Systematic summaries of model performance and efficiency are presented, leveraging metrics such as accuracy, time cost, and memory efficiency. To empirically demonstrate our findings, we focus on a multi-task segmentation-classification challenge involving the paracingulate sulcus (PCS) using different 3D Convolutional Neural Network (CNN) architectures by using the TOP-OSLO cohort comprising 596 subjects.

5/30/2024

➖

MoVL:Exploring Fusion Strategies for the Domain-Adaptive Application of Pretrained Models in Medical Imaging Tasks

Haijiang Tian, Jingkun Yue, Xiaohong Liu, Guoxing Yang, Zeyu Jiang, Guangyu Wang

Medical images are often more difficult to acquire than natural images due to the specialism of the equipment and technology, which leads to less medical image datasets. So it is hard to train a strong pretrained medical vision model. How to make the best of natural pretrained vision model and adapt in medical domain still pends. For image classification, a popular method is linear probe (LP). However, LP only considers the output after feature extraction. Yet, there exists a gap between input medical images and natural pretrained vision model. We introduce visual prompting (VP) to fill in the gap, and analyze the strategies of coupling between LP and VP. We design a joint learning loss function containing categorisation loss and discrepancy loss, which describe the variance of prompted and plain images, naming this joint training strategy MoVL (Mixture of Visual Prompting and Linear Probe). We experiment on 4 medical image classification datasets, with two mainstream architectures, ResNet and CLIP. Results shows that without changing the parameters and architecture of backbone model and with less parameters, there is potential for MoVL to achieve full finetune (FF) accuracy (on four medical datasets, average 90.91% for MoVL and 91.13% for FF). On out of distribution medical dataset, our method(90.33%) can outperform FF (85.15%) with absolute 5.18 % lead.

5/14/2024