Think Twice Before Selection: Federated Evidential Active Learning for Medical Image Analysis with Domain Shifts

Read original: arXiv:2312.02567 - Published 4/23/2024 by Jiayi Chen, Benteng Ma, Hengfei Cui, Yong Xia
Total Score

0

🖼️

Sign in to get full access

or

If you already have an account, we'll log you in

Overview

  • Federated learning allows multiple medical institutions to collaborate on training a global model without centralizing their data.
  • However, the high cost of annotating local data remains a challenge.
  • Federated active learning methods try to address this by selecting a small amount of informative local data for annotation.
  • Existing methods focus on local data from the same domain, which can be unreliable when there are domain shifts across different clients.
  • This paper proposes a new method called Federated Evidential Active Learning (FEAL) to address this issue.

Plain English Explanation

Federated learning is a way for different medical institutions to work together to train a single machine learning model, without having to share their private patient data. This is useful because hospitals and clinics often can't share data due to privacy concerns.

However, one problem with federated learning is that it can be expensive for each institution to carefully label or "annotate" their local data before sharing it. Federated active learning tries to solve this by having the global model identify the most important local data that needs to be annotated, so institutions only have to label a small amount.

But the existing federated active learning methods have a limitation - they assume all the local data comes from the same type of medical cases or "domain". In the real world, different hospitals and clinics may have data from quite different types of patients. This "domain shift" can cause problems for the active learning approach.

To address this, the researchers propose a new method called "Federated Evidential Active Learning" (FEAL). FEAL uses a special way of modeling the uncertainty in the predictions of both the local and global models. This allows it to better identify informative local data, even when there are differences between the data at different institutions. FEAL also has a strategy to reduce redundancy and maintain diversity in the selected local data.

Technical Explanation

The key technical innovation of the FEAL method is the use of a Dirichlet prior distribution in both the local and global models. This allows the models to output a distribution over the probability simplex, rather than just a single prediction. This distribution can capture both the "aleatoric" uncertainty (inherent randomness in the data) and the "epistemic" uncertainty (uncertainty due to limited knowledge).

FEAL then uses the epistemic uncertainty to calibrate the aleatoric uncertainty, in order to better assess the informativeness of local data samples, even when there are domain shifts across clients. Additionally, FEAL employs a diversity relaxation strategy to reduce data redundancy and maintain diversity in the selected local samples.

Extensive experiments on five real-world medical image datasets demonstrate that FEAL outperforms state-of-the-art federated active learning methods, especially in scenarios with domain shifts across clients. The code for FEAL will be available on GitHub.

Critical Analysis

The paper makes a valuable contribution by addressing the challenge of domain shifts in federated active learning, which is an important practical concern for real-world deployment of these methods. The use of the Dirichlet distribution to model uncertainty is a principled approach, and the diversity relaxation strategy is a sensible way to ensure the selected local data is informative.

However, the paper does not provide a deep analysis of the limitations of FEAL. For example, it would be helpful to understand how FEAL performs when the domain shifts are more severe, or when there are multiple distinct domains represented across the clients. Additionally, the paper does not discuss potential privacy or fairness implications of the proposed approach.

It would also be worthwhile for future research to explore ways of further reducing the annotation burden on local clients, perhaps by investigating semi-supervised or self-supervised learning techniques in the federated setting. Uncertainty-Aware Evidential Fusion-Based Learning for Semi-Supervised Medical Image Classification is an example of related work in this direction.

Overall, the FEAL method represents a solid contribution to the field of cross-silo federated learning across divergent domains, and the code release will likely be valuable for further research and development in this area.

Conclusion

This paper proposes a novel Federated Evidential Active Learning (FEAL) method to address the challenge of expensive data annotation in federated learning scenarios with domain shifts across medical institutions. By modeling epistemic and aleatoric uncertainties using a Dirichlet distribution and employing a diversity relaxation strategy, FEAL can effectively identify informative local data samples for annotation, even when the local data comes from diverse domains.

The experimental results demonstrate the superiority of FEAL over state-of-the-art federated active learning approaches, particularly in realistic medical settings with domain shifts. This work represents an important step forward in distributed federated learning-based deep learning model development, with the potential to enable more effective collaborative learning among medical institutions without compromising patient data privacy.



This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🖼️

Total Score

0

Think Twice Before Selection: Federated Evidential Active Learning for Medical Image Analysis with Domain Shifts

Jiayi Chen, Benteng Ma, Hengfei Cui, Yong Xia

Federated learning facilitates the collaborative learning of a global model across multiple distributed medical institutions without centralizing data. Nevertheless, the expensive cost of annotation on local clients remains an obstacle to effectively utilizing local data. To mitigate this issue, federated active learning methods suggest leveraging local and global model predictions to select a relatively small amount of informative local data for annotation. However, existing methods mainly focus on all local data sampled from the same domain, making them unreliable in realistic medical scenarios with domain shifts among different clients. In this paper, we make the first attempt to assess the informativeness of local data derived from diverse domains and propose a novel methodology termed Federated Evidential Active Learning (FEAL) to calibrate the data evaluation under domain shift. Specifically, we introduce a Dirichlet prior distribution in both local and global models to treat the prediction as a distribution over the probability simplex and capture both aleatoric and epistemic uncertainties by using the Dirichlet-based evidential model. Then we employ the epistemic uncertainty to calibrate the aleatoric uncertainty. Afterward, we design a diversity relaxation strategy to reduce data redundancy and maintain data diversity. Extensive experiments and analysis on five real multi-center medical image datasets demonstrate the superiority of FEAL over the state-of-the-art active learning methods in federated scenarios with domain shifts. The code will be available at https://github.com/JiayiChen815/FEAL.

Read more

4/23/2024

🏷️

Total Score

0

Federated Active Learning Framework for Efficient Annotation Strategy in Skin-lesion Classification

Zhipeng Deng, Yuqiao Yang, Kenji Suzuki

Federated Learning (FL) enables multiple institutes to train models collaboratively without sharing private data. Current FL research focuses on communication efficiency, privacy protection, and personalization and assumes that the data of FL have already been ideally collected. In medical scenarios, however, data annotation demands both expertise and intensive labor, which is a critical problem in FL. Active learning (AL), has shown promising performance in reducing the number of data annotations in medical image analysis. We propose a federated AL (FedAL) framework in which AL is executed periodically and interactively under FL. We exploit a local model in each hospital and a global model acquired from FL to construct an ensemble. We use ensemble-entropy-based AL as an efficient data-annotation strategy in FL. Therefore, our FedAL framework can decrease the amount of annotated data and preserve patient privacy while maintaining the performance of FL. To our knowledge, this is the first FedAL framework applied to medical images. We validated our framework on real-world dermoscopic datasets. Using only 50% of samples, our framework was able to achieve state-of-the-art performance on a skin-lesion classification task. Our framework performed better than several state-of-the-art AL methods under FL and achieved comparable performance to full-data FL.

Read more

6/18/2024

🖼️

Total Score

0

Federated Learning for Medical Image Analysis: A Survey

Hao Guan, Pew-Thian Yap, Andrea Bozoki, Mingxia Liu

Machine learning in medical imaging often faces a fundamental dilemma, namely, the small sample size problem. Many recent studies suggest using multi-domain data pooled from different acquisition sites/centers to improve statistical power. However, medical images from different sites cannot be easily shared to build large datasets for model training due to privacy protection reasons. As a promising solution, federated learning, which enables collaborative training of machine learning models based on data from different sites without cross-site data sharing, has attracted considerable attention recently. In this paper, we conduct a comprehensive survey of the recent development of federated learning methods in medical image analysis. In this survey, we first introduce the background knowledge of federated learning for dealing with privacy protection and collaborative learning issues in medical imaging. We then present a comprehensive review of recent advances in federated learning methods for medical image analysis. Specifically, existing methods are categorized based on three critical aspects of a federated learning system, including client end, server end, and communication techniques. In each category, we summarize the existing federated learning methods according to specific research problems in medical image analysis and also provide insights into the motivations of different approaches. In addition, we provide a review of existing benchmark medical imaging datasets and software platforms for current federated learning research. We also conduct an experimental study to empirically evaluate typical federated learning methods for medical image analysis. This survey can help to better understand the current research status, challenges, and potential research opportunities in this promising research field.

Read more

7/9/2024

📈

Total Score

0

New!An Enhanced Federated Prototype Learning Method under Domain Shift

Liang Kuang, Kuangpu Guo, Jian Liang, Jianguo Zhang

Federated Learning (FL) allows collaborative machine learning training without sharing private data. Numerous studies have shown that one significant factor affecting the performance of federated learning models is the heterogeneity of data across different clients, especially when the data is sampled from various domains. A recent paper introduces variance-aware dual-level prototype clustering and uses a novel $alpha$-sparsity prototype loss, which increases intra-class similarity and reduces inter-class similarity. To ensure that the features converge within specific clusters, we introduce an improved algorithm, Federated Prototype Learning with Convergent Clusters, abbreviated as FedPLCC. To increase inter-class distances, we weight each prototype with the size of the cluster it represents. To reduce intra-class distances, considering that prototypes with larger distances might come from different domains, we select only a certain proportion of prototypes for the loss function calculation. Evaluations on the Digit-5, Office-10, and DomainNet datasets show that our method performs better than existing approaches.

Read more

9/30/2024