Multi-Site Class-Incremental Learning with Weighted Experts in Echocardiography

Read original: arXiv:2407.21577 - Published 8/1/2024 by Kit M. Bransby, Woo-jin Cho Kim, Jorge Oliveira, Alex Thorley, Arian Beqiri, Alberto Gomez, Agisilaos Chartsias

Overview

The paper proposes a multi-site class-incremental learning approach with weighted experts for echocardiography tasks.
The method aims to address the challenge of learning new classes without forgetting previous knowledge, while leveraging data from multiple hospital sites.
Weighted experts are used to combine the knowledge from different sites and adapt to new classes incrementally.

Plain English Explanation

The researchers developed a machine learning system that can learn to recognise new categories of echocardiogram images (in the paper, new view labels) over time, without forgetting what it has already learned. This is called class-incremental learning.

Traditional machine learning models tend to "forget" old information when learning new things. But for medical applications, it's important to maintain and build upon existing knowledge. The researchers addressed this by using "weighted experts" - different parts of the model that specialize in different hospital sites or types of echocardiograms.

As the model learns new classes of echocardiograms from additional hospital sites, it can leverage the knowledge stored in these weighted experts, rather than starting from scratch. This allows the model to continually expand its capabilities without losing what it has already learned.

The key innovation is this multi-site, incremental learning approach, which helps the model adapt to new medical data over time in a more efficient and effective way. This could lead to more robust and capable AI systems for assisting clinicians with analyzing echocardiograms.

Technical Explanation

The paper presents a multi-site class-incremental learning method with weighted experts for echocardiography tasks. The core idea is to leverage knowledge from multiple hospital sites to learn new classes of echocardiograms incrementally, without forgetting previous knowledge.

The method learns a separate expert network for each dataset and combines the experts with a score fusion model. Each expert's contribution is weighted by a learnt in-distribution score, which minimises the influence of experts that are unqualified for a given input. As new classes are introduced from additional sites, new experts extend what the model can recognise, which lets it expand its capabilities without catastrophic forgetting.
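
The weighting of experts can be illustrated with a minimal sketch. It assumes each expert returns class probabilities over its own label set plus a scalar in-distribution score between 0 and 1. The function name `fuse_expert_scores`, the dictionary layout, the example view labels, and the normalisation step are illustrative assumptions, not the authors' implementation.

```python
def fuse_expert_scores(expert_outputs):
    """Combine per-expert class probabilities into one score per class.

    expert_outputs: list of (probs, weight) pairs, where probs maps a class
    label to a probability and weight is that expert's in-distribution score.
    Experts may know different label sets (hypothetical layout).
    """
    total_weight = sum(weight for _, weight in expert_outputs)
    if total_weight <= 0:
        raise ValueError("at least one expert needs a positive weight")

    fused = {}
    for probs, weight in expert_outputs:
        share = weight / total_weight  # this expert's share of the final score
        for label, p in probs.items():
            fused[label] = fused.get(label, 0.0) + share * p
    return fused


# Made-up example: two site experts with partly different view labels.
site_a = ({"A4C": 0.7, "PLAX": 0.3}, 0.9)  # input looks in-distribution
site_b = ({"A4C": 0.2, "A2C": 0.8}, 0.1)   # input looks out-of-distribution
fused = fuse_expert_scores([site_a, site_b])
prediction = max(fused, key=fused.get)
```

Because each expert's share is computed explicitly, its contribution to the final prediction can be inspected at inference time.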

Training is iterative and proceeds dataset by dataset: an expert network is trained on the data from each site, and the score fusion model then combines the experts. Instead of the original images, the method uses learned features from each dataset, which are easier to share and raise fewer licensing and privacy concerns. This lets the model capture site-specific characteristics without pooling all images on a single server.

Experiments on six echocardiography datasets from multiple sites demonstrate the effectiveness of the proposed approach compared to standard incremental learning baselines. The authors report significant reductions in training time together with improved view classification performance, and the weighted experts also exhibited better retention of previously learned tasks.
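
Retention of previous tasks in incremental learning is commonly summarised by average forgetting: for each earlier task, the drop from its best accuracy at any earlier step to its accuracy after the final step. The sketch below computes it from a hypothetical accuracy matrix. The numbers are made up for illustration and are not results from the paper, and the function name is an assumption.

```python
def average_forgetting(acc):
    """acc[i][j] = accuracy on task j after training step i (j <= i)."""
    last = len(acc) - 1
    if last == 0:
        return 0.0  # only one step: nothing could have been forgotten
    drops = []
    for j in range(last):  # the final task has no earlier accuracy to compare
        best_earlier = max(acc[i][j] for i in range(j, last))
        drops.append(best_earlier - acc[last][j])
    return sum(drops) / len(drops)


# Made-up accuracies for three training steps (rows) and three tasks.
acc = [
    [0.90],
    [0.80, 0.85],
    [0.70, 0.80, 0.88],
]
forgetting = average_forgetting(acc)
final_accuracy = sum(acc[-1]) / len(acc[-1])
```

A lower forgetting value means earlier tasks are retained better. The final average accuracy is reported alongside it so that a method cannot score well simply by never learning the new task.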

Critical Analysis

The authors acknowledge several limitations of their approach. First, the performance of the weighted experts depends on the quality and quantity of data from each hospital site. Imbalanced or biased datasets could degrade the site-specific experts.

Additionally, the iterative training procedure, while effective, can be computationally expensive as the number of sites and classes grows. The authors suggest exploring more efficient optimization techniques to address this scalability challenge.

Another consideration is interpretability. The learnt weights promote transparency, since the contribution of each expert is known during inference, but the reasons behind an individual expert's prediction may still be difficult to understand. This could be a concern for medical applications, where transparency is highly valued.

Further research could investigate ways to improve the interpretability of the weighted experts, as well as techniques to automatically detect and mitigate dataset biases that could negatively impact the model's performance.

Conclusion

This paper presents a novel multi-site class-incremental learning approach with weighted experts for echocardiography tasks. By leveraging knowledge from multiple hospital sites and selectively adapting to new classes, the model can continuously expand its capabilities without forgetting previous knowledge.

The key contributions of this work are the weighted expert architecture and the iterative training procedure, which enable effective class-incremental learning in a multi-site setting. The experimental results demonstrate the advantages of this approach over standard incremental learning baselines, highlighting its potential for developing more robust and adaptable AI systems in medical imaging applications.

Future research could explore ways to address the scalability and interpretability challenges, as well as investigate the broader applicability of this multi-site class-incremental learning framework beyond echocardiography.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Multi-Site Class-Incremental Learning with Weighted Experts in Echocardiography

Kit M. Bransby, Woo-jin Cho Kim, Jorge Oliveira, Alex Thorley, Arian Beqiri, Alberto Gomez, Agisilaos Chartsias

Building an echocardiography view classifier that maintains performance in real-life cases requires diverse multi-site data, and frequent updates with newly available data to mitigate model drift. Simply fine-tuning on new datasets results in catastrophic forgetting, and cannot adapt to variations of view labels between sites. Alternatively, collecting all data on a single server and re-training may not be feasible as data sharing agreements may restrict image transfer, or datasets may only become available at different times. Furthermore, time and cost associated with re-training grows with every new dataset. We propose a class-incremental learning method which learns an expert network for each dataset, and combines all expert networks with a score fusion model. The influence of "unqualified experts" is minimised by weighting each contribution with a learnt in-distribution score. These weights promote transparency as the contribution of each expert is known during inference. Instead of using the original images, we use learned features from each dataset, which are easier to share and raise fewer licensing and privacy concerns. We validate our work on six datasets from multiple sites, demonstrating significant reductions in training time while improving view classification performance.

8/1/2024

BackMix: Mitigating Shortcut Learning in Echocardiography with Minimal Supervision

Kit Mills Bransby, Arian Beqiri, Woo-Jin Cho Kim, Jorge Oliveira, Agisilaos Chartsias, Alberto Gomez

Neural networks can learn spurious correlations that lead to the correct prediction in a validation set, but generalise poorly because the predictions are right for the wrong reason. This undesired learning of naive shortcuts (Clever Hans effect) can happen for example in echocardiogram view classification when background cues (e.g. metadata) are biased towards a class and the model learns to focus on those background features instead of on the image content. We propose a simple, yet effective random background augmentation method called BackMix, which samples random backgrounds from other examples in the training set. By enforcing the background to be uncorrelated with the outcome, the model learns to focus on the data within the ultrasound sector and becomes invariant to the regions outside this. We extend our method in a semi-supervised setting, finding that the positive effects of BackMix are maintained with as few as 5% of segmentation labels. A loss weighting mechanism, wBackMix, is also proposed to increase the contribution of the augmented examples. We validate our method on both in-distribution and out-of-distribution datasets, demonstrating significant improvements in classification accuracy, region focus and generalisability. Our source code is available at: https://github.com/kitbransby/BackMix

6/28/2024
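
The background replacement described in the BackMix abstract can be sketched as follows. This is a simplified illustration, assuming a binary mask of the ultrasound sector is available for each image and that all images share the same size. The function names and the pixel-wise compositing rule are assumptions and are not taken from the BackMix source code.

```python
import random


def backmix(image, sector_mask, donor):
    """Keep pixels inside the ultrasound sector, take the rest from donor."""
    return [
        [img_px if inside else donor_px
         for img_px, inside, donor_px in zip(img_row, mask_row, donor_row)]
        for img_row, mask_row, donor_row in zip(image, sector_mask, donor)
    ]


def augment(index, images, masks, rng):
    """Give images[index] the background of a randomly chosen other example."""
    donors = [i for i in range(len(images)) if i != index]
    donor = images[rng.choice(donors)]
    return backmix(images[index], masks[index], donor)


# Tiny made-up 2x2 "images"; a mask value of 1 marks the ultrasound sector.
images = [[[1, 1], [1, 1]], [[9, 9], [9, 9]]]
masks = [[[1, 0], [1, 1]], [[0, 1], [1, 1]]]
mixed = augment(0, images, masks, random.Random(0))
```

Since the donor is drawn at random, the pixels outside the sector carry no consistent information about the class label, which is the property the abstract relies on.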

Label Dropout: Improved Deep Learning Echocardiography Segmentation Using Multiple Datasets With Domain Shift and Partial Labelling

Iman Islam (King's College London), Esther Puyol-Antón (King's College London), Bram Ruijsink (King's College London), Andrew J. Reader (King's College London), Andrew P. King (King's College London)

Echocardiography (echo) is the first imaging modality used when assessing cardiac function. The measurement of functional biomarkers from echo relies upon the segmentation of cardiac structures and deep learning models have been proposed to automate the segmentation process. However, in order to translate these tools to widespread clinical use it is important that the segmentation models are robust to a wide variety of images (e.g. acquired from different scanners, by operators with different levels of expertise etc.). To achieve this level of robustness it is necessary that the models are trained with multiple diverse datasets. A significant challenge faced when training with multiple diverse datasets is the variation in label presence, i.e. the combined data are often partially-labelled. Adaptations of the cross entropy loss function have been proposed to deal with partially labelled data. In this paper we show that training naively with such a loss function and multiple diverse datasets can lead to a form of shortcut learning, where the model associates label presence with domain characteristics, leading to a drop in performance. To address this problem, we propose a novel label dropout scheme to break the link between domain characteristics and the presence or absence of labels. We demonstrate that label dropout improves echo segmentation Dice score by 62% and 25% on two cardiac structures when training using multiple diverse partially labelled datasets.

8/16/2024
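
One way to picture a partial-label loss combined with label dropout is the sketch below. It is a heavily simplified assumption: it treats a single pixel with one predicted probability per cardiac structure, uses a masked binary cross-entropy as the adapted loss, and drops available labels independently at random. The exact loss and dropout scheme of the Label Dropout paper may differ, and all names here are hypothetical.

```python
import math
import random


def masked_bce(probs, targets, labelled):
    """Binary cross-entropy averaged over the structures marked as labelled.

    probs must lie strictly between 0 and 1.
    """
    terms = [
        -(t * math.log(p) + (1 - t) * math.log(1 - p))
        for p, t, keep in zip(probs, targets, labelled)
        if keep
    ]
    return sum(terms) / len(terms) if terms else 0.0


def drop_labels(labelled, drop_prob, rng):
    """Randomly treat some available labels as missing."""
    return [keep and rng.random() >= drop_prob for keep in labelled]


probs = [0.9, 0.2, 0.6]           # predicted probability per structure
targets = [1, 0, 1]               # ground truth where available
labelled = [True, True, False]    # third structure is unlabelled in this dataset
loss = masked_bce(probs, targets, labelled)
dropped = drop_labels(labelled, drop_prob=0.5, rng=random.Random(0))
loss_after_dropout = masked_bce(probs, targets, dropped)
```

Randomly hiding labels that are actually present means the pattern of available labels no longer identifies the source dataset, which is the link the abstract says the scheme is meant to break.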

From Uncertainty to Clarity: Uncertainty-Guided Class-Incremental Learning for Limited Biomedical Samples via Semantic Expansion

Yifei Yao, Hanrong Zhang

In real-world clinical settings, data distributions evolve over time, with a continuous influx of new, limited disease cases. Therefore, class incremental learning is of great significance, i.e., deep learning models are required to learn new class knowledge while maintaining accurate recognition of previous diseases. However, traditional deep neural networks often suffer from severe forgetting of prior knowledge when adapting to new data unless trained from scratch, which undesirably costs much time and computational burden. Additionally, the sample sizes for different diseases can be highly imbalanced, with newly emerging diseases typically having much fewer instances, consequently causing the classification bias. To tackle these challenges, we are the first to propose a class-incremental learning method under limited samples in the biomedical field. First, we propose a novel cumulative entropy prediction module to measure the uncertainty of the samples, of which the most uncertain samples are stored in a memory bank as exemplars for the model's later review. Furthermore, we theoretically demonstrate its effectiveness in measuring uncertainty. Second, we developed a fine-grained semantic expansion module through various augmentations, leading to more compact distributions within the feature space and creating sufficient room for generalization to new classes. Besides, a cosine classifier is utilized to mitigate classification bias caused by imbalanced datasets. Across four imbalanced data distributions over two datasets, our method achieves optimal performance, surpassing state-of-the-art methods by as much as 53.54% in accuracy.

9/14/2024
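
The idea of keeping the most uncertain samples as exemplars can be sketched with plain Shannon entropy standing in for the paper's cumulative entropy prediction module, which is not specified in the abstract. The function names, sample ids, and probabilities below are illustrative assumptions.

```python
import math


def shannon_entropy(probs):
    """Entropy of a predicted class distribution (natural log)."""
    return -sum(p * math.log(p) for p in probs if p > 0)


def select_exemplars(predictions, k):
    """Return the ids of the k samples with the highest predictive entropy."""
    ranked = sorted(
        predictions,
        key=lambda sample_id: shannon_entropy(predictions[sample_id]),
        reverse=True,
    )
    return ranked[:k]


# Made-up softmax outputs for three samples.
predictions = {
    "s1": [0.98, 0.01, 0.01],  # confident prediction, low entropy
    "s2": [0.34, 0.33, 0.33],  # near-uniform prediction, high entropy
    "s3": [0.70, 0.20, 0.10],
}
memory_bank = select_exemplars(predictions, k=2)
```

Here the near-uniform prediction ranks first, so the memory bank favours samples the model finds hardest to classify.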