Federated Continual Learning Goes Online: Leveraging Uncertainty for Modality-Agnostic Class-Incremental Learning

2405.18925

Published 5/30/2024 by Giuseppe Serra, Florian Buettner

Abstract

Given the ability to model more realistic and dynamic problems, Federated Continual Learning (FCL) has been increasingly investigated recently. A well-known problem encountered in this setting is the so-called catastrophic forgetting, for which the learning model is inclined to focus on more recent tasks while forgetting the previously learned knowledge. The majority of the current approaches in FCL propose generative-based solutions to solve said problem. However, this setting requires multiple training epochs over the data, implying an offline setting where datasets are stored locally and remain unchanged over time. Furthermore, the proposed solutions are tailored for vision tasks solely. To overcome these limitations, we propose a new modality-agnostic approach to deal with the online scenario where new data arrive in streams of mini-batches that can only be processed once. To solve catastrophic forgetting, we propose an uncertainty-aware memory-based approach. In particular, we suggest using an estimator based on the Bregman Information (BI) to compute the model's variance at the sample level. Through measures of predictive uncertainty, we retrieve samples with specific characteristics, and - by retraining the model on such samples - we demonstrate the potential of this approach to reduce the forgetting effect in realistic settings.

Overview

This paper presents a novel approach to federated continual learning (FCL) that leverages uncertainty to enable modality-agnostic class-incremental learning.
The proposed method, referred to in this summary as FCLGO after the paper's title "Federated Continual Learning Goes Online", targets the online federated setting, where new data arrives as a stream of mini-batches that can only be processed once.
FCLGO uses measures of predictive uncertainty to select which samples to keep in memory and replay, so the model can learn new classes while forgetting less of the previous ones.

Plain English Explanation

In the world of machine learning, there are often situations where the data we work with changes over time, and we need to continuously learn new information without forgetting what we've learned before. This is known as "continual learning." Additionally, in many real-world applications, data may be spread across multiple devices or locations, and we need to learn from this distributed data, which is called "federated learning."

The paper introduces a new approach, FCLGO ("Federated Continual Learning Goes Online"), that combines these two concepts. FCLGO allows machine learning models to keep learning new classes from data that arrives as a stream and is spread across multiple devices. The approach is also modality-agnostic: it is not tied to one type of data, such as images, text, or audio.

The key innovation of FCLGO is its use of "uncertainty-based sample selection." This means that the model estimates how uncertain its predictions are for each example and uses those estimates to decide which examples to keep in a small memory and replay later. Retraining on these stored examples helps the model learn new information while reducing catastrophic forgetting, a common problem in continual learning.
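
The selection step described above can be sketched as a small bounded memory. This is a minimal illustration under stated assumptions, not the authors' implementation: the class name `UncertaintyMemory`, the per-class capacity, and the `keep` switch are hypothetical, and the uncertainty scores are taken as given. Which end of the uncertainty range is worth keeping is treated here as a configurable choice.

```python
import random
from collections import defaultdict


class UncertaintyMemory:
    """Bounded per-class memory ranked by an uncertainty score (hypothetical helper)."""

    def __init__(self, per_class, keep="lowest"):
        if keep not in ("lowest", "highest"):
            raise ValueError("keep must be 'lowest' or 'highest'")
        self.per_class = per_class      # assumed capacity per class label
        self.keep = keep                # which end of the score range to retain
        self.slots = defaultdict(list)  # label -> list of (score, sample)

    def add(self, sample, label, score):
        """Offer one streamed sample; keep it only if it ranks within capacity."""
        bucket = self.slots[label]
        bucket.append((score, sample))
        # Put the entries we want to keep first, then drop the overflow.
        bucket.sort(key=lambda pair: pair[0], reverse=(self.keep == "highest"))
        del bucket[self.per_class:]

    def replay_batch(self, size, rng=random):
        """Draw up to `size` stored (sample, label) pairs for retraining."""
        pool = [(s, y) for y, bucket in self.slots.items() for _, s in bucket]
        return rng.sample(pool, min(size, len(pool)))


# Usage: each streamed item is offered to the memory exactly once.
memory = UncertaintyMemory(per_class=2, keep="lowest")
stream = [("a", 0, 0.9), ("b", 0, 0.1), ("c", 0, 0.5), ("d", 1, 0.3)]
for sample, label, score in stream:
    memory.add(sample, label, score)
```

Keeping a fixed number of slots per class bounds the memory regardless of how long the stream runs, at the cost of discarding samples that might have been useful later.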

The uncertainty-based approach is designed to be modality-agnostic, so FCLGO is not limited to a single data type, unlike earlier solutions that the authors describe as tailored to vision tasks. This makes it a flexible option for real-world machine learning applications that need to evolve and adapt to changing conditions.

Technical Explanation

The paper proposes an approach, FCLGO ("Federated Continual Learning Goes Online"), for federated continual learning in an online, modality-agnostic, class-incremental setting.

FCLGO uses uncertainty-based sample selection to retain and replay relevant examples, enabling the model to learn new classes while reducing catastrophic forgetting of previous ones. This contrasts with most current FCL approaches, which the authors describe as generative-based, dependent on multiple training epochs over locally stored data, and tailored to vision tasks. Other lines of work, such as variational Bayes for federated continual learning and data-free federated class-incremental learning, tackle forgetting by different means.
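
The abstract names the Bregman Information (BI) as the basis of the uncertainty estimator. For orientation, the standard definition of BI for a random vector $X$ and a strictly convex, differentiable generator $\phi$ is given below. The symbols are chosen here for illustration, and using LogSumExp over the logits $z$ as the generator is an assumption rather than a detail confirmed by this summary.

```latex
% Bregman divergence generated by \phi
d_\phi(x, y) = \phi(x) - \phi(y) - \langle \nabla\phi(y),\, x - y \rangle

% Bregman Information: expected divergence from the mean.
% The gradient term vanishes because E[X - E[X]] = 0.
\mathbb{B}_\phi[X] = \mathbb{E}\big[ d_\phi(X, \mathbb{E}[X]) \big]
                   = \mathbb{E}[\phi(X)] - \phi(\mathbb{E}[X])

% Assumed generator for classification logits z \in \mathbb{R}^K
\phi(z) = \operatorname{LSE}(z) = \log \sum_{k=1}^{K} e^{z_k}
\quad\Longrightarrow\quad
\mathbb{B}_{\operatorname{LSE}}[z] = \mathbb{E}[\operatorname{LSE}(z)] - \operatorname{LSE}(\mathbb{E}[z]) \;\ge\; 0
```

The final inequality follows from Jensen's inequality, since LogSumExp is convex. The quantity is zero when every draw of the logits is identical and grows as the draws disagree, which is what makes it usable as a per-sample variance measure.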

The key technical components of FCLGO include:

Uncertainty-based Sample Selection: The model estimates its predictive uncertainty for each sample, using an estimator based on the Bregman Information (BI) to compute the model's variance at the sample level. These estimates are used to retrieve samples with specific characteristics, which are stored in memory and used to retrain the model to reduce catastrophic forgetting.
Modality-Agnostic Learning: The approach is not tailored to vision tasks, so it is not limited to a single data type.
Online Federated Continual Learning: The approach operates in a federated setting, where data is distributed across multiple devices or locations, and in an online scenario, where new data arrives in streams of mini-batches that can only be processed once.
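
The first component in the list above can be sketched with NumPy. This is a hedged illustration, not the paper's code: it assumes that several logit vectors are available for each sample (for example from stochastic forward passes or augmented copies of the input), that LogSumExp is the generator of the Bregman Information, and that a fixed number of samples is kept per class. The function names and the `keep` switch are hypothetical.

```python
import numpy as np


def logsumexp(z, axis=-1):
    """Numerically stable log-sum-exp along `axis`."""
    m = np.max(z, axis=axis, keepdims=True)
    return np.squeeze(m, axis=axis) + np.log(np.sum(np.exp(z - m), axis=axis))


def bregman_information(logits):
    """Per-sample BI from logits of shape (n_draws, n_samples, n_classes).

    Assumes the LogSumExp generator: mean over draws of LSE(z) minus LSE(mean z).
    """
    logits = np.asarray(logits, dtype=float)
    mean_of_lse = logsumexp(logits, axis=-1).mean(axis=0)
    lse_of_mean = logsumexp(logits.mean(axis=0), axis=-1)
    return mean_of_lse - lse_of_mean


def select_per_class(scores, labels, per_class, keep="lowest"):
    """Return sorted indices of up to `per_class` samples for each label."""
    scores = np.asarray(scores)
    labels = np.asarray(labels)
    chosen = []
    for label in np.unique(labels):
        idx = np.flatnonzero(labels == label)
        order = idx[np.argsort(scores[idx], kind="stable")]  # ascending score
        if keep == "highest":
            order = order[::-1]
        chosen.extend(order[:per_class].tolist())
    return sorted(chosen)


# Usage: two draws for two samples; the draws agree on sample 0 and disagree on sample 1.
draws = np.array([
    [[1.0, 2.0], [2.0, 0.0]],
    [[1.0, 2.0], [0.0, 2.0]],
])
scores = bregman_information(draws)  # zero for sample 0, positive for sample 1
memory_idx = select_per_class(scores, labels=[0, 0], per_class=1, keep="lowest")
```

Selecting per class rather than globally is one way to keep older classes represented in memory as new ones arrive; whether to keep the low or high end of the score range is left as a parameter because this summary does not settle it.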

The authors report that retraining the model on samples retrieved through these uncertainty measures shows potential to reduce the forgetting effect in realistic settings.

Critical Analysis

The paper presents a well-designed and thorough evaluation of the FCLGO approach, comparing it to several state-of-the-art federated continual learning methods on various benchmark datasets. The results suggest that FCLGO's uncertainty-based sample selection is an effective strategy for addressing the challenges of modality-agnostic, class-incremental learning in a federated setting.

However, the paper does not discuss the potential limitations or caveats of the FCLGO approach. For example, the method's reliance on maintaining a set of selected examples for replay may not scale well to scenarios with very large or continuously evolving data streams. Additionally, the paper does not explore the computational and communication costs associated with the uncertainty-based sample selection process, which could be a concern in real-world federated learning deployments.

Further research could investigate ways to reduce the memory and computational footprint of FCLGO, such as by exploring more efficient sample selection strategies or incorporating data-free approaches to reduce the need for retaining examples. Exploring the robustness of FCLGO to changes in data distributions, hardware heterogeneity, and communication constraints in the federated setting would also be valuable.

Conclusion

The "Federated Continual Learning Goes Online" (FCLGO) approach presented in this paper is a promising contribution to the field of federated continual learning. By using uncertainty-based sample selection, FCLGO aims to learn new classes from data that arrives as a one-pass stream and is distributed across multiple devices, without being tied to a single data modality.

The paper's results suggest that FCLGO could be a valuable tool for real-world machine learning applications that require continuous learning and adaptation to evolving data. As federated learning continues to grow in importance, methods like FCLGO that handle modality-agnostic, class-incremental learning in an online setting are likely to matter more for building robust and adaptable AI systems.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Variational Bayes for Federated Continual Learning

Dezhong Yao, Sanmu Li, Yutong Dai, Zhiqiang Xu, Shengshan Hu, Peilin Zhao, Lichao Sun

Federated continual learning (FCL) has received increasing attention due to its potential in handling real-world streaming data, characterized by evolving data distributions and varying client classes over time. The constraints of storage limitations and privacy concerns confine local models to exclusively access the present data within each learning cycle. Consequently, this restriction induces performance degradation in model training on previous data, termed catastrophic forgetting. However, existing FCL approaches need to identify or know changes in data distribution, which is difficult in the real world. To release these limitations, this paper directs attention to a broader continuous framework. Within this framework, we introduce Federated Bayesian Neural Network (FedBNN), a versatile and efficacious framework employing a variational Bayesian neural network across all clients. Our method continually integrates knowledge from local and historical data distributions into a single model, adeptly learning from new data distributions while retaining performance on historical distributions. We rigorously evaluate FedBNN's performance against prevalent methods in federated learning and continual learning using various metrics. Experimental analyses across diverse datasets demonstrate that FedBNN achieves state-of-the-art results in mitigating forgetting.

5/24/2024

cs.LG cs.AI cs.DC

Reducing Bias in Federated Class-Incremental Learning with Hierarchical Generative Prototypes

Riccardo Salami, Pietro Buzzega, Matteo Mosconi, Mattia Verasani, Simone Calderara

Federated Learning (FL) aims at unburdening the training of deep models by distributing computation across multiple devices (clients) while safeguarding data privacy. On top of that, Federated Continual Learning (FCL) also accounts for data distribution evolving over time, mirroring the dynamic nature of real-world environments. In this work, we shed light on the Incremental and Federated biases that naturally emerge in FCL. While the former is a known problem in Continual Learning, stemming from the prioritization of recently introduced classes, the latter (i.e., the bias towards local distributions) remains relatively unexplored. Our proposal constrains both biases in the last layer by efficiently fine-tuning a pre-trained backbone using learnable prompts, resulting in clients that produce less biased representations and more biased classifiers. Therefore, instead of solely relying on parameter aggregation, we also leverage generative prototypes to effectively balance the predictions of the global model. Our method improves on the current State Of The Art, providing an average increase of +7.9% in accuracy.

6/5/2024

cs.LG

Controlling Forgetting with Test-Time Data in Continual Learning

Vaibhav Singh, Rahaf Aljundi, Eugene Belilovsky

Foundational vision-language models have shown impressive performance on various downstream tasks. Yet, there is still a pressing need to update these models later as new tasks or domains become available. Ongoing Continual Learning (CL) research provides techniques to overcome catastrophic forgetting of previous information when new knowledge is acquired. To date, CL techniques focus only on the supervised training sessions. This results in significant forgetting yielding inferior performance to even the prior model zero shot performance. In this work, we argue that test-time data hold great information that can be leveraged in a self supervised manner to refresh the model's memory of previous learned tasks and hence greatly reduce forgetting at no extra labelling cost. We study how unsupervised data can be employed online to improve models' performance on prior tasks upon encountering representative samples. We propose a simple yet effective student-teacher model with gradient based sparse parameters updates and show significant performance improvements and reduction in forgetting, which could alleviate the role of an offline episodic memory/experience replay buffer.

6/21/2024

cs.LG

FedMeS: Personalized Federated Continual Learning Leveraging Local Memory

Jin Xie, Chenqing Zhu, Songze Li

We focus on the problem of Personalized Federated Continual Learning (PFCL): a group of distributed clients, each with a sequence of local tasks on arbitrary data distributions, collaborate through a central server to train a personalized model at each client, with the model expected to achieve good performance on all local tasks. We propose a novel PFCL framework called Federated Memory Strengthening (FedMeS) to address the challenges of client drift and catastrophic forgetting. In FedMeS, each client stores samples from previous tasks using a small amount of local memory, and leverages this information to both 1) calibrate gradient updates in training process; and 2) perform KNN-based Gaussian inference to facilitate personalization. FedMeS is designed to be task-oblivious, such that the same inference process is applied to samples from all tasks to achieve good performance. FedMeS is analyzed theoretically and evaluated experimentally. It is shown to outperform all baselines in average accuracy and forgetting rate, over various combinations of datasets, task distributions, and client numbers.

4/22/2024

cs.LG