Out-of-distribution forgetting: vulnerability of continual learning to intra-class distribution shift

Read original: arXiv:2306.00427 - Published 7/8/2024 by Liangxuan Guo, Yang Chen, Shan Yu

🔍

Overview

Continual learning (CL) is a technique that allows artificial neural networks to learn new tasks without severely impacting their performance on old tasks, overcoming the problem of catastrophic forgetting.
In joint learning, the out-of-distribution (OOD) problem caused by attacks or environmental changes can severely impair a network's ability to generalize.
This paper identifies a special form of catastrophic forgetting caused by the OOD problem in CL settings, dubbed "out-of-distribution forgetting" (OODF).

Plain English Explanation

Continual learning is a way for AI systems to keep learning new things without completely forgetting what they already know. This is important because it allows AI to work in the real world, where the data and tasks are always changing.

The researchers found that when the distribution of data changes in a particular category, continual learning methods have trouble recognizing that category later on. This happens even when the same level of change doesn't cause problems for a system that learns everything at once.

The researchers call this "out-of-distribution forgetting" (OODF), and they show that it's a risk for all continual learning methods that don't dedicate separate parts of the network to each task. OODF can happen in many different situations, so it's an important issue for continual learning to address.

Technical Explanation

The paper investigates a special form of catastrophic forgetting that occurs in continual learning settings due to the out-of-distribution (OOD) problem. They find that introducing an intra-class distribution shift, even a small one, can significantly impair a continual learning method's ability to recognize that class in subsequent learning, a phenomenon they call "out-of-distribution forgetting" (OODF).

Interestingly, the same level of distribution shift has only negligible effects in a joint learning scenario, suggesting OODF is a unique challenge for continual learning. The researchers verify that continual learning methods without dedicated subnetworks for individual tasks are all vulnerable to OODF, and that OODF is not dependent on the specific way the distribution is shifted, making it a broad risk factor.

Critical Analysis

The paper provides a thorough investigation of a previously unidentified issue in continual learning - the risk of out-of-distribution forgetting. While the researchers demonstrate the phenomenon across multiple continual learning methods, they do not propose a specific solution to overcome OODF.

Additionally, the paper focuses on image classification tasks, so it's unclear if OODF would manifest similarly in other domains like language modeling or reinforcement learning. Further research is needed to understand the broader applicability of OODF and develop effective countermeasures.

Conclusion

This paper uncovers a significant challenge in continual learning - the risk of out-of-distribution forgetting. By identifying OODF as a unique problem for continual learning systems, the researchers highlight the importance of developing approaches that can overcome this issue. Addressing OODF will be crucial for building robust and adaptable AI systems that can learn continuously in the real world.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🔍

Out-of-distribution forgetting: vulnerability of continual learning to intra-class distribution shift

Liangxuan Guo, Yang Chen, Shan Yu

Continual learning (CL) is an important technique to allow artificial neural networks to work in open environments. CL enables a system to learn new tasks without severe interference to its performance on old tasks, i.e., overcome the problems of catastrophic forgetting. In joint learning, it is well known that the out-of-distribution (OOD) problem caused by intentional attacks or environmental perturbations will severely impair the ability of networks to generalize. In this work, we reported a special form of catastrophic forgetting raised by the OOD problem in continual learning settings, and we named it out-of-distribution forgetting (OODF). In continual image classification tasks, we found that for a given category, introducing an intra-class distribution shift significantly impaired the recognition accuracy of CL methods for that category during subsequent learning. Interestingly, this phenomenon is special for CL as the same level of distribution shift had only negligible effects in the joint learning scenario. We verified that CL methods without dedicating subnetworks for individual tasks are all vulnerable to OODF. Moreover, OODF does not depend on any specific way of shifting the distribution, suggesting it is a risk for CL in a wide range of circumstances. Taken together, our work identified an under-attended risk during CL, highlighting the importance of developing approaches that can overcome OODF. Code available: url{https://github.com/Hiroid/OODF}

7/8/2024

Continual Unsupervised Out-of-Distribution Detection

Lars Doorenbos, Raphael Sznitman, Pablo M'arquez-Neila

Deep learning models excel when the data distribution during training aligns with testing data. Yet, their performance diminishes when faced with out-of-distribution (OOD) samples, leading to great interest in the field of OOD detection. Current approaches typically assume that OOD samples originate from an unconcentrated distribution complementary to the training distribution. While this assumption is appropriate in the traditional unsupervised OOD (U-OOD) setting, it proves inadequate when considering the place of deployment of the underlying deep learning model. To better reflect this real-world scenario, we introduce the novel setting of continual U-OOD detection. To tackle this new setting, we propose a method that starts from a U-OOD detector, which is agnostic to the OOD distribution, and slowly updates during deployment to account for the actual OOD distribution. Our method uses a new U-OOD scoring function that combines the Mahalanobis distance with a nearest-neighbor approach. Furthermore, we design a confidence-scaled few-shot OOD detector that outperforms previous methods. We show our method greatly improves upon strong baselines from related fields.

6/5/2024

Out-of-Distribution Learning with Human Feedback

Haoyue Bai, Xuefeng Du, Katie Rainey, Shibin Parameswaran, Yixuan Li

Out-of-distribution (OOD) learning often relies heavily on statistical approaches or predefined assumptions about OOD data distributions, hindering their efficacy in addressing multifaceted challenges of OOD generalization and OOD detection in real-world deployment environments. This paper presents a novel framework for OOD learning with human feedback, which can provide invaluable insights into the nature of OOD shifts and guide effective model adaptation. Our framework capitalizes on the freely available unlabeled data in the wild that captures the environmental test-time OOD distributions under both covariate and semantic shifts. To harness such data, our key idea is to selectively provide human feedback and label a small number of informative samples from the wild data distribution, which are then used to train a multi-class classifier and an OOD detector. By exploiting human feedback, we enhance the robustness and reliability of machine learning models, equipping them with the capability to handle OOD scenarios with greater precision. We provide theoretical insights on the generalization error bounds to justify our algorithm. Extensive experiments show the superiority of our method, outperforming the current state-of-the-art by a significant margin.

8/16/2024

Overcoming Domain Drift in Online Continual Learning

Fan Lyu, Daofeng Liu, Linglan Zhao, Zhang Zhang, Fanhua Shang, Fuyuan Hu, Wei Feng, Liang Wang

Online Continual Learning (OCL) empowers machine learning models to acquire new knowledge online across a sequence of tasks. However, OCL faces a significant challenge: catastrophic forgetting, wherein the model learned in previous tasks is substantially overwritten upon encountering new tasks, leading to a biased forgetting of prior knowledge. Moreover, the continual doman drift in sequential learning tasks may entail the gradual displacement of the decision boundaries in the learned feature space, rendering the learned knowledge susceptible to forgetting. To address the above problem, in this paper, we propose a novel rehearsal strategy, termed Drift-Reducing Rehearsal (DRR), to anchor the domain of old tasks and reduce the negative transfer effects. First, we propose to select memory for more representative samples guided by constructed centroids in a data stream. Then, to keep the model from domain chaos in drifting, a two-level angular cross-task Contrastive Margin Loss (CML) is proposed, to encourage the intra-class and intra-task compactness, and increase the inter-class and inter-task discrepancy. Finally, to further suppress the continual domain drift, we present an optional Centorid Distillation Loss (CDL) on the rehearsal memory to anchor the knowledge in feature space for each previous old task. Extensive experimental results on four benchmark datasets validate that the proposed DRR can effectively mitigate the continual domain drift and achieve the state-of-the-art (SOTA) performance in OCL.

5/16/2024