Toward industrial use of continual learning : new metrics proposal for class incremental learning

2404.06972

Published 4/11/2024 by Konat'e Mohamed Abbas, Anne-Franc{c}oise Yao, Thierry Chateau, Pierre Bouges

Toward industrial use of continual learning : new metrics proposal for class incremental learning

Abstract

In this paper, we investigate continual learning performance metrics used in class incremental learning strategies for continual learning (CL) using some high performing methods. We investigate especially mean task accuracy. First, we show that it lacks of expressiveness through some simple experiments to capture performance. We show that monitoring average tasks performance is over optimistic and can lead to misleading conclusions for future real life industrial uses. Then, we propose first a simple metric, Minimal Incremental Class Accuracy (MICA) which gives a fair and more useful evaluation of different continual learning methods. Moreover, in order to provide a simple way to easily compare different methods performance in continual learning, we derive another single scalar metric that take into account the learning performance variation as well as our newly introduced metric.

Create account to get full access

Overview

This paper proposes new metrics for evaluating class incremental learning, a type of continual learning where AI models learn new classes without forgetting previous ones.
The authors argue that current metrics do not adequately capture performance and risk in an industrial setting, and they introduce novel measures focused on quality, robustness, and fairness.
The proposed metrics are designed to help drive the adoption of continual learning techniques in real-world applications where maintaining high and consistent performance is crucial.

Plain English Explanation

The paper looks at a challenge in the field of machine learning called "continual learning." This is where an AI model learns new skills or information over time, without forgetting what it previously learned.

Specifically, the paper focuses on a type of continual learning called "class incremental learning." In this approach, the model learns to recognize new classes or categories of objects, while still being able to accurately identify the original classes it was trained on.

The authors argue that the current ways of measuring the performance of these continual learning models are not sufficient for real-world industrial applications. In a factory or business setting, you don't just care about the overall accuracy - you also want the model to be reliable, consistent, and fair across all the different classes it has learned.

To address this, the paper proposes some new metrics or ways of evaluating continual learning models. These new measures look at factors like:

Quality: How well does the model perform on new classes without degrading on old classes?
Robustness: How stable and reliable is the model's performance over time?
Fairness: Is the model treating all the different classes it has learned equally well, or are some being neglected?

The goal is to create evaluation methods that are better suited for deploying continual learning models in real industrial applications, where maintaining high and reliable performance across many different tasks is crucial. By using these new metrics, the authors hope to drive wider adoption of continual learning techniques in practical, high-stakes settings.

Technical Explanation

The paper introduces a new framework for evaluating class incremental learning (CIL) algorithms, which are a type of continual learning where AI models learn new classes without forgetting previously learned ones.

The authors argue that existing CIL metrics, such as average accuracy or forgetting, do not adequately capture the performance and risk characteristics required for industrial deployment. To address this, they propose three new metrics:

Quality: Measures the model's performance on new classes without degrading on old classes, capturing the trade-off between learning new skills and retaining previous knowledge.
Robustness: Evaluates the stability and consistency of the model's performance over time, as it learns new classes incrementally.
Fairness: Assesses whether the model treats all learned classes equally well, or if some classes are neglected in favor of others.

These metrics are designed to better reflect the needs of real-world applications, where maintaining high and reliable performance across numerous tasks is crucial.

The authors evaluate their proposed metrics on several benchmark CIL datasets and compare them to existing evaluation approaches. They find that the new metrics provide a more nuanced and comprehensive view of model performance, highlighting important trade-offs and failure modes that are overlooked by traditional metrics.

Critical Analysis

The authors make a compelling case for the need to rethink continual learning evaluation beyond just accuracy-based measures. Their proposed metrics of quality, robustness, and fairness are a step in the right direction for assessing the suitability of CIL models for real-world industrial applications.

However, the paper does not delve into the potential limitations or caveats of these new metrics. For example, it's unclear how to balance the different metrics when they may be in tension with each other, or how to interpret the trade-offs between them.

Additionally, the authors do not address the challenges of learning from long-tailed distributions or the impact of adaptive optimization methods on CIL performance, which could significantly influence the proposed metrics.

Further research is needed to understand the convergence properties of CIL algorithms under these new evaluation criteria and how they can be effectively optimized. Nonetheless, this paper provides a valuable contribution by highlighting the need to go beyond accuracy when assessing the suitability of continual learning models for industrial use.

Conclusion

This paper proposes a new framework for evaluating class incremental learning (CIL) algorithms, introducing three novel metrics focused on quality, robustness, and fairness. The authors argue that these measures are better suited for assessing the performance and risk characteristics required for the industrial deployment of continual learning models.

By shifting the evaluation focus beyond just accuracy, the proposed metrics aim to drive the adoption of CIL techniques in real-world applications where maintaining high and consistent performance across many different tasks is crucial. While further research is needed to address the potential limitations and practical implications of these new measures, this work represents an important step towards developing continual learning systems that are truly fit for industrial use.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Provable Contrastive Continual Learning

Yichen Wen, Zhiquan Tan, Kaipeng Zheng, Chuanlong Xie, Weiran Huang

Continual learning requires learning incremental tasks with dynamic data distributions. So far, it has been observed that employing a combination of contrastive loss and distillation loss for training in continual learning yields strong performance. To the best of our knowledge, however, this contrastive continual learning framework lacks convincing theoretical explanations. In this work, we fill this gap by establishing theoretical performance guarantees, which reveal how the performance of the model is bounded by training losses of previous tasks in the contrastive continual learning framework. Our theoretical explanations further support the idea that pre-training can benefit continual learning. Inspired by our theoretical analysis of these guarantees, we propose a novel contrastive continual learning algorithm called CILA, which uses adaptive distillation coefficients for different tasks. These distillation coefficients are easily computed by the ratio between average distillation losses and average contrastive losses from previous tasks. Our method shows great improvement on standard benchmarks and achieves new state-of-the-art performance.

5/30/2024

cs.LG cs.AI cs.CV stat.ML

Bayesian Learning-driven Prototypical Contrastive Loss for Class-Incremental Learning

Nisha L. Raichur, Lucas Heublein, Tobias Feigl, Alexander Rugamer, Christopher Mutschler, Felix Ott

The primary objective of methods in continual learning is to learn tasks in a sequential manner over time from a stream of data, while mitigating the detrimental phenomenon of catastrophic forgetting. In this paper, we focus on learning an optimal representation between previous class prototypes and newly encountered ones. We propose a prototypical network with a Bayesian learning-driven contrastive loss (BLCL) tailored specifically for class-incremental learning scenarios. Therefore, we introduce a contrastive loss that incorporates new classes into the latent representation by reducing the intra-class distance and increasing the inter-class distance. Our approach dynamically adapts the balance between the cross-entropy and contrastive loss functions with a Bayesian learning technique. Empirical evaluations conducted on both the CIFAR-10 dataset for image classification and images of a GNSS-based dataset for interference classification validate the efficacy of our method, showcasing its superiority over existing state-of-the-art approaches.

5/21/2024

cs.CV cs.AI

Improving Data-aware and Parameter-aware Robustness for Continual Learning

Hanxi Xiao, Fan Lyu

The goal of Continual Learning (CL) task is to continuously learn multiple new tasks sequentially while achieving a balance between the plasticity and stability of new and old knowledge. This paper analyzes that this insufficiency arises from the ineffective handling of outliers, leading to abnormal gradients and unexpected model updates. To address this issue, we enhance the data-aware and parameter-aware robustness of CL, proposing a Robust Continual Learning (RCL) method. From the data perspective, we develop a contrastive loss based on the concepts of uniformity and alignment, forming a feature distribution that is more applicable to outliers. From the parameter perspective, we present a forward strategy for worst-case perturbation and apply robust gradient projection to the parameters. The experimental results on three benchmarks show that the proposed method effectively maintains robustness and achieves new state-of-the-art (SOTA) results. The code is available at: https://github.com/HanxiXiao/RCL

5/28/2024

cs.LG

Calibration of Continual Learning Models

Lanpei Li, Elia Piccoli, Andrea Cossu, Davide Bacciu, Vincenzo Lomonaco

Continual Learning (CL) focuses on maximizing the predictive performance of a model across a non-stationary stream of data. Unfortunately, CL models tend to forget previous knowledge, thus often underperforming when compared with an offline model trained jointly on the entire data stream. Given that any CL model will eventually make mistakes, it is of crucial importance to build calibrated CL models: models that can reliably tell their confidence when making a prediction. Model calibration is an active research topic in machine learning, yet to be properly investigated in CL. We provide the first empirical study of the behavior of calibration approaches in CL, showing that CL strategies do not inherently learn calibrated models. To mitigate this issue, we design a continual calibration approach that improves the performance of post-processing calibration methods over a wide range of different benchmarks and CL strategies. CL does not necessarily need perfect predictive models, but rather it can benefit from reliable predictive models. We believe our study on continual calibration represents a first step towards this direction.

4/15/2024

cs.LG cs.AI