INCPrompt: Task-Aware incremental Prompting for Rehearsal-Free Class-incremental Learning

Read original: arXiv:2401.11667 - Published 4/9/2024 by Zhiyuan Wang, Xiaoyang Qu, Jing Xiao, Bokui Chen, Jianzong Wang

INCPrompt: Task-Aware incremental Prompting for Rehearsal-Free Class-incremental Learning

Overview

The paper proposes a method called "INCPrompt" for class-incremental learning using task-aware prompting without the need for rehearsal.
Class-incremental learning is the ability to learn new classes without forgetting previously learned ones.
INCPrompt uses prompts to condition the language model on the current task, allowing it to learn new classes without interfering with old ones.
The approach is "rehearsal-free", meaning it doesn't require storing or replaying old data to prevent forgetting.

Plain English Explanation

The paper introduces a new technique called "INCPrompt" that helps AI models learn new skills without forgetting old ones. This is an important problem in machine learning called "class-incremental learning".

Imagine an AI assistant that starts by learning to recognize 10 different objects. Later, you want to expand its abilities to recognize 10 more objects. The challenge is that as the AI learns the new objects, it tends to forget how to recognize the original 10. [See Convolutional Prompting Meets Language Models for Continual Learning for more on this problem.]

The INCPrompt method solves this by using "prompts" - short text descriptions that condition the AI model on the current task. When learning new objects, the model gets a prompt that focuses it on the new task, preventing it from forgetting the old one. [Prompts are also explored in Deconstructing Context: Learning Understanding Prompts via Corruption.]

Importantly, this "rehearsal-free" approach doesn't require storing or replaying old data, which can be computationally expensive. It allows the AI to continually expand its skills without getting bogged down.

Technical Explanation

The paper frames class-incremental learning as a problem where an AI model must learn new classes of data (e.g. new object categories) without forgetting previously learned classes. [See Class-Incremental Few-Shot Event Detection for more on this setting.]

The key innovation of INCPrompt is the use of task-aware prompting. Rather than training the model directly on the new data, INCPrompt provides the model with a prompt that conditions it on the current task. This allows the model to learn new classes without interfering with its knowledge of old classes.

Experiments show that INCPrompt outperforms standard fine-tuning approaches, which tend to suffer from catastrophic forgetting - the rapid decline in performance on old tasks as new tasks are learned. [Continual learning techniques like Weight Interpolation have also been proposed to address this issue.]

The rehearsal-free nature of INCPrompt is an advantage, as it avoids the need to store and replay old data, which can be computationally expensive. This makes the approach more practical for real-world applications with dynamic data streams.

Critical Analysis

The paper provides a thorough empirical evaluation of INCPrompt, demonstrating its effectiveness on several benchmark datasets. However, the authors acknowledge that the method relies on having access to task descriptors or prompts, which may not always be available in practice.

Additionally, the experiments are limited to image classification tasks. It would be valuable to see how INCPrompt performs on a wider range of problems, such as language understanding or reinforcement learning tasks. [Techniques for Convergence in Continual Learning with Adaptive Methods may provide insights for expanding the approach.]

Overall, the INCPrompt method represents an interesting and promising direction for addressing the challenge of class-incremental learning. Further research is needed to explore the broader applicability of the approach and potential ways to relax the requirement for task-specific prompts.

Conclusion

The INCPrompt method introduces a novel approach to class-incremental learning that uses task-aware prompting to enable models to learn new skills without forgetting old ones. By conditioning the model on the current task, INCPrompt avoids the need for expensive data rehearsal, making it a more practical solution for real-world applications.

While the current evaluation is limited, the paper demonstrates the potential of prompt-based techniques for continual learning. Continued research in this direction could lead to AI systems that can continuously expand their capabilities without sacrificing their previous knowledge.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

INCPrompt: Task-Aware incremental Prompting for Rehearsal-Free Class-incremental Learning

Zhiyuan Wang, Xiaoyang Qu, Jing Xiao, Bokui Chen, Jianzong Wang

This paper introduces INCPrompt, an innovative continual learning solution that effectively addresses catastrophic forgetting. INCPrompt's key innovation lies in its use of adaptive key-learner and task-aware prompts that capture task-relevant information. This unique combination encapsulates general knowledge across tasks and encodes task-specific knowledge. Our comprehensive evaluation across multiple continual learning benchmarks demonstrates INCPrompt's superiority over existing algorithms, showing its effectiveness in mitigating catastrophic forgetting while maintaining high performance. These results highlight the significant impact of task-aware incremental prompting on continual learning performance.

4/9/2024

CP-Prompt: Composition-Based Cross-modal Prompting for Domain-Incremental Continual Learning

Yu Feng, Zhen Tian, Yifan Zhu, Zongfu Han, Haoran Luo, Guangwei Zhang, Meina Song

The key challenge of cross-modal domain-incremental learning (DIL) is to enable the learning model to continuously learn from novel data with different feature distributions under the same task without forgetting old ones. However, existing top-performing methods still cause high forgetting rates, by lacking intra-domain knowledge extraction and inter-domain common prompting strategy. In this paper, we propose a simple yet effective framework, CP-Prompt, by training limited parameters to instruct a pre-trained model to learn new domains and avoid forgetting existing feature distributions. CP-Prompt captures intra-domain knowledge by compositionally inserting personalized prompts on multi-head self-attention layers and then learns the inter-domain knowledge with a common prompting strategy. CP-Prompt shows superiority compared with state-of-the-art baselines among three widely evaluated DIL tasks. The source code is available at https://github.com/dannis97500/CP_Prompt.

8/6/2024

PECTP: Parameter-Efficient Cross-Task Prompts for Incremental Vision Transformer

Qian Feng, Hanbin Zhao, Chao Zhang, Jiahua Dong, Henghui Ding, Yu-Gang Jiang, Hui Qian

Incremental Learning (IL) aims to learn deep models on sequential tasks continually, where each new task includes a batch of new classes and deep models have no access to task-ID information at the inference time. Recent vast pre-trained models (PTMs) have achieved outstanding performance by prompt technique in practical IL without the old samples (rehearsal-free) and with a memory constraint (memory-constrained): Prompt-extending and Prompt-fixed methods. However, prompt-extending methods need a large memory buffer to maintain an ever-expanding prompt pool and meet an extra challenging prompt selection problem. Prompt-fixed methods only learn a single set of prompts on one of the incremental tasks and can not handle all the incremental tasks effectively. To achieve a good balance between the memory cost and the performance on all the tasks, we propose a Parameter-Efficient Cross-Task Prompt (PECTP) framework with Prompt Retention Module (PRM) and classifier Head Retention Module (HRM). To make the final learned prompts effective on all incremental tasks, PRM constrains the evolution of cross-task prompts' parameters from Outer Prompt Granularity and Inner Prompt Granularity. Besides, we employ HRM to inherit old knowledge in the previously learned classifier heads to facilitate the cross-task prompts' generalization ability. Extensive experiments show the effectiveness of our method. The source codes will be available at url{https://github.com/RAIAN08/PECTP}.

7/8/2024

Convolutional Prompting meets Language Models for Continual Learning

Anurag Roy, Riddhiman Moulick, Vinay K. Verma, Saptarshi Ghosh, Abir Das

Continual Learning (CL) enables machine learning models to learn from continuously shifting new training data in absence of data from old tasks. Recently, pretrained vision transformers combined with prompt tuning have shown promise for overcoming catastrophic forgetting in CL. These approaches rely on a pool of learnable prompts which can be inefficient in sharing knowledge across tasks leading to inferior performance. In addition, the lack of fine-grained layer specific prompts does not allow these to fully express the strength of the prompts for CL. We address these limitations by proposing ConvPrompt, a novel convolutional prompt creation mechanism that maintains layer-wise shared embeddings, enabling both layer-specific learning and better concept transfer across tasks. The intelligent use of convolution enables us to maintain a low parameter overhead without compromising performance. We further leverage Large Language Models to generate fine-grained text descriptions of each category which are used to get task similarity and dynamically decide the number of prompts to be learned. Extensive experiments demonstrate the superiority of ConvPrompt and improves SOTA by ~3% with significantly less parameter overhead. We also perform strong ablation over various modules to disentangle the importance of different components.

4/1/2024