Expanding continual few-shot learning benchmarks to include recognition of specific instances

Read original: arXiv:2209.07863 - Published 7/10/2024 by Gideon Kowadlo, Abdelrahman Ahmed, Amir Mayan, David Rawlinson

👁️

Overview

This paper explores the combination of continual learning and few-shot learning, two important frontiers in advancing machine learning capabilities.
The researchers extended a previous framework called Continual Few-Shot Learning (CFSL) in two key ways:
1. Increasing the number of classes by an order of magnitude to make the results more comparable to standard continual learning experiments.
2. Introducing an "instance test" that requires recognizing specific instances of classes, a capability important for animal cognition that is often overlooked in machine learning.

Plain English Explanation

The paper focuses on two important areas of machine learning research: continual learning and few-shot learning. Continual learning is about training AI systems to continuously learn new information without forgetting what they've learned before. Few-shot learning is about training AI systems to learn new tasks or concepts from just a few examples, like how humans can quickly learn new things.

The researchers took an existing framework that combined these two ideas, called Continual Few-Shot Learning (CFSL), and made two key changes to it. First, they increased the number of classes the AI system had to learn by a lot, to make the experiments more realistic compared to real-world continual learning challenges. Second, they added a new "instance test" that required the AI to recognize specific examples of each class, not just the classes themselves. This is an important capability for intelligent agents to have, like how humans can recognize specific individuals.

Technical Explanation

The researchers selected representative baseline models from the original CFSL work and added a model variant that used replay for consolidation. As expected, learning more classes was more difficult than the original CFSL experiments. Interestingly, the way the image instances and classes were presented affected the AI's classification performance.

Surprisingly, the baseline models performed comparably on the instance test as they did on other classification tasks. However, their performance was poor when the images had significant occlusion and noise. Using a replay mechanism, where the AI system remembers and revisits past examples, substantially improved performance on both the class-level and instance-level tasks, especially for the instance test.

Critical Analysis

The paper demonstrates the value of combining continual learning and few-shot learning, but also highlights the challenges of scaling these techniques to more realistic scenarios with a larger number of classes. The instance test introduced an important new capability that is often overlooked in machine learning, but the baseline models struggled with it, especially in more difficult visual conditions.

While the replay mechanism helped improve performance, the paper does not explore other potential techniques for addressing the instance recognition task, such as few-shot class-incremental learning or multi-label continual learning. Further research could investigate alternative approaches to this problem.

Conclusion

This paper extends the Continual Few-Shot Learning framework in two important ways, increasing the number of classes and introducing an instance recognition test. The results show that while continual and few-shot learning can be combined, scaling these techniques to more realistic scenarios with larger numbers of classes and more challenging visual conditions remains an open challenge. The use of replay mechanisms can help, but further research is needed to develop more robust solutions for these types of continual and few-shot learning problems.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

👁️

Expanding continual few-shot learning benchmarks to include recognition of specific instances

Gideon Kowadlo, Abdelrahman Ahmed, Amir Mayan, David Rawlinson

Continual learning and few-shot learning are important frontiers in progress toward broader Machine Learning (ML) capabilities. Recently, there has been intense interest in combining both. One of the first examples to do so was the Continual few-shot Learning (CFSL) framework of Antoniou et al. arXiv:2004.11967. In this study, we extend CFSL in two ways that capture a broader range of challenges, important for intelligent agent behaviour in real-world conditions. First, we increased the number of classes by an order of magnitude, making the results more comparable to standard continual learning experiments. Second, we introduced an 'instance test' which requires recognition of specific instances of classes -- a capability of animal cognition that is usually neglected in ML. For an initial exploration of ML model performance under these conditions, we selected representative baseline models from the original CFSL work and added a model variant with replay. As expected, learning more classes is more difficult than the original CFSL experiments, and interestingly, the way in which image instances and classes are presented affects classification performance. Surprisingly, accuracy in the baseline instance test is comparable to other classification tasks, but poor given significant occlusion and noise. The use of replay for consolidation substantially improves performance for both types of tasks, but particularly for the instance test.

7/10/2024

Continual Learning in the Presence of Repetition

Hamed Hemati, Lorenzo Pellegrini, Xiaotian Duan, Zixuan Zhao, Fangfang Xia, Marc Masana, Benedikt Tscheschner, Eduardo Veas, Yuxiang Zheng, Shiji Zhao, Shao-Yuan Li, Sheng-Jun Huang, Vincenzo Lomonaco, Gido M. van de Ven

Continual learning (CL) provides a framework for training models in ever-evolving environments. Although re-occurrence of previously seen objects or tasks is common in real-world problems, the concept of repetition in the data stream is not often considered in standard benchmarks for CL. Unlike with the rehearsal mechanism in buffer-based strategies, where sample repetition is controlled by the strategy, repetition in the data stream naturally stems from the environment. This report provides a summary of the CLVision challenge at CVPR 2023, which focused on the topic of repetition in class-incremental learning. The report initially outlines the challenge objective and then describes three solutions proposed by finalist teams that aim to effectively exploit the repetition in the stream to learn continually. The experimental results from the challenge highlight the effectiveness of ensemble-based solutions that employ multiple versions of similar modules, each trained on different but overlapping subsets of classes. This report underscores the transformative potential of taking a different perspective in CL by employing repetition in the data stream to foster innovative strategy design.

5/8/2024

🚀

Learning to Learn for Few-shot Continual Active Learning

Stella Ho, Ming Liu, Shang Gao, Longxiang Gao

Continual learning strives to ensure stability in solving previously seen tasks while demonstrating plasticity in a novel domain. Recent advances in continual learning are mostly confined to a supervised learning setting, especially in NLP domain. In this work, we consider a few-shot continual active learning setting where labeled data are inadequate, and unlabeled data are abundant but with a limited annotation budget. We exploit meta-learning and propose a method, called Meta-Continual Active Learning. This method sequentially queries the most informative examples from a pool of unlabeled data for annotation to enhance task-specific performance and tackle continual learning problems through meta-objective. Specifically, we employ meta-learning and experience replay to address inter-task confusion and catastrophic forgetting. We further incorporate textual augmentations to avoid memory over-fitting caused by experience replay and sample queries, thereby ensuring generalization. We conduct extensive experiments on benchmark text classification datasets from diverse domains to validate the feasibility and effectiveness of meta-continual active learning. We also analyze the impact of different active learning strategies on various meta continual learning models. The experimental results demonstrate that introducing randomness into sample selection is the best default strategy for maintaining generalization in meta-continual learning framework.

6/3/2024

🏋️

Multi-Label Continual Learning for the Medical Domain: A Novel Benchmark

Marina Ceccon, Davide Dalle Pezze, Alessandro Fabris, Gian Antonio Susto

Despite the critical importance of the medical domain in Deep Learning, most of the research in this area solely focuses on training models in static environments. It is only in recent years that research has begun to address dynamic environments and tackle the Catastrophic Forgetting problem through Continual Learning (CL) techniques. Previous studies have primarily focused on scenarios such as Domain Incremental Learning and Class Incremental Learning, which do not fully capture the complexity of real-world applications. Therefore, in this work, we propose a novel benchmark combining the challenges of new class arrivals and domain shifts in a single framework, by considering the New Instances and New Classes (NIC) scenario. This benchmark aims to model a realistic CL setting for the multi-label classification problem in medical imaging. Additionally, it encompasses a greater number of tasks compared to previously tested scenarios. Specifically, our benchmark consists of two datasets (NIH and CXP), nineteen classes, and seven tasks, a stream longer than the previously tested ones. To solve common challenges (e.g., the task inference problem) found in the CIL and NIC scenarios, we propose a novel approach called Replay Consolidation with Label Propagation (RCLP). Our method surpasses existing approaches, exhibiting superior performance with minimal forgetting.

7/19/2024