Evolutionary Generalized Zero-Shot Learning

2211.13174

Published 5/14/2024 by Dubing Chen, Chenyi Jiang, Haofeng Zhang

↗️

Abstract

Attribute-based Zero-Shot Learning (ZSL) has revolutionized the ability of models to recognize new classes not seen during training. However, with the advancement of large-scale models, the expectations have risen. Beyond merely achieving zero-shot generalization, there is a growing demand for universal models that can continually evolve in expert domains using unlabeled data. To address this, we introduce a scaled-down instantiation of this challenge: Evolutionary Generalized Zero-Shot Learning (EGZSL). This setting allows a low-performing zero-shot model to adapt to the test data stream and evolve online. We elaborate on three challenges of this special task, ie, catastrophic forgetting, initial prediction bias, and evolutionary data class bias. Moreover, we propose targeted solutions for each challenge, resulting in a generic method capable of continuous evolution from a given initial IGZSL model. Experiments on three popular GZSL benchmark datasets demonstrate that our model can learn from the test data stream while other baselines fail. Codes are available at url{https://github.com/cdb342/EGZSL}.

Create account to get full access

Overview

Introduces a new challenge called Evolutionary Generalized Zero-Shot Learning (EGZSL)
EGZSL aims to enable low-performing zero-shot models to continuously adapt and evolve using unlabeled test data
Identifies three key challenges: catastrophic forgetting, initial prediction bias, and evolutionary data class bias
Proposes targeted solutions to address each challenge and develop a generic method for continuous evolution from an initial zero-shot model

Plain English Explanation

Zero-shot learning

(link: https://aimodels.fyi/papers/arxiv/eyes-hawk-ears-fox-part-prototype-network)

has made it possible for models to recognize new classes that they weren't trained on. However, as large-scale models have become more advanced, the expectations have grown. Now, there's a demand for universal models that can continuously learn and improve in specialized domains using unlabeled data.

To address this, the researchers introduced a scaled-down version of this challenge, called Evolutionary Generalized Zero-Shot Learning (EGZSL). EGZSL allows a low-performing zero-shot model to adapt to new test data and evolve over time. The key challenges they identified are:

Catastrophic forgetting: The model might forget what it has learned previously as it adapts to new data.
Initial prediction bias: The model's initial predictions on the test data might be biased.
Evolutionary data class bias: The distribution of classes in the test data might change over time, leading to biases.

The researchers proposed solutions to each of these challenges, resulting in a method that can continuously evolve a zero-shot model using unlabeled test data. They tested their approach on benchmark datasets and found that it outperformed other baselines that couldn't adapt as well.

Technical Explanation

The researchers introduce a new task called Evolutionary Generalized Zero-Shot Learning (EGZSL), which aims to enable low-performing zero-shot models

(link: https://aimodels.fyi/papers/arxiv/multi-method-integration-confidence-based-weighting-zero)

to continuously adapt and evolve using unlabeled test data. This is in contrast to traditional zero-shot learning, which focuses on achieving generalization to new classes during the initial training phase.

The researchers identify three key challenges in EGZSL:

Catastrophic forgetting: The model might forget what it has learned previously as it adapts to new data.
Initial prediction bias: The model's initial predictions on the test data might be biased.
Evolutionary data class bias: The distribution of classes in the test data might change over time, leading to biases.

To address these challenges, the researchers propose targeted solutions:

To mitigate catastrophic forgetting, they use a knowledge distillation approach to preserve the model's previous knowledge.
To tackle initial prediction bias, they introduce a self-supervised learning module to calibrate the model's predictions.
To handle evolutionary data class bias, they employ a class-balanced sampling strategy during training.

The researchers combine these solutions into a generic method that can continuously evolve a given initial zero-shot model. They evaluate their approach on three popular GZSL benchmark datasets and show that it outperforms other baselines that fail to adapt as well to the test data stream.

Critical Analysis

The researchers have identified an important and timely challenge in the field of zero-shot learning

(link: https://aimodels.fyi/papers/arxiv/visual-augmented-dynamic-semantic-prototype-generative-zero)

. The introduction of Evolutionary Generalized Zero-Shot Learning (EGZSL) represents a significant advancement beyond the traditional zero-shot learning setting, as it addresses the need for models that can continuously adapt and improve in specialized domains using unlabeled data.

The researchers have highlighted several key challenges that must be addressed to enable this type of continuous evolution, and their proposed solutions seem well-reasoned and grounded in existing research. However, it's important to note that the EGZSL setting is still relatively new, and there may be additional challenges or limitations that were not explored in this work.

For example, the researchers do not discuss the computational and memory requirements of their approach, which could be a significant factor in real-world deployments. Additionally, the performance of the proposed method may be sensitive to the initial quality of the zero-shot model, and the researchers do not explore the impact of different initialization strategies.

Overall, the researchers have made a valuable contribution to the field of zero-shot learning by introducing the EGZSL challenge and proposing a promising approach to address it. As the demand for continually evolving models in specialized domains grows, this work provides a strong foundation for further research and development in this area.

Conclusion

This paper introduces a new challenge called Evolutionary Generalized Zero-Shot Learning (EGZSL), which aims to enable low-performing zero-shot models to continuously adapt and evolve using unlabeled test data. The researchers identified three key challenges in this setting: catastrophic forgetting, initial prediction bias, and evolutionary data class bias. They proposed targeted solutions to address each of these challenges and developed a generic method for continuous evolution from an initial zero-shot model.

The researchers' experiments on popular GZSL benchmark datasets demonstrate the effectiveness of their approach, which outperforms other baselines that fail to adapt as well to the test data stream. This work represents an important step forward in the field of zero-shot learning, as it addresses the growing need for universal models that can continually learn and improve in expert domains using unlabeled data

(link: https://aimodels.fyi/papers/arxiv/gpt4ego-unleashing-potential-pre-trained-models-zero)

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

📊

Exploring Data Efficiency in Zero-Shot Learning with Diffusion Models

Zihan Ye, Shreyank N. Gowda, Xiaobo Jin, Xiaowei Huang, Haotian Xu, Yaochu Jin, Kaizhu Huang

Zero-Shot Learning (ZSL) aims to enable classifiers to identify unseen classes by enhancing data efficiency at the class level. This is achieved by generating image features from pre-defined semantics of unseen classes. However, most current approaches heavily depend on the number of samples from seen classes, i.e. they do not consider instance-level effectiveness. In this paper, we demonstrate that limited seen examples generally result in deteriorated performance of generative models. To overcome these challenges, we propose ZeroDiff, a Diffusion-based Generative ZSL model. This unified framework incorporates diffusion models to improve data efficiency at both the class and instance levels. Specifically, for instance-level effectiveness, ZeroDiff utilizes a forward diffusion chain to transform limited data into an expanded set of noised data. For class-level effectiveness, we design a two-branch generation structure that consists of a Diffusion-based Feature Generator (DFG) and a Diffusion-based Representation Generator (DRG). DFG focuses on learning and sampling the distribution of cross-entropy-based features, whilst DRG learns the supervised contrastive-based representation to boost the zero-shot capabilities of DFG. Additionally, we employ three discriminators to evaluate generated features from various aspects and introduce a Wasserstein-distance-based mutual learning loss to transfer knowledge among discriminators, thereby enhancing guidance for generation. Demonstrated through extensive experiments on three popular ZSL benchmarks, our ZeroDiff not only achieves significant improvements over existing ZSL methods but also maintains robust performance even with scarce training data. Code will be released upon acceptance.

6/6/2024

cs.CV cs.LG

`Eyes of a Hawk and Ears of a Fox': Part Prototype Network for Generalized Zero-Shot Learning

Joshua Feinglass, Jayaraman J. Thiagarajan, Rushil Anirudh, T. S. Jayram, Yezhou Yang

Current approaches in Generalized Zero-Shot Learning (GZSL) are built upon base models which consider only a single class attribute vector representation over the entire image. This is an oversimplification of the process of novel category recognition, where different regions of the image may have properties from different seen classes and thus have different predominant attributes. With this in mind, we take a fundamentally different approach: a pre-trained Vision-Language detector (VINVL) sensitive to attribute information is employed to efficiently obtain region features. A learned function maps the region features to region-specific attribute attention used to construct class part prototypes. We conduct experiments on a popular GZSL benchmark consisting of the CUB, SUN, and AWA2 datasets where our proposed Part Prototype Network (PPN) achieves promising results when compared with other popular base models. Corresponding ablation studies and analysis show that our approach is highly practical and has a distinct advantage over global attribute attention when localized proposals are available.

4/16/2024

cs.CV cs.LG

Progressive Semantic-Guided Vision Transformer for Zero-Shot Learning

Shiming Chen, Wenjin Hou, Salman Khan, Fahad Shahbaz Khan

Zero-shot learning (ZSL) recognizes the unseen classes by conducting visual-semantic interactions to transfer semantic knowledge from seen classes to unseen ones, supported by semantic information (e.g., attributes). However, existing ZSL methods simply extract visual features using a pre-trained network backbone (i.e., CNN or ViT), which fail to learn matched visual-semantic correspondences for representing semantic-related visual features as lacking of the guidance of semantic information, resulting in undesirable visual-semantic interactions. To tackle this issue, we propose a progressive semantic-guided vision transformer for zero-shot learning (dubbed ZSLViT). ZSLViT mainly considers two properties in the whole network: i) discover the semantic-related visual representations explicitly, and ii) discard the semantic-unrelated visual information. Specifically, we first introduce semantic-embedded token learning to improve the visual-semantic correspondences via semantic enhancement and discover the semantic-related visual tokens explicitly with semantic-guided token attention. Then, we fuse low semantic-visual correspondence visual tokens to discard the semantic-unrelated visual information for visual enhancement. These two operations are integrated into various encoders to progressively learn semantic-related visual representations for accurate visual-semantic interactions in ZSL. The extensive experiments show that our ZSLViT achieves significant performance gains on three popular benchmark datasets, i.e., CUB, SUN, and AWA2.

4/12/2024

cs.CV cs.LG

Description Boosting for Zero-Shot Entity and Relation Classification

Gabriele Picco, Leopold Fuchs, Marcos Mart'inez Galindo, Alberto Purpura, Vanessa L'opez, Hoang Thanh Lam

Zero-shot entity and relation classification models leverage available external information of unseen classes -- e.g., textual descriptions -- to annotate input text data. Thanks to the minimum data requirement, Zero-Shot Learning (ZSL) methods have high value in practice, especially in applications where labeled data is scarce. Even though recent research in ZSL has demonstrated significant results, our analysis reveals that those methods are sensitive to provided textual descriptions of entities (or relations). Even a minor modification of descriptions can lead to a change in the decision boundary between entity (or relation) classes. In this paper, we formally define the problem of identifying effective descriptions for zero shot inference. We propose a strategy for generating variations of an initial description, a heuristic for ranking them and an ensemble method capable of boosting the predictions of zero-shot models through description enhancement. Empirical results on four different entity and relation classification datasets show that our proposed method outperform existing approaches and achieve new SOTA results on these datasets under the ZSL settings. The source code of the proposed solutions and the evaluation framework are open-sourced.

6/5/2024

cs.CL cs.IR cs.LG