Zero-Shot Out-of-Distribution Detection with Outlier Label Exposure

2406.01170

Published 6/4/2024 by Choubo Ding, Guansong Pang

Zero-Shot Out-of-Distribution Detection with Outlier Label Exposure

Abstract

As vision-language models like CLIP are widely applied to zero-shot tasks and gain remarkable performance on in-distribution (ID) data, detecting and rejecting out-of-distribution (OOD) inputs in the zero-shot setting have become crucial for ensuring the safety of using such models on the fly. Most existing zero-shot OOD detectors rely on ID class label-based prompts to guide CLIP in classifying ID images and rejecting OOD images. In this work we instead propose to leverage a large set of diverse auxiliary outlier class labels as pseudo OOD class text prompts to CLIP for enhancing zero-shot OOD detection, an approach we called Outlier Label Exposure (OLE). The key intuition is that ID images are expected to have lower similarity to these outlier class prompts than OOD images. One issue is that raw class labels often include noise labels, e.g., synonyms of ID labels, rendering raw OLE-based detection ineffective. To address this issue, we introduce an outlier prototype learning module that utilizes the prompt embeddings of the outlier labels to learn a small set of pivotal outlier prototypes for an embedding similarity-based OOD scoring. Additionally, the outlier classes and their prototypes can be loosely coupled with the ID classes, leading to an inseparable decision region between them. Thus, we also introduce an outlier label generation module that synthesizes our outlier prototypes and ID class embeddings to generate in-between outlier prototypes to further calibrate the detection in OLE. Despite its simplicity, extensive experiments show that OLE substantially improves detection performance and achieves new state-of-the-art performance in large-scale OOD and hard OOD detection benchmarks.

Create account to get full access

Overview

This paper proposes a method for detecting out-of-distribution (OOD) samples using a "zero-shot" approach, which means it can detect OOD samples without requiring any OOD data during training.
The key innovation is the use of "outlier label exposure," where the model is trained on examples of OOD samples labeled as "outliers" to help it learn to recognize OOD patterns.
The authors demonstrate the effectiveness of their approach on various OOD detection tasks, showing significant performance improvements over existing zero-shot OOD detection methods.

Plain English Explanation

In machine learning, the ability to detect "out-of-distribution" (OOD) data is an important challenge. OOD data refers to samples that are very different from the data the model was trained on, and being able to identify these samples is crucial for ensuring the model's reliability and safety in real-world applications.

The paper proposes a new technique called "zero-shot OOD detection with outlier label exposure." The key idea is to train the model not just on the normal, in-distribution data it's supposed to handle, but also on examples of OOD data that are explicitly labeled as "outliers." This helps the model learn the distinctive patterns and features of OOD data, so it can then recognize new OOD samples during deployment, even without having seen any OOD data during training.

The authors demonstrate that this approach outperforms other zero-shot OOD detection methods across a variety of tasks. By exposing the model to labeled OOD examples during training, it gains a better understanding of what OOD data looks like, allowing it to more accurately identify novel OOD samples in the future.

This is an important advancement, as traditional OOD detection methods often require access to OOD data during training, which can be difficult or expensive to obtain in many real-world scenarios. The zero-shot approach proposed in this paper provides a way to achieve strong OOD detection performance without that limitation.

Technical Explanation

The key technical contribution of this paper is the introduction of "outlier label exposure" for zero-shot OOD detection. Traditionally, zero-shot OOD detection methods like those discussed in this paper have relied on learning a representation of in-distribution data and using that to identify OOD samples. However, this can be challenging, as the model may struggle to capture the full diversity of OOD data.

The authors' approach instead exposes the model to labeled examples of OOD data during training. Specifically, they create a mixed training dataset that contains both in-distribution samples and OOD samples labeled as "outliers." By learning to distinguish these labeled outliers from the in-distribution data, the model gains a more robust understanding of OOD patterns and can then more effectively identify novel OOD samples at test time.

The authors evaluate their approach on a variety of OOD detection benchmarks, including CIFAR-10, CIFAR-100, and TinyImageNet. They show that their "outlier label exposure" method significantly outperforms other zero-shot OOD detection techniques, particularly on challenging "near-OOD" tasks where the OOD samples are visually similar to the in-distribution data.

Critical Analysis

One potential limitation of the proposed approach is that it requires access to labeled OOD data during training, which may not always be available in real-world scenarios. The authors acknowledge this and suggest that future work could explore methods to generate synthetic OOD examples or leverage large language models to envisage OOD samples for training.

Additionally, while the authors demonstrate strong performance on the evaluated benchmarks, it would be valuable to see how their method generalizes to more diverse and complex OOD scenarios, such as those involving noisy or adversarial OOD samples. Further research could explore the robustness of the proposed approach in such challenging settings.

Overall, this paper presents an intriguing and effective technique for zero-shot OOD detection, which could have significant practical implications for deploying machine learning models in the real world. The use of outlier label exposure is a clever innovation that helps the model better understand the characteristics of OOD data, leading to improved detection performance.

Conclusion

This paper introduces a novel approach to zero-shot out-of-distribution (OOD) detection that leverages "outlier label exposure" during training. By exposing the model to examples of OOD data labeled as outliers, it can learn to better recognize the distinctive patterns of OOD samples and identify them effectively at test time, without requiring any OOD data during the training process.

The authors demonstrate the effectiveness of their approach across various OOD detection benchmarks, showing significant performance improvements over existing zero-shot methods. This is an important advancement, as traditional OOD detection techniques often require access to OOD data during training, which can be challenging to obtain in many real-world applications.

While the proposed method shows promise, future research could explore ways to further enhance its robustness and generalization, such as by generating synthetic OOD examples or leveraging large language models to envisage OOD patterns. Overall, this work represents an important step forward in the field of OOD detection, with the potential to enable more reliable and trustworthy machine learning systems.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Envisioning Outlier Exposure by Large Language Models for Out-of-Distribution Detection

Chentao Cao, Zhun Zhong, Zhanke Zhou, Yang Liu, Tongliang Liu, Bo Han

Detecting out-of-distribution (OOD) samples is essential when deploying machine learning models in open-world scenarios. Zero-shot OOD detection, requiring no training on in-distribution (ID) data, has been possible with the advent of vision-language models like CLIP. Existing methods build a text-based classifier with only closed-set labels. However, this largely restricts the inherent capability of CLIP to recognize samples from large and open label space. In this paper, we propose to tackle this constraint by leveraging the expert knowledge and reasoning capability of large language models (LLM) to Envision potential Outlier Exposure, termed EOE, without access to any actual OOD data. Owing to better adaptation to open-world scenarios, EOE can be generalized to different tasks, including far, near, and fine-grained OOD detection. Technically, we design (1) LLM prompts based on visual similarity to generate potential outlier class labels specialized for OOD detection, as well as (2) a new score function based on potential outlier penalty to distinguish hard OOD samples effectively. Empirically, EOE achieves state-of-the-art performance across different OOD tasks and can be effectively scaled to the ImageNet-1K dataset. The code is publicly available at: https://github.com/tmlr-group/EOE.

6/4/2024

cs.LG

🔎

CLIPScope: Enhancing Zero-Shot OOD Detection with Bayesian Scoring

Hao Fu, Naman Patel, Prashanth Krishnamurthy, Farshad Khorrami

Detection of out-of-distribution (OOD) samples is crucial for safe real-world deployment of machine learning models. Recent advances in vision language foundation models have made them capable of detecting OOD samples without requiring in-distribution (ID) images. However, these zero-shot methods often underperform as they do not adequately consider ID class likelihoods in their detection confidence scoring. Hence, we introduce CLIPScope, a zero-shot OOD detection approach that normalizes the confidence score of a sample by class likelihoods, akin to a Bayesian posterior update. Furthermore, CLIPScope incorporates a novel strategy to mine OOD classes from a large lexical database. It selects class labels that are farthest and nearest to ID classes in terms of CLIP embedding distance to maximize coverage of OOD samples. We conduct extensive ablation studies and empirical evaluations, demonstrating state of the art performance of CLIPScope across various OOD detection benchmarks.

5/24/2024

cs.CV

Exploiting Diffusion Prior for Out-of-Distribution Detection

Armando Zhu, Jiabei Liu, Keqin Li, Shuying Dai, Bo Hong, Peng Zhao, Changsong Wei

Out-of-distribution (OOD) detection is crucial for deploying robust machine learning models, especially in areas where security is critical. However, traditional OOD detection methods often fail to capture complex data distributions from large scale date. In this paper, we present a novel approach for OOD detection that leverages the generative ability of diffusion models and the powerful feature extraction capabilities of CLIP. By using these features as conditional inputs to a diffusion model, we can reconstruct the images after encoding them with CLIP. The difference between the original and reconstructed images is used as a signal for OOD identification. The practicality and scalability of our method is increased by the fact that it does not require class-specific labeled ID data, as is the case with many other methods. Extensive experiments on several benchmark datasets demonstrates the robustness and effectiveness of our method, which have significantly improved the detection accuracy.

6/18/2024

cs.CV cs.AI

Enhancing Near OOD Detection in Prompt Learning: Maximum Gains, Minimal Costs

Myong Chol Jung, He Zhao, Joanna Dipnall, Belinda Gabbe, Lan Du

Prompt learning has shown to be an efficient and effective fine-tuning method for vision-language models like CLIP. While numerous studies have focused on the generalisation of these models in few-shot classification, their capability in near out-of-distribution (OOD) detection has been overlooked. A few recent works have highlighted the promising performance of prompt learning in far OOD detection. However, the more challenging task of few-shot near OOD detection has not yet been addressed. In this study, we investigate the near OOD detection capabilities of prompt learning models and observe that commonly used OOD scores have limited performance in near OOD detection. To enhance the performance, we propose a fast and simple post-hoc method that complements existing logit-based scores, improving near OOD detection AUROC by up to 11.67% with minimal computational cost. Our method can be easily applied to any prompt learning model without change in architecture or re-training the models. Comprehensive empirical evaluations across 13 datasets and 8 models demonstrate the effectiveness and adaptability of our method.

5/28/2024

cs.CV