Envisioning Outlier Exposure by Large Language Models for Out-of-Distribution Detection

2406.00806

Published 6/4/2024 by Chentao Cao, Zhun Zhong, Zhanke Zhou, Yang Liu, Tongliang Liu, Bo Han

Envisioning Outlier Exposure by Large Language Models for Out-of-Distribution Detection

Abstract

Detecting out-of-distribution (OOD) samples is essential when deploying machine learning models in open-world scenarios. Zero-shot OOD detection, requiring no training on in-distribution (ID) data, has been possible with the advent of vision-language models like CLIP. Existing methods build a text-based classifier with only closed-set labels. However, this largely restricts the inherent capability of CLIP to recognize samples from large and open label space. In this paper, we propose to tackle this constraint by leveraging the expert knowledge and reasoning capability of large language models (LLM) to Envision potential Outlier Exposure, termed EOE, without access to any actual OOD data. Owing to better adaptation to open-world scenarios, EOE can be generalized to different tasks, including far, near, and fine-grained OOD detection. Technically, we design (1) LLM prompts based on visual similarity to generate potential outlier class labels specialized for OOD detection, as well as (2) a new score function based on potential outlier penalty to distinguish hard OOD samples effectively. Empirically, EOE achieves state-of-the-art performance across different OOD tasks and can be effectively scaled to the ImageNet-1K dataset. The code is publicly available at: https://github.com/tmlr-group/EOE.

Create account to get full access

Overview

This paper explores how large language models (LLMs) can be used for out-of-distribution (OOD) detection, which is the task of identifying data that is significantly different from the training data.
The researchers investigate how LLMs can be "exposed" to outliers during training to improve their ability to detect OOD samples during inference.
They propose several techniques for exposing LLMs to outliers, such as zero-shot OOD detection and finetuning on synthetic outliers.
The goal is to leverage the powerful representation learning capabilities of LLMs to improve OOD detection performance compared to traditional methods.

Plain English Explanation

Large language models (LLMs) like GPT-3 have become incredibly powerful at understanding and generating human language. But how well can they detect when something is completely different from the data they were trained on? This is the problem of out-of-distribution (OOD) detection.

The researchers in this paper wanted to see if they could "expose" LLMs to outlier data during training, to help them better recognize when something is radically different from their normal input. They tried a few different techniques for this, like zero-shot OOD detection (where the model is trained to detect outliers without seeing any examples) and finetuning on synthetic outliers (where the model is further trained on generated outlier data).

The key idea is that LLMs are very good at learning patterns and representations from data. If you can expose them to some outliers during training, they may be able to develop a better intuition for what counts as "normal" versus "abnormal" data. This could make them more effective at OOD detection compared to other approaches.

Technical Explanation

The paper begins by providing background on the OOD detection problem and discussing prior work on using language-enhanced latent representations and negative label guidance to improve OOD detection.

The core contribution of this work is exploring different techniques for "exposing" LLMs to outlier data during training. The first approach is zero-shot OOD detection, where the model is trained to detect OOD samples without ever seeing any examples of them. This is done by training the model on a core in-distribution dataset, as well as a set of "outlier" tokens or phrases that are treated as OOD during training.

Another technique is finetuning on synthetic outliers, where the pre-trained LLM is further trained on a dataset of generated outlier examples, in addition to the original in-distribution data. The researchers experiment with different methods for generating these synthetic outliers, such as perturbing the input or using a generative adversarial network (GAN).

The paper then evaluates the OOD detection performance of the LLMs trained with these outlier exposure techniques on several benchmark datasets. The results show that the exposed LLMs generally outperform both the base LLM and traditional OOD detection methods, indicating that the outlier exposure was effective at enhancing the model's ability to detect anomalous data.

Critical Analysis

The paper provides a novel and promising approach for improving OOD detection using large language models. By exposing the LLMs to outlier data during training, the researchers were able to enhance the models' ability to distinguish in-distribution from out-of-distribution samples.

However, the paper does not fully address some potential limitations and challenges of this approach. For example, the synthetic outlier generation methods used may not capture the full complexity and diversity of real-world OOD data, which could limit the generalization of the trained models. Additionally, the paper focuses on textual data, and it's unclear how well these techniques would transfer to other modalities like images or audio.

Further research is needed to explore more sophisticated outlier exposure techniques, as well as to assess the robustness and scalability of the approach on larger and more diverse datasets. It would also be valuable to investigate how the outlier-exposed LLMs perform on real-world OOD detection tasks, rather than just on benchmark datasets.

Conclusion

This paper presents a novel approach for enhancing large language models' ability to detect out-of-distribution data by exposing them to outliers during training. The results demonstrate that techniques like zero-shot OOD detection and finetuning on synthetic outliers can significantly improve LLM performance on OOD detection tasks compared to traditional methods.

While more research is needed to fully understand the limitations and potential of this approach, the work highlights the value of leveraging the powerful representation learning capabilities of LLMs to tackle the challenging problem of OOD detection. As LLMs become more widely deployed in real-world applications, the ability to reliably detect and handle OOD data will be increasingly important. The insights from this paper provide a promising direction for further advancements in this area.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Zero-Shot Out-of-Distribution Detection with Outlier Label Exposure

Choubo Ding, Guansong Pang

As vision-language models like CLIP are widely applied to zero-shot tasks and gain remarkable performance on in-distribution (ID) data, detecting and rejecting out-of-distribution (OOD) inputs in the zero-shot setting have become crucial for ensuring the safety of using such models on the fly. Most existing zero-shot OOD detectors rely on ID class label-based prompts to guide CLIP in classifying ID images and rejecting OOD images. In this work we instead propose to leverage a large set of diverse auxiliary outlier class labels as pseudo OOD class text prompts to CLIP for enhancing zero-shot OOD detection, an approach we called Outlier Label Exposure (OLE). The key intuition is that ID images are expected to have lower similarity to these outlier class prompts than OOD images. One issue is that raw class labels often include noise labels, e.g., synonyms of ID labels, rendering raw OLE-based detection ineffective. To address this issue, we introduce an outlier prototype learning module that utilizes the prompt embeddings of the outlier labels to learn a small set of pivotal outlier prototypes for an embedding similarity-based OOD scoring. Additionally, the outlier classes and their prototypes can be loosely coupled with the ID classes, leading to an inseparable decision region between them. Thus, we also introduce an outlier label generation module that synthesizes our outlier prototypes and ID class embeddings to generate in-between outlier prototypes to further calibrate the detection in OLE. Despite its simplicity, extensive experiments show that OLE substantially improves detection performance and achieves new state-of-the-art performance in large-scale OOD and hard OOD detection benchmarks.

6/4/2024

cs.CV

🛸

How Good Are LLMs at Out-of-Distribution Detection?

Bo Liu, Liming Zhan, Zexin Lu, Yujie Feng, Lei Xue, Xiao-Ming Wu

Out-of-distribution (OOD) detection plays a vital role in enhancing the reliability of machine learning (ML) models. The emergence of large language models (LLMs) has catalyzed a paradigm shift within the ML community, showcasing their exceptional capabilities across diverse natural language processing tasks. While existing research has probed OOD detection with relative small-scale Transformers like BERT, RoBERTa and GPT-2, the stark differences in scales, pre-training objectives, and inference paradigms call into question the applicability of these findings to LLMs. This paper embarks on a pioneering empirical investigation of OOD detection in the domain of LLMs, focusing on LLaMA series ranging from 7B to 65B in size. We thoroughly evaluate commonly-used OOD detectors, scrutinizing their performance in both zero-grad and fine-tuning scenarios. Notably, we alter previous discriminative in-distribution fine-tuning into generative fine-tuning, aligning the pre-training objective of LLMs with downstream tasks. Our findings unveil that a simple cosine distance OOD detector demonstrates superior efficacy, outperforming other OOD detectors. We provide an intriguing explanation for this phenomenon by highlighting the isotropic nature of the embedding spaces of LLMs, which distinctly contrasts with the anisotropic property observed in smaller BERT family models. The new insight enhances our understanding of how LLMs detect OOD data, thereby enhancing their adaptability and reliability in dynamic environments. We have released the source code at url{https://github.com/Awenbocc/LLM-OOD} for other researchers to reproduce our results.

4/17/2024

cs.CL

Your Finetuned Large Language Model is Already a Powerful Out-of-distribution Detector

Andi Zhang, Tim Z. Xiao, Weiyang Liu, Robert Bamler, Damon Wischik

We revisit the likelihood ratio between a pretrained large language model (LLM) and its finetuned variant as a criterion for out-of-distribution (OOD) detection. The intuition behind such a criterion is that, the pretrained LLM has the prior knowledge about OOD data due to its large amount of training data, and once finetuned with the in-distribution data, the LLM has sufficient knowledge to distinguish their difference. Leveraging the power of LLMs, we show that, for the first time, the likelihood ratio can serve as an effective OOD detector. Moreover, we apply the proposed LLM-based likelihood ratio to detect OOD questions in question-answering (QA) systems, which can be used to improve the performance of specialized LLMs for general questions. Given that likelihood can be easily obtained by the loss functions within contemporary neural network frameworks, it is straightforward to implement this approach in practice. Since both the pretrained LLMs and its various finetuned models are available, our proposed criterion can be effortlessly incorporated for OOD detection without the need for further training. We conduct comprehensive evaluation across on multiple settings, including far OOD, near OOD, spam detection, and QA scenarios, to demonstrate the effectiveness of the method.

4/16/2024

cs.CL cs.AI cs.LG stat.ML

Negative Label Guided OOD Detection with Pretrained Vision-Language Models

Xue Jiang, Feng Liu, Zhen Fang, Hong Chen, Tongliang Liu, Feng Zheng, Bo Han

Out-of-distribution (OOD) detection aims at identifying samples from unknown classes, playing a crucial role in trustworthy models against errors on unexpected inputs. Extensive research has been dedicated to exploring OOD detection in the vision modality. Vision-language models (VLMs) can leverage both textual and visual information for various multi-modal applications, whereas few OOD detection methods take into account information from the text modality. In this paper, we propose a novel post hoc OOD detection method, called NegLabel, which takes a vast number of negative labels from extensive corpus databases. We design a novel scheme for the OOD score collaborated with negative labels. Theoretical analysis helps to understand the mechanism of negative labels. Extensive experiments demonstrate that our method NegLabel achieves state-of-the-art performance on various OOD detection benchmarks and generalizes well on multiple VLM architectures. Furthermore, our method NegLabel exhibits remarkable robustness against diverse domain shifts. The codes are available at https://github.com/tmlr-group/NegLabel.

4/1/2024

cs.CV cs.LG