How Good Are LLMs at Out-of-Distribution Detection?

2308.10261

Published 4/17/2024 by Bo Liu, Liming Zhan, Zexin Lu, Yujie Feng, Lei Xue, Xiao-Ming Wu

🛸

Abstract

Out-of-distribution (OOD) detection plays a vital role in enhancing the reliability of machine learning (ML) models. The emergence of large language models (LLMs) has catalyzed a paradigm shift within the ML community, showcasing their exceptional capabilities across diverse natural language processing tasks. While existing research has probed OOD detection with relative small-scale Transformers like BERT, RoBERTa and GPT-2, the stark differences in scales, pre-training objectives, and inference paradigms call into question the applicability of these findings to LLMs. This paper embarks on a pioneering empirical investigation of OOD detection in the domain of LLMs, focusing on LLaMA series ranging from 7B to 65B in size. We thoroughly evaluate commonly-used OOD detectors, scrutinizing their performance in both zero-grad and fine-tuning scenarios. Notably, we alter previous discriminative in-distribution fine-tuning into generative fine-tuning, aligning the pre-training objective of LLMs with downstream tasks. Our findings unveil that a simple cosine distance OOD detector demonstrates superior efficacy, outperforming other OOD detectors. We provide an intriguing explanation for this phenomenon by highlighting the isotropic nature of the embedding spaces of LLMs, which distinctly contrasts with the anisotropic property observed in smaller BERT family models. The new insight enhances our understanding of how LLMs detect OOD data, thereby enhancing their adaptability and reliability in dynamic environments. We have released the source code at url{https://github.com/Awenbocc/LLM-OOD} for other researchers to reproduce our results.

Create account to get full access

Overview

This paper investigates out-of-distribution (OOD) detection in large language models (LLMs), which is crucial for ensuring the reliability of machine learning models.
The researchers evaluated common OOD detectors on LLMs ranging from 7 billion to 65 billion parameters, exploring both zero-shot and fine-tuning scenarios.
They found that a simple cosine distance-based OOD detector outperformed other methods, and attributed this to the isotropic nature of LLM embedding spaces, which contrasts with smaller BERT-like models.
The insights from this research can help enhance the adaptability and reliability of LLMs in dynamic environments.

Plain English Explanation

Machine learning models are becoming increasingly powerful, thanks in part to the emergence of large language models (LLMs). However, ensuring the reliability of these models is critical, and out-of-distribution (OOD) detection plays a vital role in this.

OOD detection is the ability of a model to identify data that is significantly different from the type of information it was trained on. This is important because if a model is used in the real world, it may encounter data that is very different from what it was trained on, and it needs to be able to recognize when this is happening.

In this paper, the researchers looked at how well different OOD detection methods work with large language models, which are much more powerful than the smaller models that have been studied in the past. They tested a range of OOD detectors on LLMs of different sizes, from 7 billion parameters up to 65 billion.

Interestingly, the researchers found that a simple method based on cosine distance was the most effective at detecting OOD data, outperforming more complex approaches. They believe this is because the way the embeddings (internal representations) are organized in large language models is different from smaller models, with a more isotropic (uniform) structure.

This discovery helps us better understand how large language models detect OOD data, which is important for making these powerful AI systems more reliable and adaptable to real-world conditions. The researchers have made their code publicly available so that others can build on this work.

Technical Explanation

The researchers conducted a comprehensive empirical investigation of OOD detection in the context of large language models (LLMs). They focused on the LLaMA series, which ranges from 7 billion to 65 billion parameters, to understand how OOD detection performance scales with model size.

The team thoroughly evaluated several commonly-used OOD detectors, including methods based on density estimation, reconstruction error, and representation distance. They examined the detectors' performance in both zero-shot and fine-tuning scenarios.

Notably, the researchers altered the typical fine-tuning approach, shifting from discriminative to generative fine-tuning. This aligns the pre-training objective of the LLMs with the downstream OOD detection task, potentially enhancing the models' ability to generalize.

The results revealed that a simple cosine distance-based OOD detector demonstrated superior efficacy, outperforming more complex alternatives. The researchers provide an intriguing explanation for this finding by highlighting the isotropic nature of the embedding spaces in LLMs, which contrasts with the anisotropic (non-uniform) property observed in smaller BERT-like models.

This insight into the underlying structure of LLM representations enhances our understanding of how these models detect OOD data, which is crucial for improving their adaptability and reliability in dynamic environments.

Critical Analysis

The researchers have made a valuable contribution to the understanding of OOD detection in large language models. By exploring a range of OOD detectors and model sizes, they have provided important insights that can inform the development of more reliable and adaptable AI systems.

One potential limitation of the study is the focus on the LLaMA series, which may not fully capture the diversity of LLM architectures and pre-training approaches. It would be interesting to see how the findings extend to other prominent LLMs, such as GPT-3, InstructGPT, or PALM.

Additionally, the researchers' explanation for the superior performance of the cosine distance-based OOD detector relies on the isotropic nature of LLM embeddings. While this is a plausible hypothesis, further investigation and validation would be beneficial to solidify this understanding.

It would also be valuable to explore the implications of these findings for real-world applications of LLMs, where the distribution of input data may be even more complex and challenging than the controlled experimental settings. Investigating the robustness of the proposed OOD detection methods in more realistic scenarios could provide additional insights.

Overall, this paper represents an important step forward in the quest to enhance the reliability and trustworthiness of large language models. By continuing to critically examine these models and their capabilities, researchers can help ensure that the remarkable advances in AI technology are accompanied by corresponding improvements in safety and reliability.

Conclusion

This research paper offers a pioneering exploration of out-of-distribution (OOD) detection in the domain of large language models (LLMs). The key findings reveal that a simple cosine distance-based OOD detector outperforms more complex alternatives, due to the unique isotropic structure of LLM embedding spaces.

These insights enhance our understanding of how LLMs detect OOD data, which is crucial for improving the adaptability and reliability of these powerful AI systems in dynamic real-world environments. The researchers have made their source code publicly available, enabling other researchers to build upon this work and further advance the state of the art in OOD detection for large language models.

As AI continues to play an increasingly prominent role in our lives, ensuring the reliability and trustworthiness of these models is of paramount importance. The insights provided by this paper represent an important step towards that goal, paving the way for more robust and adaptable machine learning systems that can be safely deployed in a wide range of applications.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Your Finetuned Large Language Model is Already a Powerful Out-of-distribution Detector

Andi Zhang, Tim Z. Xiao, Weiyang Liu, Robert Bamler, Damon Wischik

We revisit the likelihood ratio between a pretrained large language model (LLM) and its finetuned variant as a criterion for out-of-distribution (OOD) detection. The intuition behind such a criterion is that, the pretrained LLM has the prior knowledge about OOD data due to its large amount of training data, and once finetuned with the in-distribution data, the LLM has sufficient knowledge to distinguish their difference. Leveraging the power of LLMs, we show that, for the first time, the likelihood ratio can serve as an effective OOD detector. Moreover, we apply the proposed LLM-based likelihood ratio to detect OOD questions in question-answering (QA) systems, which can be used to improve the performance of specialized LLMs for general questions. Given that likelihood can be easily obtained by the loss functions within contemporary neural network frameworks, it is straightforward to implement this approach in practice. Since both the pretrained LLMs and its various finetuned models are available, our proposed criterion can be effortlessly incorporated for OOD detection without the need for further training. We conduct comprehensive evaluation across on multiple settings, including far OOD, near OOD, spam detection, and QA scenarios, to demonstrate the effectiveness of the method.

4/16/2024

cs.CL cs.AI cs.LG stat.ML

Envisioning Outlier Exposure by Large Language Models for Out-of-Distribution Detection

Chentao Cao, Zhun Zhong, Zhanke Zhou, Yang Liu, Tongliang Liu, Bo Han

Detecting out-of-distribution (OOD) samples is essential when deploying machine learning models in open-world scenarios. Zero-shot OOD detection, requiring no training on in-distribution (ID) data, has been possible with the advent of vision-language models like CLIP. Existing methods build a text-based classifier with only closed-set labels. However, this largely restricts the inherent capability of CLIP to recognize samples from large and open label space. In this paper, we propose to tackle this constraint by leveraging the expert knowledge and reasoning capability of large language models (LLM) to Envision potential Outlier Exposure, termed EOE, without access to any actual OOD data. Owing to better adaptation to open-world scenarios, EOE can be generalized to different tasks, including far, near, and fine-grained OOD detection. Technically, we design (1) LLM prompts based on visual similarity to generate potential outlier class labels specialized for OOD detection, as well as (2) a new score function based on potential outlier penalty to distinguish hard OOD samples effectively. Empirically, EOE achieves state-of-the-art performance across different OOD tasks and can be effectively scaled to the ImageNet-1K dataset. The code is publicly available at: https://github.com/tmlr-group/EOE.

6/4/2024

cs.LG

Continual Unsupervised Out-of-Distribution Detection

Lars Doorenbos, Raphael Sznitman, Pablo M'arquez-Neila

Deep learning models excel when the data distribution during training aligns with testing data. Yet, their performance diminishes when faced with out-of-distribution (OOD) samples, leading to great interest in the field of OOD detection. Current approaches typically assume that OOD samples originate from an unconcentrated distribution complementary to the training distribution. While this assumption is appropriate in the traditional unsupervised OOD (U-OOD) setting, it proves inadequate when considering the place of deployment of the underlying deep learning model. To better reflect this real-world scenario, we introduce the novel setting of continual U-OOD detection. To tackle this new setting, we propose a method that starts from a U-OOD detector, which is agnostic to the OOD distribution, and slowly updates during deployment to account for the actual OOD distribution. Our method uses a new U-OOD scoring function that combines the Mahalanobis distance with a nearest-neighbor approach. Furthermore, we design a confidence-scaled few-shot OOD detector that outperforms previous methods. We show our method greatly improves upon strong baselines from related fields.

6/5/2024

cs.CV cs.LG

OAML: Outlier Aware Metric Learning for OOD Detection Enhancement

Heng Gao, Zhuolin He, Shoumeng Qiu, Jian Pu

Out-of-distribution (OOD) detection methods have been developed to identify objects that a model has not seen during training. The Outlier Exposure (OE) methods use auxiliary datasets to train OOD detectors directly. However, the collection and learning of representative OOD samples may pose challenges. To tackle these issues, we propose the Outlier Aware Metric Learning (OAML) framework. The main idea of our method is to use the k-NN algorithm and Stable Diffusion model to generate outliers for training at the feature level without making any distributional assumptions. To increase feature discrepancies in the semantic space, we develop a mutual information-based contrastive learning approach for learning from OOD data effectively. Both theoretical and empirical results confirm the effectiveness of this contrastive learning technique. Furthermore, we incorporate knowledge distillation into our learning framework to prevent degradation of in-distribution classification accuracy. The combination of contrastive learning and knowledge distillation algorithms significantly enhances the performance of OOD detection. Experimental results across various datasets show that our method significantly outperforms previous OE methods.

6/26/2024

stat.ML cs.LG