A View on Out-of-Distribution Identification from a Statistical Testing Theory Perspective






Published 5/13/2024 by Alberto Caron, Chris Hicks, Vasilios Mavroudis



We study the problem of efficiently detecting Out-of-Distribution (OOD) samples at test time in supervised and unsupervised learning contexts. While ML models are typically trained under the assumption that training and test data stem from the same distribution, this is often not the case in realistic settings, thus reliably detecting distribution shifts is crucial at deployment. We re-formulate the OOD problem under the lenses of statistical testing and then discuss conditions that render the OOD problem identifiable in statistical terms. Building on this framework, we study convergence guarantees of an OOD test based on the Wasserstein distance, and provide a simple empirical evaluation.



Overview

  • The paper proposes a statistical testing theory perspective on the problem of out-of-distribution (OOD) identification.
  • OOD identification is the task of detecting whether a given input data point is from the same distribution as the training data or from a different, "out-of-distribution" data source.
  • The authors argue that the OOD identification problem can be reframed as a statistical hypothesis testing problem, which provides a principled framework for analyzing and designing OOD detection methods.

Plain English Explanation

In machine learning, a common challenge is "out-of-distribution (OOD) identification": detecting when a model is presented with data that differs from the data it was trained on. For example, imagine a model that was trained to recognize images of cats and dogs and is then shown an image of a giraffe. The model might fail to recognize that the giraffe is "out-of-distribution," meaning it does not come from the same distribution as the cat and dog images the model was trained on.

This paper proposes a new way of thinking about the OOD identification problem, using ideas from the field of statistical testing theory. The authors argue that OOD identification can be reframed as a statistical hypothesis test, where the goal is to determine whether a given input data point is from the same distribution as the training data, or from a different, "out-of-distribution" source.

By reframing the problem in this way, the authors believe that we can develop more principled and effective methods for detecting OOD data. This could be particularly useful in medical image analysis, where it's important to detect when a medical scan is showing something that's different from what the model was trained on.

Technical Explanation

The paper recasts out-of-distribution (OOD) identification as a statistical hypothesis testing problem: given an input data point, decide whether it comes from the same distribution as the training data or from a different, "out-of-distribution" source. Building on this formulation, the authors discuss conditions under which the OOD problem is identifiable in statistical terms, study convergence guarantees for an OOD test based on the Wasserstein distance, and provide a simple empirical evaluation.

Specifically, the authors define the null hypothesis (H0) as the data point being from the same distribution as the training data, and the alternative hypothesis (H1) as the data point being from a different distribution. They then show how various OOD detection methods can be interpreted as different ways of constructing the test statistic and setting the decision threshold for this hypothesis test.
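To make the roles of the test statistic and the decision threshold concrete, here is a minimal sketch of a two-sample test that uses the Wasserstein distance as its statistic. It is an illustration under stated assumptions, not the authors' procedure: it assumes a batch of one-dimensional test values (for example, a scalar feature or model score) rather than a single point, and it sets the threshold with a permutation test. The names wasserstein_1d and permutation_ood_test are hypothetical.

```python
import random


def wasserstein_1d(xs, ys):
    """Empirical 1-Wasserstein distance between two 1-D samples.

    Computed as the integral of |F(t) - G(t)| over t, where F and G are
    the empirical CDFs of xs and ys. Sample sizes may differ.
    """
    xs, ys = sorted(xs), sorted(ys)
    points = sorted(xs + ys)
    i = j = 0
    total = 0.0
    # Both empirical CDFs are constant between consecutive pooled points.
    for left, right in zip(points, points[1:]):
        while i < len(xs) and xs[i] <= left:
            i += 1
        while j < len(ys) and ys[j] <= left:
            j += 1
        total += abs(i / len(xs) - j / len(ys)) * (right - left)
    return total


def permutation_ood_test(train, test, alpha=0.05, n_perm=200, seed=0):
    """Test H0 (test batch has the training distribution) against H1 (it does not).

    Assumption: the null distribution of the statistic is approximated by
    randomly re-splitting the pooled sample (a permutation test).
    Returns (statistic, p_value, reject_h0).
    """
    rng = random.Random(seed)
    observed = wasserstein_1d(train, test)
    pooled = list(train) + list(test)
    n = len(train)
    exceed = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)
        if wasserstein_1d(pooled[:n], pooled[n:]) >= observed:
            exceed += 1
    p_value = (exceed + 1) / (n_perm + 1)
    return observed, p_value, p_value <= alpha


# Usage with synthetic data (illustrative only).
demo_rng = random.Random(1)
train = [demo_rng.gauss(0.0, 1.0) for _ in range(200)]
shifted = [demo_rng.gauss(3.0, 1.0) for _ in range(50)]
stat, p_value, reject = permutation_ood_test(train, shifted)
print(f"statistic={stat:.3f}, p-value={p_value:.4f}, reject H0={reject}")
```

Here the statistic measures how far the test batch is from the training sample, and alpha fixes the tolerated rate of false alarms under H0. Other OOD detectors can be read the same way by swapping in a different statistic or a different rule for the threshold.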

The authors also discuss the connections between OOD identification and other related problems, such as anomaly detection and uncertainty quantification. They argue that the statistical testing theory perspective can provide a unifying framework for analyzing and designing methods for these related problems.

Critical Analysis

The statistical testing theory perspective proposed in this paper provides a principled and rigorous framework for analyzing and designing OOD detection methods. By reframing the problem as a hypothesis test, the authors are able to draw on a wealth of statistical theory and techniques to guide the development of new OOD detection algorithms.

However, it's worth noting that the success of this approach still depends on the ability to construct appropriate test statistics and decision thresholds for a given problem. The authors acknowledge that this can be challenging in practice, and that further research is needed to develop systematic methods for choosing these components.
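As one illustration of what choosing a decision threshold can look like in practice, the sketch below calibrates a cutoff on held-out in-distribution scores so that the empirical false-alarm rate on that calibration set stays at or below a chosen level alpha. Everything here is an assumption made for the example: the score (absolute standardized distance from the training mean), the rank-based calibration rule, and the names fit_score and calibrate_threshold are hypothetical and are not taken from the paper.

```python
import math
import random
import statistics


def fit_score(train):
    """Return a scoring function: larger scores mean 'less like the training data'.

    Assumption: a simple standardized distance from the training mean.
    """
    mu = statistics.fmean(train)
    sigma = statistics.stdev(train)
    return lambda x: abs(x - mu) / sigma


def calibrate_threshold(cal_scores, alpha=0.05):
    """Pick a threshold from held-out in-distribution scores.

    Uses the ceil((1 - alpha) * (n + 1))-th smallest score, so at most a
    fraction alpha of the calibration scores lie strictly above it.
    """
    ordered = sorted(cal_scores)
    k = min(math.ceil((1 - alpha) * (len(ordered) + 1)), len(ordered))
    return ordered[k - 1]


# Usage with synthetic data (illustrative only).
rng = random.Random(0)
train = [rng.gauss(0.0, 1.0) for _ in range(500)]
calibration = [rng.gauss(0.0, 1.0) for _ in range(500)]

score = fit_score(train)
cal_scores = [score(x) for x in calibration]
threshold = calibrate_threshold(cal_scores, alpha=0.05)


def is_ood(x):
    """Flag x as OOD (reject H0) when its score exceeds the threshold."""
    return score(x) > threshold


print(f"threshold={threshold:.3f}, is_ood(10.0)={is_ood(10.0)}")
```

The trade-off is the usual one in hypothesis testing: a smaller alpha raises the threshold, which reduces false alarms on in-distribution data but makes subtle shifts harder to detect.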

Additionally, the paper focuses primarily on the theoretical aspects of the OOD identification problem and provides only a simple empirical evaluation rather than a comprehensive assessment of practical performance. It would be interesting to see how tests built on this perspective compare with other state-of-the-art OOD detection methods, especially on challenging real-world datasets.


Conclusion

This paper proposes a novel way of thinking about the out-of-distribution (OOD) identification problem, using ideas from the field of statistical testing theory. By reframing OOD identification as a hypothesis testing problem, the authors believe that we can develop more principled and effective methods for detecting OOD data.

The statistical testing theory perspective provides a unifying framework for analyzing and designing OOD detection algorithms, and could have important applications in domains like medical image analysis, where it's critical to identify when a given input is different from the training data.

While the paper focuses primarily on the theoretical aspects of the problem, it lays the groundwork for further research into developing practical OOD detection methods based on this new perspective. As the field of machine learning continues to grapple with the challenges of OOD identification, approaches like the one proposed in this paper may prove increasingly valuable.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

On the Learnability of Out-of-distribution Detection


Zhen Fang, Yixuan Li, Feng Liu, Bo Han, Jie Lu





Supervised learning aims to train a classifier under the assumption that training and test data are from the same distribution. To ease the above assumption, researchers have studied a more realistic setting: out-of-distribution (OOD) detection, where test data may come from classes that are unknown during training (i.e., OOD data). Due to the unavailability and diversity of OOD data, good generalization ability is crucial for effective OOD detection algorithms, and corresponding learning theory is still an open problem. To study the generalization of OOD detection, this paper investigates the probably approximately correct (PAC) learning theory of OOD detection that fits the commonly used evaluation metrics in the literature. First, we find a necessary condition for the learnability of OOD detection. Then, using this condition, we prove several impossibility theorems for the learnability of OOD detection under some scenarios. Although the impossibility theorems are frustrating, we find that some conditions of these impossibility theorems may not hold in some practical scenarios. Based on this observation, we next give several necessary and sufficient conditions to characterize the learnability of OOD detection in some practical scenarios. Lastly, we offer theoretical support for representative OOD detection works based on our OOD theory.



Continual Unsupervised Out-of-Distribution Detection


Lars Doorenbos, Raphael Sznitman, Pablo Márquez-Neila





Deep learning models excel when the data distribution during training aligns with testing data. Yet, their performance diminishes when faced with out-of-distribution (OOD) samples, leading to great interest in the field of OOD detection. Current approaches typically assume that OOD samples originate from an unconcentrated distribution complementary to the training distribution. While this assumption is appropriate in the traditional unsupervised OOD (U-OOD) setting, it proves inadequate when considering the place of deployment of the underlying deep learning model. To better reflect this real-world scenario, we introduce the novel setting of continual U-OOD detection. To tackle this new setting, we propose a method that starts from a U-OOD detector, which is agnostic to the OOD distribution, and slowly updates during deployment to account for the actual OOD distribution. Our method uses a new U-OOD scoring function that combines the Mahalanobis distance with a nearest-neighbor approach. Furthermore, we design a confidence-scaled few-shot OOD detector that outperforms previous methods. We show our method greatly improves upon strong baselines from related fields.



Out-of-distribution Detection in Medical Image Analysis: A survey


Zesheng Hong, Yubiao Yue, Yubin Chen, Huanjie Lin, Yuanmei Luo, Mini Han Wang, Weidong Wang, Jialong Xu, Xiaoqi Yang, Zhenzhang Li, Sihong Xie





Computer-aided diagnostics has benefited from the development of deep learning-based computer vision techniques in recent years. Traditional supervised deep learning methods assume that the test sample is drawn from the same distribution as the training data. However, it is possible to encounter out-of-distribution samples in real-world clinical scenarios, which may cause silent failure in deep learning-based medical image analysis tasks. Recently, research has explored various out-of-distribution (OOD) detection situations and techniques to enable a trustworthy medical AI system. In this survey, we systematically review the recent advances in OOD detection in medical image analysis. We first explore several factors that may cause a distributional shift when using a deep-learning-based model in clinical scenarios, with three different types of distributional shift defined on top of these factors. Then a framework is suggested to categorize and characterize existing solutions, while previous studies are reviewed based on the methodology taxonomy. Our discussion also includes evaluation protocols and metrics, as well as open challenges and under-explored research directions.



When and How Does In-Distribution Label Help Out-of-Distribution Detection?


Xuefeng Du, Yiyou Sun, Yixuan Li





Detecting data points deviating from the training distribution is pivotal for ensuring reliable machine learning. Extensive research has been dedicated to the challenge, spanning classical anomaly detection techniques to contemporary out-of-distribution (OOD) detection approaches. While OOD detection commonly relies on supervised learning from a labeled in-distribution (ID) dataset, anomaly detection may treat the entire ID data as a single class and disregard ID labels. This fundamental distinction raises a significant question that has yet to be rigorously explored: when and how does ID label help OOD detection? This paper bridges this gap by offering a formal understanding to theoretically delineate the impact of ID labels on OOD detection. We employ a graph-theoretic approach, rigorously analyzing the separability of ID data from OOD data in a closed-form manner. Key to our approach is the characterization of data representations through spectral decomposition on the graph. Leveraging these representations, we establish a provable error bound that compares the OOD detection performance with and without ID labels, unveiling conditions for achieving enhanced OOD detection. Lastly, we present empirical results on both simulated and real datasets, validating theoretical guarantees and reinforcing our insights. Code is publicly available at https://github.com/deeplearning-wisc/id_label.

