Conformal Predictive Systems Under Covariate Shift

2404.15018

Published 4/24/2024 by Jef Jonkers, Glenn Van Wallendael, Luc Duchateau, Sofie Van Hoecke

Abstract

Conformal Predictive Systems (CPS) offer a versatile framework for constructing predictive distributions, allowing for calibrated inference and informative decision-making. However, their applicability has been limited to scenarios adhering to the Independent and Identically Distributed (IID) model assumption. This paper extends CPS to accommodate scenarios characterized by covariate shifts. We therefore propose Weighted CPS (WCPS), akin to Weighted Conformal Prediction (WCP), leveraging likelihood ratios between training and testing covariate distributions. This extension enables the construction of nonparametric predictive distributions capable of handling covariate shifts. We present theoretical underpinnings and conjectures regarding the validity and efficacy of WCPS and demonstrate its utility through empirical evaluations on both synthetic and real-world datasets. Our simulation experiments indicate that WCPS are probabilistically calibrated under covariate shift.

Overview

Conformal Predictive Systems (CPS) are a versatile framework for constructing predictive distributions, enabling calibrated inference and informative decision-making.
However, CPS have been limited to scenarios adhering to the Independent and Identically Distributed (IID) model assumption.
This paper extends CPS to accommodate scenarios characterized by covariate shifts, proposing Weighted CPS (WCPS) to handle such situations.

Plain English Explanation

Conformal Predictive Systems (CPS) are a powerful tool for making predictions and informing decision-making. They work by constructing predictive distributions that are well-calibrated, meaning they accurately reflect the uncertainty in the predictions.

However, CPS have only been applicable in situations where the data follows a specific pattern, known as the Independent and Identically Distributed (IID) model. This means every data point is drawn independently of the others and from the same underlying distribution.

In the real world, data often doesn't follow this IID pattern. For example, the distribution of the input features (covariates) can differ between the data a model was trained on and the data it is later applied to, a phenomenon known as covariate shift. This paper proposes a new approach called Weighted CPS (WCPS) that can handle these covariate shifts, allowing CPS to be used in a wider range of real-world scenarios.

Technical Explanation

The paper introduces Weighted Conformal Predictive Systems (WCPS), an extension of Conformal Predictive Systems, akin to Weighted Conformal Prediction (WCP), that can accommodate situations with covariate shifts. Covariate shifts occur when the distribution of the input features (covariates) changes between the training and testing data, violating the IID assumption.
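To make the notion of a covariate shift concrete, here is a minimal Python sketch of the likelihood ratio between test and training covariate distributions. It assumes both distributions are one-dimensional Gaussians with a shared standard deviation, which is purely an illustrative assumption and not a setting taken from the paper. The function name `gaussian_likelihood_ratio` is hypothetical.

```python
import numpy as np

def gaussian_likelihood_ratio(x, mu_train, mu_test, sigma):
    """Density ratio p_test(x) / p_train(x) for two Gaussians sharing sigma.

    Illustrative assumption: both covariate distributions are known Gaussians.
    """
    x = np.asarray(x, dtype=float)
    # Log-densities up to a shared normalizing constant, which cancels in the ratio.
    log_test = -((x - mu_test) ** 2) / (2.0 * sigma**2)
    log_train = -((x - mu_train) ** 2) / (2.0 * sigma**2)
    return np.exp(log_test - log_train)

# Training covariates centered at 0, test covariates centered at 1.
w = gaussian_likelihood_ratio([0.5, 1.5], mu_train=0.0, mu_test=1.0, sigma=1.0)
```

Points that are more typical of the test distribution than of the training distribution receive a ratio above 1. In practice the two densities are usually unknown, so the ratio would have to be estimated from data.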

WCPS leverages likelihood ratios between the training and testing covariate distributions to construct nonparametric predictive distributions that can handle such covariate shifts. This allows WCPS to make calibrated predictions even when the data deviates from the IID model.
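The weighting idea can be sketched in Python for a split-conformal setting. This is a rough illustration by analogy with Weighted Conformal Prediction, not the authors' exact algorithm: it assumes residual conformity scores `y - prediction` on a held-out calibration set, likelihood-ratio weights that are already known, and a randomization variable `tau` in [0, 1]. All function and parameter names are hypothetical.

```python
import numpy as np

def weighted_split_cpd(cal_scores, cal_weights, test_weight, test_pred, y, tau=0.5):
    """Weighted predictive distribution value Q(y, tau) for one test point.

    cal_scores  : calibration residuals y_i - prediction_i (assumed score)
    cal_weights : likelihood-ratio weights w(x_i) of the calibration points
    test_weight : likelihood-ratio weight w(x_test) of the test point
    test_pred   : point prediction for the test covariates
    """
    cal_scores = np.asarray(cal_scores, dtype=float)
    cal_weights = np.asarray(cal_weights, dtype=float)
    # Normalize so calibration and test probabilities sum to one.
    total = cal_weights.sum() + test_weight
    p_cal = cal_weights / total
    p_test = test_weight / total
    # Conformity score that the candidate label y would receive.
    s = y - test_pred
    below = p_cal[cal_scores < s].sum()
    ties = p_cal[cal_scores == s].sum()
    # Ties and the test point's own mass are split by tau.
    return float(below + tau * (ties + p_test))

# With equal weights this reduces to an unweighted split-conformal CDF.
q = weighted_split_cpd([-1.0, 0.0, 1.0], [1.0, 1.0, 1.0], 1.0, test_pred=0.0, y=0.5)
```

With all weights equal, each calibration point and the test point carry mass 1/(n+1), which recovers the unweighted case. Unequal weights shift mass toward calibration points that resemble the test covariates.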

The paper presents theoretical underpinnings of WCPS together with conjectures about its validity and efficacy. Through empirical evaluations on both synthetic and real-world datasets, the authors demonstrate the utility of WCPS in scenarios with covariate shifts, and their simulation experiments indicate that WCPS are probabilistically calibrated under covariate shift.
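Probabilistic calibration is commonly checked by evaluating the predictive distribution at the observed test labels and testing whether the resulting values look uniform on [0, 1]. The Python sketch below computes the Kolmogorov-Smirnov distance between such values and the uniform distribution. It is a generic diagnostic offered as an assumption about how one might check calibration, not the evaluation protocol used in the paper, and the name `pit_ks_distance` is hypothetical.

```python
import numpy as np

def pit_ks_distance(pit_values):
    """Largest gap between the empirical CDF of the values and the uniform CDF."""
    u = np.sort(np.asarray(pit_values, dtype=float))
    n = u.size
    # Gap just after each jump of the empirical CDF.
    upper = np.arange(1, n + 1) / n - u
    # Gap just before each jump of the empirical CDF.
    lower = u - np.arange(0, n) / n
    return float(max(upper.max(), lower.max()))

# Evenly spread values are close to uniform; identical values are not.
well_spread = pit_ks_distance((np.arange(10) + 0.5) / 10)
degenerate = pit_ks_distance([0.5, 0.5, 0.5, 0.5])
```

A distance near zero is consistent with calibration, while a large distance suggests the predictive distributions are systematically off.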

Critical Analysis

The paper introduces an important extension of Conformal Predictive Systems that addresses a key limitation – the reliance on the IID assumption. By proposing WCPS, the authors have expanded the applicability of CPS to more realistic scenarios with covariate shifts.

However, the theoretical underpinnings of WCPS are not fully established: alongside its theoretical results, the paper states some claims about the validity and efficacy of WCPS as conjectures rather than proven theorems. Further research may be needed to solidify the theoretical foundations of WCPS and understand its properties in depth.

Additionally, the empirical evaluation, while promising, is still limited in scope. Exploring the performance of WCPS on a wider range of datasets and scenarios would help strengthen the case for its adoption in practice.

Conclusion

This paper presents a significant advancement in the field of Conformal Predictive Systems by introducing Weighted CPS (WCPS), a framework that can handle covariate shifts in the data. This expansion of the CPS methodology enables its use in a broader range of real-world applications, where the data often deviates from the restrictive IID assumptions.

The theoretical and empirical insights provided in this work lay the groundwork for further research and development in this area, potentially leading to more robust and versatile predictive systems that can better accommodate the complexities of the real world.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Training-Conditional Coverage Bounds under Covariate Shift

Mehrdad Pournaderi, Yu Xiang

Training-conditional coverage guarantees in conformal prediction concern the concentration of the error distribution, conditional on the training data, below some nominal level. The conformal prediction methodology has recently been generalized to the covariate shift setting, namely, the covariate distribution changes between the training and test data. In this paper, we study the training-conditional coverage properties of a range of conformal prediction methods under covariate shift via a weighted version of the Dvoretzky-Kiefer-Wolfowitz (DKW) inequality tailored for distribution change. The result for the split conformal method is almost assumption-free, while the results for the full conformal and jackknife+ methods rely on strong assumptions including the uniform stability of the training algorithm.

5/28/2024

stat.ML cs.LG

Robust Conformal Prediction Using Privileged Information

Shai Feldman, Yaniv Romano

We develop a method to generate prediction sets with a guaranteed coverage rate that is robust to corruptions in the training data, such as missing or noisy variables. Our approach builds on conformal prediction, a powerful framework to construct prediction sets that are valid under the i.i.d. assumption. Importantly, naively applying conformal prediction does not provide reliable predictions in this setting, due to the distribution shift induced by the corruptions. To account for the distribution shift, we assume access to privileged information (PI). The PI is formulated as additional features that explain the distribution shift; however, they are only available during training and absent at test time. We approach this problem by introducing a novel generalization of weighted conformal prediction and support our method with theoretical coverage guarantees. Empirical experiments on both real and synthetic datasets indicate that our approach achieves a valid coverage rate and constructs more informative predictions compared to existing methods, which are not supported by theoretical guarantees.

6/11/2024

cs.LG

Adapting Conformal Prediction to Distribution Shifts Without Labels

Kevin Kasa, Zhiyu Zhang, Heng Yang, Graham W. Taylor

Conformal prediction (CP) enables machine learning models to output prediction sets with guaranteed coverage rate, assuming exchangeable data. Unfortunately, the exchangeability assumption is frequently violated due to distribution shifts in practice, and the challenge is often compounded by the lack of ground truth labels at test time. Focusing on classification in this paper, our goal is to improve the quality of CP-generated prediction sets using only unlabeled data from the test domain. This is achieved by two new methods called ECP and EACP, that adjust the score function in CP according to the base model's uncertainty on the unlabeled test data. Through extensive experiments on a number of large-scale datasets and neural network architectures, we show that our methods provide consistent improvement over existing baselines and nearly match the performance of supervised algorithms.

6/4/2024

cs.LG stat.ML

Conformal Counterfactual Inference under Hidden Confounding

Zonghao Chen, Ruocheng Guo, Jean-François Ton, Yang Liu

Personalized decision making requires the knowledge of potential outcomes under different treatments, and confidence intervals about the potential outcomes further enrich this decision-making process and improve its reliability in high-stakes scenarios. Predicting potential outcomes along with their uncertainty in a counterfactual world poses a fundamental challenge in causal inference. Existing methods that construct confidence intervals for counterfactuals either rely on the assumption of strong ignorability, or need access to un-identifiable lower and upper bounds that characterize the difference between observational and interventional distributions. To overcome these limitations, we first propose a novel approach wTCP-DR based on transductive weighted conformal prediction, which provides confidence intervals for counterfactual outcomes with marginal coverage guarantees, even under hidden confounding. With less restrictive assumptions, our approach requires access to a fraction of interventional data (from randomized controlled trials) to account for the covariate shift from observational distribution to interventional distribution. Theoretical results explicitly demonstrate the conditions under which our algorithm is strictly advantageous to the naive method that only uses interventional data. After ensuring valid intervals on counterfactuals, it is straightforward to construct intervals for individual treatment effects (ITEs). We demonstrate our method across synthetic and real-world data, including recommendation systems, to verify the superiority of our methods compared against state-of-the-art baselines in terms of both coverage and efficiency.

5/22/2024

cs.LG