Performative Prediction with Bandit Feedback: Learning through Reparameterization

Read original: arXiv:2305.01094 - Published 8/14/2024 by Yatong Chen, Wei Tang, Chien-Ju Ho, Yang Liu

🔮

Overview

Performative prediction is a framework for studying social prediction where the data distribution changes in response to the deployment of a model.
Existing work in this field often makes assumptions that are easily violated in practice.
This paper explores performative prediction problems without these assumptions.

Plain English Explanation

Performative prediction is a way of looking at how AI models can affect the data they are trained on. Normally, we assume the data stays the same no matter what model is used. But in some cases, the model itself can change the real-world data it is trying to predict.

For example, imagine an AI model that predicts the popularity of songs. If the model becomes very accurate, and music companies start using it to decide which songs to promote, that could actually change which songs become popular in the future. The model has "performed" and altered the data it was trying to predict.

Existing research on this topic usually relies on three key assumptions:

The "performative risk" (how the model affects the data) is a simple, convex function.
The relationship between the model and data distribution is known in advance.
The model has access to detailed information about how it affects the data.

But in the real world, these assumptions are often violated. This paper explores new ways to do performative prediction without needing those assumptions to hold.

Technical Explanation

The paper proposes a reparameterization framework that reformulates the performative prediction objective as a function of the induced data distribution, rather than the model parameters directly. This allows the researchers to transform the non-convex objective into a convex one, and develop a two-level optimization procedure to provably optimize it.

The first level of optimization iterates on the distribution parameter space, while the second level learns the model that induces a particular target distribution at each iteration. Under mild conditions, this approach achieves sublinear regret - meaning the model's performance improves over time and eventually gets very close to optimal.

Crucially, this method does not require the three assumptions that often limit existing performative prediction research. The authors show their approach can work even when the performative risk is non-convex, the relationship between model and data is unknown, and only zeroth-order (black-box) information about the performative risk is available.

Critical Analysis

The paper makes important theoretical contributions by expanding the scope of performative prediction to more realistic settings. However, the analysis is mostly theoretical, focusing on regret bounds and convexity properties. More empirical validation would be needed to assess the practical benefits and limitations of the proposed approach.

Additionally, the paper does not address some key challenges in performative prediction, such as how to handle distributional shift over time, or how to ensure the model converges to a socially desirable outcome when there are multiple stable equilibria. Further research is needed to tackle these important real-world concerns.

Conclusion

This paper takes a significant step forward in the field of performative prediction by developing a new optimization framework that does not rely on restrictive assumptions. By reparameterizing the objective and using a two-level optimization procedure, the authors show it's possible to achieve provable performance guarantees without requiring detailed knowledge of how the model affects the data distribution.

While more empirical validation is needed, this work opens up new possibilities for applying performative prediction to a wider range of real-world scenarios where the assumptions of prior research do not hold. As AI systems become increasingly pervasive, understanding and accounting for their performative effects will only grow more important.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🔮

Performative Prediction with Bandit Feedback: Learning through Reparameterization

Yatong Chen, Wei Tang, Chien-Ju Ho, Yang Liu

Performative prediction, as introduced by Perdomo et al, is a framework for studying social prediction in which the data distribution itself changes in response to the deployment of a model. Existing work in this field usually hinges on three assumptions that are easily violated in practice: that the performative risk is convex over the deployed model, that the mapping from the model to the data distribution is known to the model designer in advance, and the first-order information of the performative risk is available. In this paper, we initiate the study of performative prediction problems that do not require these assumptions. Specifically, we develop a reparameterization framework that reparametrizes the performative prediction objective as a function of the induced data distribution. We then develop a two-level zeroth-order optimization procedure, where the first level performs iterative optimization on the distribution parameter space, and the second level learns the model that induces a particular target distribution at each iteration. Under mild conditions, this reparameterization allows us to transform the non-convex objective into a convex one and achieve provable regret guarantees. In particular, we provide a regret bound that is sublinear in the total number of performative samples taken and is only polynomial in the dimension of the model parameter.

8/14/2024

🔮

Performative Prediction with Neural Networks

Mehrnaz Mofakhami, Ioannis Mitliagkas, Gauthier Gidel

Performative prediction is a framework for learning models that influence the data they intend to predict. We focus on finding classifiers that are performatively stable, i.e. optimal for the data distribution they induce. Standard convergence results for finding a performatively stable classifier with the method of repeated risk minimization assume that the data distribution is Lipschitz continuous to the model's parameters. Under this assumption, the loss must be strongly convex and smooth in these parameters; otherwise, the method will diverge for some problems. In this work, we instead assume that the data distribution is Lipschitz continuous with respect to the model's predictions, a more natural assumption for performative systems. As a result, we are able to significantly relax the assumptions on the loss function. In particular, we do not need to assume convexity with respect to the model's parameters. As an illustration, we introduce a resampling procedure that models realistic distribution shifts and show that it satisfies our assumptions. We support our theory by showing that one can learn performatively stable classifiers with neural networks making predictions about real data that shift according to our proposed procedure.

8/27/2024

🛠️

Plug-in Performative Optimization

Licong Lin, Tijana Zrnic

When predictions are performative, the choice of which predictor to deploy influences the distribution of future observations. The overarching goal in learning under performativity is to find a predictor that has low emph{performative risk}, that is, good performance on its induced distribution. One family of solutions for optimizing the performative risk, including bandits and other derivative-free methods, is agnostic to any structure in the performative feedback, leading to exceedingly slow convergence rates. A complementary family of solutions makes use of explicit emph{models} for the feedback, such as best-response models in strategic classification, enabling faster rates. However, these rates critically rely on the feedback model being correct. In this work we study a general protocol for making use of possibly misspecified models in performative prediction, called emph{plug-in performative optimization}. We show this solution can be far superior to model-agnostic strategies, as long as the misspecification is not too extreme. Our results support the hypothesis that models, even if misspecified, can indeed help with learning in performative settings.

5/29/2024

Performative Prediction on Games and Mechanism Design

Ant'onio G'ois, Mehrnaz Mofakhami, Fernando P. Santos, Simon Lacoste-Julien, Gauthier Gidel

Predictions often influence the reality which they aim to predict, an effect known as performativity. Existing work focuses on accuracy maximization under this effect, but model deployment may have important unintended impacts, especially in multiagent scenarios. In this work, we investigate performative prediction in a concrete game-theoretic setting where social welfare is an alternative objective to accuracy maximization. We explore a collective risk dilemma scenario where maximising accuracy can negatively impact social welfare, when predicting collective behaviours. By assuming knowledge of a Bayesian agent behavior model, we then show how to achieve better trade-offs and use them for mechanism design.

8/12/2024