Principled Preferential Bayesian Optimization

2402.05367

Published 5/30/2024 by Wenjie Xu, Wenbin Wang, Yuning Jiang, Bratislav Svetozarevic, Colin N. Jones

🛠️

Abstract

We study the problem of preferential Bayesian optimization (BO), where we aim to optimize a black-box function with only preference feedback over a pair of candidate solutions. Inspired by the likelihood ratio idea, we construct a confidence set of the black-box function using only the preference feedback. An optimistic algorithm with an efficient computational method is then developed to solve the problem, which enjoys an information-theoretic bound on the total cumulative regret, a first-of-its-kind for preferential BO. This bound further allows us to design a scheme to report an estimated best solution, with a guaranteed convergence rate. Experimental results on sampled instances from Gaussian processes, standard test functions, and a thermal comfort optimization problem all show that our method stably achieves better or competitive performance as compared to the existing state-of-the-art heuristics, which, however, do not have theoretical guarantees on regret bounds or convergence.

Create account to get full access

Overview

Preferential Bayesian Optimization (BO) aims to optimize a black-box function using only preference feedback over pairs of candidate solutions, rather than numerical function values.
The paper proposes a method to construct a confidence set of the black-box function using the preference feedback, and then develops an optimistic algorithm to solve the problem.
The method comes with theoretical guarantees on the total cumulative regret, which is a first for preferential BO, and also allows for reporting an estimated best solution with a guaranteed convergence rate.
Experiments show the method outperforms existing heuristic approaches, which lack these theoretical guarantees.

Plain English Explanation

In many real-world optimization problems, we may only have access to relative preferences between different solutions, rather than the actual numerical values of the function we are trying to optimize. This is known as preferential Bayesian optimization.

The researchers in this paper propose a new method to tackle this challenge. Instead of trying to estimate the actual function values, they construct a confidence set of the function using only the preference feedback. This confidence set represents the range of function values that are consistent with the observed preferences.

They then develop an "optimistic" algorithm that chooses the next solution to evaluate based on this confidence set. This algorithm enjoys strong theoretical guarantees on its performance - specifically, it has an upper bound on the total "regret" (or difference between the optimal function value and the values of the solutions it has chosen so far).

This is the first time such theoretical guarantees have been shown for a preferential Bayesian optimization method. The guarantees also allow the researchers to design a scheme to report an estimated best solution, with a promise that this solution will converge to the true optimum.

The experiments show that this new method outperforms existing heuristic approaches, which may work well in practice but lack any formal performance guarantees.

Technical Explanation

The core technical idea is to construct a confidence set of the black-box function using only the preference feedback. Inspired by the likelihood ratio idea from statistics, the researchers develop a computationally efficient way to build this confidence set.

They then propose an optimistic algorithm that chooses the next solution to evaluate based on maximizing an "optimistic" estimate of the function value within this confidence set. This algorithm is shown to have an information-theoretic bound on the total cumulative regret, a first-of-its-kind result for preferential Bayesian optimization.

The regret bound further allows the researchers to design a scheme to report an estimated best solution, with a guaranteed convergence rate to the true optimum. This is an important practical consideration, as it allows the method to not only optimize well, but also provide a high-quality solution at the end.

The experiments evaluate the new method on a variety of test problems, including sampled instances from Gaussian processes, standard test functions, and a thermal comfort optimization problem. The results show that the proposed approach stably achieves better or competitive performance compared to existing heuristic methods, which however lack the theoretical guarantees.

Critical Analysis

The paper makes a significant contribution by providing the first theoretical guarantees for preferential Bayesian optimization. This is an important step forward, as most existing approaches in this domain rely on heuristics without any formal performance analysis.

That said, the theoretical results assume the preference feedback is noiseless, which may not always hold in practice. The authors acknowledge this limitation and suggest extending the analysis to the heteroscedastic case with noisy preferences as an interesting direction for future work.

Additionally, the paper does not address the issue of safety constraints in the optimization process, which is an important consideration in many real-world applications. Incorporating such constraints into the preferential BO framework could be another fruitful area for further research.

Finally, while the experiments demonstrate the empirical advantages of the proposed method, it would be helpful to have a deeper analysis of its performance on a wider range of problems, including those with unknown physical constraints. This could provide more insights into the method's strengths, weaknesses, and the types of optimization tasks it is best suited for.

Conclusion

This paper presents a novel approach to preferential Bayesian optimization that comes with strong theoretical guarantees on the optimization performance. By constructing a confidence set of the black-box function using only preference feedback, and then developing an optimistic algorithm to navigate this set, the method is able to efficiently optimize the function while also providing a high-quality estimated solution.

The theoretical and empirical results showcase the advantages of this approach over existing heuristic methods, which lack such formal performance bounds. While the current work has some limitations, it opens up several promising directions for future research in preferential optimization, with the potential to make these techniques more robust and applicable to a wider range of real-world problems.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

🛠️

Pseudo-Bayesian Optimization

Haoxian Chen, Henry Lam

Bayesian Optimization is a popular approach for optimizing expensive black-box functions. Its key idea is to use a surrogate model to approximate the objective and, importantly, quantify the associated uncertainty that allows a sequential search of query points that balance exploitation-exploration. Gaussian process (GP) has been a primary candidate for the surrogate model, thanks to its Bayesian-principled uncertainty quantification power and modeling flexibility. However, its challenges have also spurred an array of alternatives whose convergence properties could be more opaque. Motivated by these, we study in this paper an axiomatic framework that elicits the minimal requirements to guarantee black-box optimization convergence that could apply beyond GP-based methods. Moreover, we leverage the design freedom in our framework, which we call Pseudo-Bayesian Optimization, to construct empirically superior algorithms. In particular, we show how using simple local regression, and a suitable randomized prior construction to quantify uncertainty, not only guarantees convergence but also consistently outperforms state-of-the-art benchmarks in examples ranging from high-dimensional synthetic experiments to realistic hyperparameter tuning and robotic applications.

6/21/2024

stat.ML cs.LG

Efficient Black-box Adversarial Attacks via Bayesian Optimization Guided by a Function Prior

Shuyu Cheng, Yibo Miao, Yinpeng Dong, Xiao Yang, Xiao-Shan Gao, Jun Zhu

This paper studies the challenging black-box adversarial attack that aims to generate adversarial examples against a black-box model by only using output feedback of the model to input queries. Some previous methods improve the query efficiency by incorporating the gradient of a surrogate white-box model into query-based attacks due to the adversarial transferability. However, the localized gradient is not informative enough, making these methods still query-intensive. In this paper, we propose a Prior-guided Bayesian Optimization (P-BO) algorithm that leverages the surrogate model as a global function prior in black-box adversarial attacks. As the surrogate model contains rich prior information of the black-box one, P-BO models the attack objective with a Gaussian process whose mean function is initialized as the surrogate model's loss. Our theoretical analysis on the regret bound indicates that the performance of P-BO may be affected by a bad prior. Therefore, we further propose an adaptive integration strategy to automatically adjust a coefficient on the function prior by minimizing the regret bound. Extensive experiments on image classifiers and large vision-language models demonstrate the superiority of the proposed algorithm in reducing queries and improving attack success rates compared with the state-of-the-art black-box attacks. Code is available at https://github.com/yibo-miao/PBO-Attack.

5/30/2024

cs.LG cs.AI cs.CR cs.CV stat.ML

Provably Efficient Bayesian Optimization with Unbiased Gaussian Process Hyperparameter Estimation

Huong Ha, Vu Nguyen, Hung Tran-The, Hongyu Zhang, Xiuzhen Zhang, Anton van den Hengel

Gaussian process (GP) based Bayesian optimization (BO) is a powerful method for optimizing black-box functions efficiently. The practical performance and theoretical guarantees of this approach depend on having the correct GP hyperparameter values, which are usually unknown in advance and need to be estimated from the observed data. However, in practice, these estimations could be incorrect due to biased data sampling strategies used in BO. This can lead to degraded performance and break the sub-linear global convergence guarantee of BO. To address this issue, we propose a new BO method that can sub-linearly converge to the objective function's global optimum even when the true GP hyperparameters are unknown in advance and need to be estimated from the observed data. Our method uses a multi-armed bandit technique (EXP3) to add random data points to the BO process, and employs a novel training loss function for the GP hyperparameter estimation process that ensures consistent estimation. We further provide theoretical analysis of our proposed method. Finally, we demonstrate empirically that our method outperforms existing approaches on various synthetic and real-world problems.

6/7/2024

stat.ML cs.LG

🛠️

Heteroscedastic Preferential Bayesian Optimization with Informative Noise Distributions

Marshal Arijona Sinaga, Julien Martinelli, Vikas Garg, Samuel Kaski

Preferential Bayesian optimization (PBO) is a sample-efficient framework for learning human preferences between candidate designs. PBO classically relies on homoscedastic noise models to represent human aleatoric uncertainty. Yet, such noise fails to accurately capture the varying levels of human aleatoric uncertainty, particularly when the user possesses partial knowledge among different pairs of candidates. For instance, a chemist with solid expertise in glucose-related molecules may easily compare two compounds from that family while struggling to compare alcohol-related molecules. Currently, PBO overlooks this uncertainty during the search for a new candidate through the maximization of the acquisition function, consequently underestimating the risk associated with human uncertainty. To address this issue, we propose a heteroscedastic noise model to capture human aleatoric uncertainty. This model adaptively assigns noise levels based on the distance of a specific input to a predefined set of reliable inputs known as anchors provided by the human. Anchors encapsulate partial knowledge and offer insight into the comparative difficulty of evaluating different candidate pairs. Such a model can be seamlessly integrated into the acquisition function, thus leading to candidate design pairs that elegantly trade informativeness and ease of comparison for the human expert. We perform an extensive empirical evaluation of the proposed approach, demonstrating a consistent improvement over homoscedastic PBO.

5/24/2024

cs.LG stat.ML