A Quadrature Approach for General-Purpose Batch Bayesian Optimization via Probabilistic Lifting

2404.12219

Published 4/22/2024 by Masaki Adachi, Satoshi Hayakawa, Martin J{o}rgensen, Saad Hamid, Harald Oberhauser, Michael A. Osborne

cs.LG cs.NA stat.ML

A Quadrature Approach for General-Purpose Batch Bayesian Optimization via Probabilistic Lifting

Abstract

Parallelisation in Bayesian optimisation is a common strategy but faces several challenges: the need for flexibility in acquisition functions and kernel choices, flexibility dealing with discrete and continuous variables simultaneously, model misspecification, and lastly fast massive parallelisation. To address these challenges, we introduce a versatile and modular framework for batch Bayesian optimisation via probabilistic lifting with kernel quadrature, called SOBER, which we present as a Python library based on GPyTorch/BoTorch. Our framework offers the following unique benefits: (1) Versatility in downstream tasks under a unified approach. (2) A gradient-free sampler, which does not require the gradient of acquisition functions, offering domain-agnostic sampling (e.g., discrete and mixed variables, non-Euclidean space). (3) Flexibility in domain prior distribution. (4) Adaptive batch size (autonomous determination of the optimal batch size). (5) Robustness against a misspecified reproducing kernel Hilbert space. (6) Natural stopping criterion.

Create account to get full access

Overview

This paper introduces a new approach for general-purpose batch Bayesian optimization using a quadrature method and probabilistic lifting.
The authors propose a framework that can handle complex objective functions and constraints, and provide theoretical guarantees on the convergence of their method.
The paper compares the performance of their approach to existing Bayesian optimization methods on various test problems, demonstrating its effectiveness.

Plain English Explanation

Bayesian optimization is a powerful technique for optimizing complex functions, such as those used in machine learning models or engineering design problems. The key idea is to build a probabilistic model of the function (often using a Gaussian process) and then strategically sample new points to evaluate, with the goal of finding the global optimum as efficiently as possible.

However, traditional Bayesian optimization methods have limitations when it comes to handling batch evaluations (where multiple points are evaluated in parallel) or complex constraints on the optimization problem. This paper presents a new approach that addresses these challenges.

The authors propose a "quadrature" method, which essentially means they use a numerical integration technique to estimate the expected improvement of potential new sample points. This allows them to handle more complex objective functions and constraints, as opposed to relying on simpler analytical expressions.

Additionally, the authors introduce the concept of "probabilistic lifting", which involves representing the optimization problem in a higher-dimensional space to better capture the underlying structure. This added flexibility helps the method converge to the global optimum more efficiently.

The paper demonstrates the effectiveness of this new approach on a variety of test problems, showing that it can outperform existing Bayesian optimization techniques, especially when dealing with batch evaluations and complex constraints.

Technical Explanation

The core of this paper is a new Bayesian optimization framework that uses a quadrature method and probabilistic lifting to handle general-purpose batch optimization problems with complex objective functions and constraints.

The key components of the method are:

Gaussian Process Modeling: The authors model the unknown objective function using a Gaussian process, which provides a probabilistic representation of the function.
Quadrature-based Acquisition Function: Instead of relying on analytical expressions for the acquisition function (which guides the selection of new sample points), the authors use a quadrature-based approach. This allows them to handle more complex objective functions and constraints.
Probabilistic Lifting: The authors introduce the concept of "probabilistic lifting", which involves representing the optimization problem in a higher-dimensional space. This added flexibility helps the method converge to the global optimum more efficiently.
Batch Optimization: The proposed framework can handle batch evaluations, where multiple points are evaluated in parallel, rather than sequentially.

The authors provide theoretical guarantees on the convergence of their method, and compare its performance to existing Bayesian optimization techniques on a range of test problems. The results demonstrate the effectiveness of their approach, particularly when dealing with complex objective functions and constraints.

Critical Analysis

The authors have addressed an important challenge in Bayesian optimization, which is the need for more general-purpose methods that can handle complex objective functions and constraints. Their use of quadrature-based acquisition functions and probabilistic lifting is a promising approach, and the theoretical guarantees and empirical results are compelling.

However, the paper does not discuss the computational cost of the proposed method, which is an important practical consideration. The quadrature-based approach and higher-dimensional representation may incur significant computational overhead, especially for high-dimensional optimization problems.

Additionally, the paper focuses on batch optimization, but does not explore the performance of the method in the sequential setting (where only one point is evaluated at a time). It would be valuable to understand how the method compares to existing Bayesian optimization techniques in the sequential case, as this is a common use case.

Finally, the authors could have provided more insights into the limitations of their approach, such as the types of objective functions and constraints that may pose challenges, or the sensitivity of the method to hyperparameter choices. Acknowledging and discussing these issues would help readers develop a more nuanced understanding of the method's strengths and weaknesses.

Conclusion

This paper presents a novel Bayesian optimization framework that addresses the limitations of existing methods when it comes to handling complex objective functions, constraints, and batch evaluations. The authors' use of quadrature-based acquisition functions and probabilistic lifting is a promising approach that can expand the applicability of Bayesian optimization to a wider range of real-world problems.

While the paper provides strong theoretical and empirical support for the method, there are still some open questions regarding its computational efficiency and performance in the sequential optimization setting. Nonetheless, this work represents an important step forward in the field of Bayesian optimization, and the ideas presented here could inspire further research and development in this area.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

🛠️

Pseudo-Bayesian Optimization

Haoxian Chen, Henry Lam

Bayesian Optimization is a popular approach for optimizing expensive black-box functions. Its key idea is to use a surrogate model to approximate the objective and, importantly, quantify the associated uncertainty that allows a sequential search of query points that balance exploitation-exploration. Gaussian process (GP) has been a primary candidate for the surrogate model, thanks to its Bayesian-principled uncertainty quantification power and modeling flexibility. However, its challenges have also spurred an array of alternatives whose convergence properties could be more opaque. Motivated by these, we study in this paper an axiomatic framework that elicits the minimal requirements to guarantee black-box optimization convergence that could apply beyond GP-based methods. Moreover, we leverage the design freedom in our framework, which we call Pseudo-Bayesian Optimization, to construct empirically superior algorithms. In particular, we show how using simple local regression, and a suitable randomized prior construction to quantify uncertainty, not only guarantees convergence but also consistently outperforms state-of-the-art benchmarks in examples ranging from high-dimensional synthetic experiments to realistic hyperparameter tuning and robotic applications.

6/21/2024

stat.ML cs.LG

A survey and benchmark of high-dimensional Bayesian optimization of discrete sequences

Miguel Gonz'alez-Duque, Richard Michael, Simon Bartels, Yevgen Zainchkovskyy, S{o}ren Hauberg, Wouter Boomsma

Optimizing discrete black-box functions is key in several domains, e.g. protein engineering and drug design. Due to the lack of gradient information and the need for sample efficiency, Bayesian optimization is an ideal candidate for these tasks. Several methods for high-dimensional continuous and categorical Bayesian optimization have been proposed recently. However, our survey of the field reveals highly heterogeneous experimental set-ups across methods and technical barriers for the replicability and application of published algorithms to real-world tasks. To address these issues, we develop a unified framework to test a vast array of high-dimensional Bayesian optimization methods and a collection of standardized black-box functions representing real-world application domains in chemistry and biology. These two components of the benchmark are each supported by flexible, scalable, and easily extendable software libraries (poli and poli-baselines), allowing practitioners to readily incorporate new optimization objectives or discrete optimizers. Project website: https://machinelearninglifescience.github.io/hdbo_benchmark

6/10/2024

cs.LG stat.ML

Pareto Front-Diverse Batch Multi-Objective Bayesian Optimization

Alaleh Ahmadianshalchi, Syrine Belakaria, Janardhan Rao Doppa

We consider the problem of multi-objective optimization (MOO) of expensive black-box functions with the goal of discovering high-quality and diverse Pareto fronts where we are allowed to evaluate a batch of inputs. This problem arises in many real-world applications including penicillin production where diversity of solutions is critical. We solve this problem in the framework of Bayesian optimization (BO) and propose a novel approach referred to as Pareto front-Diverse Batch Multi-Objective BO (PDBO). PDBO tackles two important challenges: 1) How to automatically select the best acquisition function in each BO iteration, and 2) How to select a diverse batch of inputs by considering multiple objectives. We propose principled solutions to address these two challenges. First, PDBO employs a multi-armed bandit approach to select one acquisition function from a given library. We solve a cheap MOO problem by assigning the selected acquisition function for each expensive objective function to obtain a candidate set of inputs for evaluation. Second, it utilizes Determinantal Point Processes (DPPs) to choose a Pareto-front-diverse batch of inputs for evaluation from the candidate set obtained from the first step. The key parameters for the methods behind these two steps are updated after each round of function evaluations. Experiments on multiple MOO benchmarks demonstrate that PDBO outperforms prior methods in terms of both the quality and diversity of Pareto solutions.

6/14/2024

cs.LG cs.AI

Provably Efficient Bayesian Optimization with Unbiased Gaussian Process Hyperparameter Estimation

Huong Ha, Vu Nguyen, Hung Tran-The, Hongyu Zhang, Xiuzhen Zhang, Anton van den Hengel

Gaussian process (GP) based Bayesian optimization (BO) is a powerful method for optimizing black-box functions efficiently. The practical performance and theoretical guarantees of this approach depend on having the correct GP hyperparameter values, which are usually unknown in advance and need to be estimated from the observed data. However, in practice, these estimations could be incorrect due to biased data sampling strategies used in BO. This can lead to degraded performance and break the sub-linear global convergence guarantee of BO. To address this issue, we propose a new BO method that can sub-linearly converge to the objective function's global optimum even when the true GP hyperparameters are unknown in advance and need to be estimated from the observed data. Our method uses a multi-armed bandit technique (EXP3) to add random data points to the BO process, and employs a novel training loss function for the GP hyperparameter estimation process that ensures consistent estimation. We further provide theoretical analysis of our proposed method. Finally, we demonstrate empirically that our method outperforms existing approaches on various synthetic and real-world problems.

6/7/2024

stat.ML cs.LG