Fast post-process Bayesian inference with Variational Sparse Bayesian Quadrature

Read original: arXiv:2303.05263 - Published 6/19/2024 by Chengkun Li, Gr'egoire Clart'e, Martin J{o}rgensen, Luigi Acerbi

🤯

Overview

Researchers propose a new framework called "post-process Bayesian inference" to quickly approximate Bayesian posterior distributions using existing model evaluations.
They introduce a specific method called "Variational Sparse Bayesian Quadrature" (VSBQ) that builds a sparse Gaussian process surrogate model of the log posterior and then uses variational inference to approximate the posterior.
The method allows for fast posterior inference without requiring any additional model evaluations, by reusing optimization traces that may already be available.
Experiments show VSBQ can produce high-quality posterior approximations on challenging synthetic and real-world problems from computational neuroscience.

Plain English Explanation

In many real-world applications, researchers may have access to a large number of model evaluations from previous optimization runs. However, traditional Bayesian inference techniques often don't take advantage of this existing information. The researchers propose a new framework called "post-process Bayesian inference" to quickly approximate Bayesian posterior distributions using these pre-existing model evaluations.

Within this framework, they introduce a specific method called "Variational Sparse Bayesian Quadrature" (VSBQ). VSBQ works by first building a sparse Gaussian process surrogate model of the log posterior density function, using the available model evaluations. It then leverages sparse Gaussian process Bayesian quadrature combined with variational inference to efficiently approximate the posterior distribution over the surrogate model, without requiring any additional model calls.

The key advantage of VSBQ is that it can produce high-quality posterior approximations by simply post-processing existing optimization traces, rather than having to run the model from scratch. This can save a significant amount of computational time, especially for models with expensive evaluations. The researchers validate their method on challenging synthetic scenarios as well as real-world applications from computational neuroscience, demonstrating its effectiveness.

Technical Explanation

The paper proposes a framework called "post-process Bayesian inference" to quickly approximate Bayesian posterior distributions using a collection of pre-existing model evaluations, such as from maximum-a-posteriori (MAP) optimization runs. Traditional approximate inference techniques often do not take full advantage of this available information.

Within this framework, the authors introduce a specific method called "Variational Sparse Bayesian Quadrature" (VSBQ). VSBQ first builds a sparse Gaussian process (GP) surrogate model of the log posterior density function, using the existing target density evaluations. It then leverages sparse GP Bayesian quadrature combined with variational inference to efficiently approximate the posterior distribution over the surrogate model.

The key steps of VSBQ are:

Construct a sparse GP model of the log posterior using the existing target density evaluations.
Utilize sparse GP Bayesian quadrature to compute the posterior moments over the surrogate model.
Apply variational inference to obtain a fast approximate posterior distribution.

The authors validate VSBQ on challenging synthetic scenarios as well as real-world applications from computational neuroscience, such as Bayesian inference for spiking neural network models. The experiments demonstrate that VSBQ can build high-quality posterior approximations by post-processing existing optimization traces, without requiring any additional model evaluations.

Critical Analysis

The paper presents a promising framework and method for efficiently approximating Bayesian posteriors using pre-existing model evaluations. One potential limitation is that the quality of the VSBQ posterior approximation may depend on the quality and coverage of the available target density evaluations. If the existing evaluations do not sufficiently explore the parameter space, the resulting GP surrogate model may be inaccurate, leading to poor posterior approximations.

Additionally, while VSBQ avoids the need for additional model calls, it still requires constructing the sparse GP surrogate model and performing variational inference, which may incur significant computational overhead for high-dimensional problems. The authors acknowledge this and suggest that further research is needed to improve the scalability of the method.

Another area for potential exploration is the application of VSBQ to settings with heterogeneous data sources or models with non-Gaussian likelihoods. The current paper focuses on black-box and potentially noisy likelihoods, but extending the method to handle more complex scenarios could broaden its applicability.

Overall, the "post-process Bayesian inference" framework and the VSBQ method represent an interesting and valuable contribution to the field of approximate Bayesian inference. The paper provides a solid foundation for further research and development in this direction.

Conclusion

The researchers propose a new framework called "post-process Bayesian inference" to quickly approximate Bayesian posterior distributions using existing model evaluations. Within this framework, they introduce a specific method called "Variational Sparse Bayesian Quadrature" (VSBQ) that builds a sparse Gaussian process surrogate model of the log posterior and then uses variational inference to approximate the posterior.

VSBQ allows for fast posterior inference without requiring any additional model evaluations, by reusing optimization traces that may already be available. The experiments show that VSBQ can produce high-quality posterior approximations on challenging synthetic and real-world problems from computational neuroscience, demonstrating the potential of this approach to save significant computational time, especially for models with expensive evaluations.

While the method has some limitations and areas for further research, the "post-process Bayesian inference" framework and the VSBQ technique represent an important contribution to the field of approximate Bayesian inference, with promising implications for a wide range of applications.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🤯

Fast post-process Bayesian inference with Variational Sparse Bayesian Quadrature

Chengkun Li, Gr'egoire Clart'e, Martin J{o}rgensen, Luigi Acerbi

In applied Bayesian inference scenarios, users may have access to a large number of pre-existing model evaluations, for example from maximum-a-posteriori (MAP) optimization runs. However, traditional approximate inference techniques make little to no use of this available information. We propose the framework of post-process Bayesian inference as a means to obtain a quick posterior approximation from existing target density evaluations, with no further model calls. Within this framework, we introduce Variational Sparse Bayesian Quadrature (VSBQ), a method for post-process approximate inference for models with black-box and potentially noisy likelihoods. VSBQ reuses existing target density evaluations to build a sparse Gaussian process (GP) surrogate model of the log posterior density function. Subsequently, we leverage sparse-GP Bayesian quadrature combined with variational inference to achieve fast approximate posterior inference over the surrogate. We validate our method on challenging synthetic scenarios and real-world applications from computational neuroscience. The experiments show that VSBQ builds high-quality posterior approximations by post-processing existing optimization traces, with no further model evaluations.

6/19/2024

🤯

Variational Bayesian surrogate modelling with application to robust design optimisation

Thomas A. Archbold, Ieva Kazlauskaite, Fehmi Cirak

Surrogate models provide a quick-to-evaluate approximation to complex computational models and are essential for multi-query problems like design optimisation. The inputs of current computational models are usually high-dimensional and uncertain. We consider Bayesian inference for constructing statistical surrogates with input uncertainties and intrinsic dimensionality reduction. The surrogates are trained by fitting to data from prevalent deterministic computational models. The assumed prior probability density of the surrogate is a Gaussian process. We determine the respective posterior probability density and parameters of the posited statistical model using variational Bayes. The non-Gaussian posterior is approximated by a simpler trial density with free variational parameters and the discrepancy between them is measured using the Kullback-Leibler (KL) divergence. We employ the stochastic gradient method to compute the variational parameters and other statistical model parameters by minimising the KL divergence. We demonstrate the accuracy and versatility of the proposed reduced dimension variational Gaussian process (RDVGP) surrogate on illustrative and robust structural optimisation problems with cost functions depending on a weighted sum of the mean and standard deviation of model outputs.

4/24/2024

Approximation-Aware Bayesian Optimization

Natalie Maus, Kyurae Kim, Geoff Pleiss, David Eriksson, John P. Cunningham, Jacob R. Gardner

High-dimensional Bayesian optimization (BO) tasks such as molecular design often require 10,000 function evaluations before obtaining meaningful results. While methods like sparse variational Gaussian processes (SVGPs) reduce computational requirements in these settings, the underlying approximations result in suboptimal data acquisitions that slow the progress of optimization. In this paper we modify SVGPs to better align with the goals of BO: targeting informed data acquisition rather than global posterior fidelity. Using the framework of utility-calibrated variational inference, we unify GP approximation and data acquisition into a joint optimization problem, thereby ensuring optimal decisions under a limited computational budget. Our approach can be used with any decision-theoretic acquisition function and is compatible with trust region methods like TuRBO. We derive efficient joint objectives for the expected improvement and knowledge gradient acquisition functions in both the standard and batch BO settings. Our approach outperforms standard SVGPs on high-dimensional benchmark tasks in control and molecular design.

6/7/2024

Quasi-Bayes meets Vines

David Huk, Yuanhe Zhang, Mark Steel, Ritabrata Dutta

Recently proposed quasi-Bayesian (QB) methods initiated a new era in Bayesian computation by directly constructing the Bayesian predictive distribution through recursion, removing the need for expensive computations involved in sampling the Bayesian posterior distribution. This has proved to be data-efficient for univariate predictions, but extensions to multiple dimensions rely on a conditional decomposition resulting from predefined assumptions on the kernel of the Dirichlet Process Mixture Model, which is the implicit nonparametric model used. Here, we propose a different way to extend Quasi-Bayesian prediction to high dimensions through the use of Sklar's theorem by decomposing the predictive distribution into one-dimensional predictive marginals and a high-dimensional copula. Thus, we use the efficient recursive QB construction for the one-dimensional marginals and model the dependence using highly expressive vine copulas. Further, we tune hyperparameters using robust divergences (eg. energy score) and show that our proposed Quasi-Bayesian Vine (QB-Vine) is a fully non-parametric density estimator with emph{an analytical form} and convergence rate independent of the dimension of data in some situations. Our experiments illustrate that the QB-Vine is appropriate for high dimensional distributions ($sim$64), needs very few samples to train ($sim$200) and outperforms state-of-the-art methods with analytical forms for density estimation and supervised tasks by a considerable margin.

6/19/2024