Fast, accurate and lightweight sequential simulation-based inference using Gaussian locally linear mappings

2403.07454

Published 6/26/2024 by Henrik Haggstrom, Pedro L. C. Rodrigues, Geoffroy Oudoumanessah, Florence Forbes, Umberto Picchini

stat.ML cs.LG

Abstract

Bayesian inference for complex models with an intractable likelihood can be tackled using algorithms performing many calls to computer simulators. These approaches are collectively known as simulation-based inference (SBI). Recent SBI methods have made use of neural networks (NN) to provide approximate, yet expressive constructs for the unavailable likelihood function and the posterior distribution. However, the trade-off between accuracy and computational demand leaves much space for improvement. In this work, we propose an alternative that provides both approximations to the likelihood and the posterior distribution, using structured mixtures of probability distributions. Our approach produces accurate posterior inference when compared to state-of-the-art NN-based SBI methods, even for multimodal posteriors, while exhibiting a much smaller computational footprint. We illustrate our results on several benchmark models from the SBI literature and on a biological model of the translation kinetics after mRNA transfection.

Overview

This paper proposes a fast, accurate, and lightweight method for sequential simulation-based inference (SBI) built on Gaussian Locally Linear Mappings (GLLiM), a structured mixture model.
The method fits a surrogate model that approximates both the likelihood and the posterior distribution, so Bayesian inference needs much less computation than neural-network-based alternatives.
The approach is demonstrated on several benchmark models from the SBI literature and on a biological model of translation kinetics after mRNA transfection, where it produces accurate posteriors compared with state-of-the-art neural-network-based SBI methods at a much smaller computational footprint.

Plain English Explanation

When scientists or engineers want to understand a complex system, they often use computer simulations to model the behavior of that system. However, running these simulations can be time-consuming and computationally expensive, especially when performing Bayesian inference, which requires repeatedly running the simulations to explore the parameter space.

The researchers in this paper build their method on Gaussian Locally Linear Mappings (GLLiM), a mixture model that can approximate both the likelihood and the posterior distribution of the system once it has been fitted to simulator output. Because the fitted model is cheap to evaluate, Bayesian inference becomes much faster: the parameter space can be explored through the approximation rather than through ever more simulator runs.

The key idea behind GLLiM is to use a surrogate model, which is a simplified mathematical representation of the complex system. This surrogate model is trained on a small number of simulations, and then it can be used to generate approximate likelihood and posterior distributions much more quickly than running the full simulations.

The researchers demonstrate the approach on several benchmark problems and on a biological model, showing that it yields accurate posteriors compared with state-of-the-art neural-network-based methods while using much less computation. This could matter for fields such as physics, biology, or machine learning, where simulation-based inference is common but can be computationally prohibitive.

Technical Explanation

The paper introduces a fast, accurate, and lightweight method for sequential simulation-based inference based on Gaussian Locally Linear Mappings (GLLiM). The key idea is to fit a surrogate model that approximates both the likelihood and the posterior distribution, so that inference relies on a cheap structured mixture instead of the neural networks used by recent SBI methods.

The approach first fits a GLLiM model, a Gaussian mixture in which each component carries its own linear (affine) map, to a small set of simulation runs. Each component is linear, but the mixture as a whole captures a non-linear relationship between the input parameters and the simulator outputs. Once fitted, the model yields approximate likelihood and posterior distributions in closed form, without further simulator calls.
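For readers who want the mixture structure written out, the block below gives a sketch of a standard GLLiM formulation. The notation is an assumption made here for illustration (θ for parameters, y for simulator output, z for the latent component label, K components) and may differ from the paper's own; the starred quantities are the closed-form parameters of the inverse conditional implied by the joint model.

```latex
% Hierarchical definition of the joint model (assumed notation).
\begin{align*}
p(z = k) &= \pi_k, \\
p(\theta \mid z = k) &= \mathcal{N}(\theta;\, c_k, \Gamma_k), \\
p(y \mid \theta, z = k) &= \mathcal{N}(y;\, A_k \theta + b_k,\, \Sigma_k).
\end{align*}

% Forward conditional: a surrogate for the likelihood.
\[
p(y \mid \theta) = \sum_{k=1}^{K}
  \frac{\pi_k\, \mathcal{N}(\theta;\, c_k, \Gamma_k)}
       {\sum_{j=1}^{K} \pi_j\, \mathcal{N}(\theta;\, c_j, \Gamma_j)}\,
  \mathcal{N}(y;\, A_k \theta + b_k,\, \Sigma_k).
\]

% Inverse conditional: a surrogate for the posterior.
\[
p(\theta \mid y) = \sum_{k=1}^{K}
  \frac{\pi_k\, \mathcal{N}(y;\, c_k^{*}, \Gamma_k^{*})}
       {\sum_{j=1}^{K} \pi_j\, \mathcal{N}(y;\, c_j^{*}, \Gamma_j^{*})}\,
  \mathcal{N}(\theta;\, A_k^{*} y + b_k^{*},\, \Sigma_k^{*}),
\]
\begin{align*}
c_k^{*} &= A_k c_k + b_k, &
\Gamma_k^{*} &= \Sigma_k + A_k \Gamma_k A_k^{\top}, \\
\Sigma_k^{*} &= \bigl(\Gamma_k^{-1} + A_k^{\top} \Sigma_k^{-1} A_k\bigr)^{-1}, &
A_k^{*} &= \Sigma_k^{*} A_k^{\top} \Sigma_k^{-1}, \\
b_k^{*} &= \Sigma_k^{*} \bigl(\Gamma_k^{-1} c_k - A_k^{\top} \Sigma_k^{-1} b_k\bigr).
\end{align*}
```

Both conditionals are Gaussian mixtures with input-dependent weights, which is why a single fitted model can serve as a likelihood and a posterior approximation at once. Note that the inverse conditional treats the mixture's own marginal over θ as the prior, so it reflects the distribution from which the training parameters were drawn.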

The authors demonstrate the method on several benchmark models from the SBI literature and on a biological model of translation kinetics after mRNA transfection. The results show accurate posterior inference compared with state-of-the-art neural-network-based SBI methods, even for multimodal posteriors, at a much smaller computational footprint.

Additionally, the authors show that GLLiM can be used in a sequential manner, updating the surrogate model as new simulation data becomes available. This makes it well-suited for iterative simulation-based inference tasks, where the goal is to gradually refine the parameter estimates over time.
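The sequential idea can be illustrated with a deliberately simplified sketch. This is not the paper's algorithm: it assumes a hypothetical one-dimensional toy simulator, a Gaussian prior, and a single-component linear-Gaussian surrogate (the simplest special case of a locally linear mixture). All function names and settings below are invented for illustration. Each round proposes parameters from the current posterior approximation, simulates, refits the surrogate likelihood on all data gathered so far, and updates the posterior in closed form.

```python
import random


def simulator(theta, rng):
    # Hypothetical toy simulator: a noisy linear response (assumption).
    return 2.0 * theta + 1.0 + rng.gauss(0.0, 0.5)


def fit_linear_gaussian(thetas, ys):
    # Least-squares fit of y = a * theta + b + noise, noise ~ N(0, s2).
    n = len(thetas)
    mean_t = sum(thetas) / n
    mean_y = sum(ys) / n
    s_tt = sum((t - mean_t) ** 2 for t in thetas)
    s_ty = sum((t - mean_t) * (y - mean_y) for t, y in zip(thetas, ys))
    a = s_ty / s_tt
    b = mean_y - a * mean_t
    s2 = sum((y - a * t - b) ** 2 for t, y in zip(thetas, ys)) / (n - 2)
    return a, b, s2


def gaussian_posterior(a, b, s2, y_obs, prior_mean, prior_var):
    # Conjugate update: prior N(prior_mean, prior_var) combined with the
    # surrogate likelihood N(y_obs; a * theta + b, s2).
    precision = 1.0 / prior_var + a * a / s2
    mean = (prior_mean / prior_var + a * (y_obs - b) / s2) / precision
    return mean, 1.0 / precision


def sequential_inference(y_obs, rounds=4, sims_per_round=200, seed=0):
    rng = random.Random(seed)
    prior_mean, prior_var = 0.0, 100.0
    # The first round proposes from the prior.
    prop_mean, prop_var = prior_mean, prior_var
    thetas, ys = [], []
    for _ in range(rounds):
        # Propose parameters from the current posterior approximation.
        new_thetas = [rng.gauss(prop_mean, prop_var ** 0.5)
                      for _ in range(sims_per_round)]
        thetas += new_thetas
        ys += [simulator(t, rng) for t in new_thetas]
        # Refit the surrogate likelihood on all simulations so far.
        a, b, s2 = fit_linear_gaussian(thetas, ys)
        # The prior stays fixed; only the surrogate likelihood is refined.
        prop_mean, prop_var = gaussian_posterior(
            a, b, s2, y_obs, prior_mean, prior_var)
    return prop_mean, prop_var


post_mean, post_var = sequential_inference(y_obs=5.0)
print(post_mean, post_var)
```

Because the surrogate here models the likelihood, which does not depend on how the parameters were proposed, later rounds can sample near the observation without any reweighting. A scheme that learns the posterior directly from proposal samples would generally need a correction for the proposal.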

Critical Analysis

The paper presents a promising approach for accelerating simulation-based inference, but there are a few caveats and limitations to consider:

The accuracy of the GLLiM surrogate model is heavily dependent on the quality and diversity of the initial training data. If the set of simulations used to train the model is not representative of the full parameter space, the approximations made by GLLiM may be inaccurate.
The paper only evaluates GLLiM on relatively low-dimensional problems (up to 10 input parameters). It is unclear how well the method would scale to the higher-dimensional settings that are common in real-world applications of simulation-based inference.
The paper does not provide a rigorous theoretical analysis of the convergence properties or approximation guarantees of the GLLiM method. This makes it difficult to assess the method's reliability and robustness compared to other simulation-based inference techniques.
The authors do not discuss potential issues with the GLLiM method, such as the sensitivity of the Gaussian mixture model to outliers or the challenges of selecting the appropriate number of mixture components.

Despite these limitations, the paper presents an interesting and potentially useful approach for accelerating simulation-based inference. Further research and evaluation on more challenging real-world problems would help to better understand the strengths and weaknesses of the GLLiM method.

Conclusion

The paper introduces a sequential simulation-based inference method built on Gaussian Locally Linear Mappings (GLLiM), which uses a surrogate model to approximate the likelihood and posterior distributions. The results on several benchmark problems are promising: the method yields accurate posteriors compared with state-of-the-art neural-network-based methods at a much lower computational cost.

This work could have a significant impact on fields that rely heavily on simulation-based inference, such as physics, biology, and machine learning. By reducing the computational burden of these inference tasks, it could enable faster and more efficient exploration of complex scientific and engineering systems.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Simultaneous identification of models and parameters of scientific simulators

Cornelius Schroder, Jakob H. Macke

Many scientific models are composed of multiple discrete components, and scientists often make heuristic decisions about which components to include. Bayesian inference provides a mathematical framework for systematically selecting model components, but defining prior distributions over model components and developing associated inference schemes has been challenging. We approach this problem in a simulation-based inference framework: We define model priors over candidate components and, from model simulations, train neural networks to infer joint probability distributions over both model components and associated parameters. Our method, simulation-based model inference (SBMI), represents distributions over model components as a conditional mixture of multivariate binary distributions in the Grassmann formalism. SBMI can be applied to any compositional stochastic simulator without requiring likelihood evaluations. We evaluate SBMI on a simple time series model and on two scientific models from neuroscience, and show that it can discover multiple data-consistent model configurations, and that it reveals non-identifiable model components and parameters. SBMI provides a powerful tool for data-driven scientific inquiry which will allow scientists to identify essential model components and make uncertainty-informed modelling decisions.

5/31/2024

cs.LG

Diffusion posterior sampling for simulation-based inference in tall data settings

Julia Linhart, Gabriel Victorino Cardoso, Alexandre Gramfort, Sylvain Le Corff, Pedro L. C. Rodrigues

Determining which parameters of a non-linear model best describe a set of experimental data is a fundamental problem in science and it has gained much traction lately with the rise of complex large-scale simulators. The likelihood of such models is typically intractable, which is why classical MCMC methods can not be used. Simulation-based inference (SBI) stands out in this context by only requiring a dataset of simulations to train deep generative models capable of approximating the posterior distribution that relates input parameters to a given observation. In this work, we consider a tall data extension in which multiple observations are available to better infer the parameters of the model. The proposed method is built upon recent developments from the flourishing score-based diffusion literature and allows to estimate the tall data posterior distribution, while simply using information from a score network trained for a single context observation. We compare our method to recently proposed competing approaches on various numerical experiments and demonstrate its superiority in terms of numerical stability and computational cost.

6/10/2024

stat.ML cs.LG

Preconditioned Neural Posterior Estimation for Likelihood-free Inference

Xiaoyu Wang, Ryan P. Kelly, David J. Warne, Christopher Drovandi

Simulation based inference (SBI) methods enable the estimation of posterior distributions when the likelihood function is intractable, but where model simulation is feasible. Popular neural approaches to SBI are the neural posterior estimator (NPE) and its sequential version (SNPE). These methods can outperform statistical SBI approaches such as approximate Bayesian computation (ABC), particularly for relatively small numbers of model simulations. However, we show in this paper that the NPE methods are not guaranteed to be highly accurate, even on problems with low dimension. In such settings the posterior cannot be accurately trained over the prior predictive space, and even the sequential extension remains sub-optimal. To overcome this, we propose preconditioned NPE (PNPE) and its sequential version (PSNPE), which uses a short run of ABC to effectively eliminate regions of parameter space that produce large discrepancy between simulations and data and allow the posterior emulator to be more accurately trained. We present comprehensive empirical evidence that this melding of neural and statistical SBI methods improves performance over a range of examples, including a motivating example involving a complex agent-based model applied to real tumour growth data.

4/23/2024

stat.ML cs.LG

All-in-one simulation-based inference

Manuel Gloeckler, Michael Deistler, Christian Weilbach, Frank Wood, Jakob H. Macke

Amortized Bayesian inference trains neural networks to solve stochastic inference problems using model simulations, thereby making it possible to rapidly perform Bayesian inference for any newly observed data. However, current simulation-based amortized inference methods are simulation-hungry and inflexible: They require the specification of a fixed parametric prior, simulator, and inference tasks ahead of time. Here, we present a new amortized inference method -- the Simformer -- which overcomes these limitations. By training a probabilistic diffusion model with transformer architectures, the Simformer outperforms current state-of-the-art amortized inference approaches on benchmark tasks and is substantially more flexible: It can be applied to models with function-valued parameters, it can handle inference scenarios with missing or unstructured data, and it can sample arbitrary conditionals of the joint distribution of parameters and data, including both posterior and likelihood. We showcase the performance and flexibility of the Simformer on simulators from ecology, epidemiology, and neuroscience, and demonstrate that it opens up new possibilities and application domains for amortized Bayesian inference on simulation-based models.

6/3/2024

cs.LG cs.AI stat.ML