Neural Likelihood Approximation for Integer Valued Time Series Data

arXiv:2310.12544

Published 4/15/2024 by Luke O'Loughlin, John Maclean, Andrew Black

Abstract

Stochastic processes defined on integer valued state spaces are popular within the physical and biological sciences. These models are necessary for capturing the dynamics of small systems where the individual nature of the populations cannot be ignored and stochastic effects are important. The inference of the parameters of such models, from time series data, is challenging due to intractability of the likelihood. To work at all, current simulation based inference methods require the generation of realisations of the model conditional on the data, which can be both tricky to implement and computationally expensive. In this paper we instead construct a neural likelihood approximation that can be trained using unconditional simulation of the underlying model, which is much simpler. We demonstrate our method by performing inference on a number of ecological and epidemiological models, showing that we can accurately approximate the true posterior while achieving significant computational speed ups compared to current best methods.

Overview

This paper presents a method for performing inference on stochastic models with discrete state spaces, which are commonly used in the physical and biological sciences.
Existing simulation-based inference methods for these models are challenging to implement and computationally expensive, as they require generating realizations of the model conditional on the data.
The authors instead propose a neural likelihood approximation that can be trained using unconditional simulations of the underlying model, which is much simpler.
The method is demonstrated on several ecological and epidemiological models, showing accurate approximation of the true posterior and significant computational speedups compared to current best methods.

Plain English Explanation

Many processes in the physical and biological sciences can be modeled as stochastic systems with discrete states, such as the dynamics of small populations. Inferring the parameters of these models from observed data is challenging because the likelihood, the probability of the observed data as a function of the parameters, is usually intractable to compute.

Current simulation-based methods work around this by generating realizations of the model conditional on the observed data. Such conditional simulations can be tricky to implement and computationally expensive.

In this paper, the researchers propose a different approach: they train a neural network to approximate the likelihood directly, using only unconditional simulations of the model, which are much easier to generate. This allows them to do accurate parameter inference much more efficiently than previous methods.

They demonstrate their technique on several models from ecology and epidemiology, showing that it can accurately approximate the true posterior distribution of the model parameters while being significantly faster to run.

Technical Explanation

The key idea of the paper is to construct a neural network-based approximation of the likelihood function for stochastic models with discrete state spaces, which can then be used for efficient Bayesian inference.
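The role the approximation plays in inference can be written out explicitly. The notation below is an assumption made for illustration, since the summary does not give the paper's symbols: $\theta$ denotes the model parameters, $y_{1:n}$ the observed integer-valued time series, and $q_\phi$ a neural approximation with weights $\phi$. The factorisation on the second line is just the chain rule of probability. Modelling each conditional term with a network is one common way to structure such an approximation and is not confirmed here as the authors' architecture. The last line is the standard maximum-likelihood objective for fitting a conditional density estimator to simulated pairs $(\theta^{(k)}, y^{(k)})$.

```latex
\[
p(\theta \mid y_{1:n}) \;\propto\; p(\theta)\, p(y_{1:n} \mid \theta)
\;\approx\; p(\theta)\, q_\phi(y_{1:n} \mid \theta)
\]
\[
q_\phi(y_{1:n} \mid \theta) \;=\; \prod_{i=1}^{n} q_\phi\!\left(y_i \mid y_{1:i-1}, \theta\right)
\]
\[
\phi^{*} \;=\; \arg\max_{\phi} \sum_{k=1}^{K} \log q_\phi\!\left(y^{(k)} \mid \theta^{(k)}\right),
\qquad y^{(k)} \sim p(\,\cdot \mid \theta^{(k)})
\]
```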

The authors first observe that while directly calculating the true likelihood is often intractable for these models, it is relatively straightforward to generate unconditional simulations of the model. They then train a neural network, using these unconditional simulations as training data, to learn an approximation of the true likelihood.
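To make "unconditional simulation" concrete, here is a minimal sketch of how training pairs could be generated for one integer-valued model. It assumes a stochastic SIR epidemic simulated with the Gillespie algorithm. The model choice, the function names (`simulate_sir`, `daily_counts`, `make_training_set`), the population size, and the uniform parameter ranges are all illustrative assumptions, not details taken from the paper.

```python
import random


def simulate_sir(beta, gamma, s0, i0, t_max, rng):
    """Gillespie simulation of a stochastic SIR model.

    The simulation is unconditional: it runs forward from the initial
    state and is never steered towards any observed data.
    """
    s, i, r = s0, i0, 0
    n = s0 + i0
    t = 0.0
    path = [(t, s, i, r)]
    while i > 0:
        infection_rate = beta * s * i / n
        recovery_rate = gamma * i
        total_rate = infection_rate + recovery_rate
        t += rng.expovariate(total_rate)  # waiting time to the next event
        if t > t_max:
            break
        # Pick which event fires, in proportion to its rate.
        if rng.random() * total_rate < infection_rate:
            s, i = s - 1, i + 1
        else:
            i, r = i - 1, r + 1
        path.append((t, s, i, r))
    return path


def daily_counts(path, n_days):
    """Number of infectious individuals at the end of each day."""
    series = []
    idx = 0
    for day in range(1, n_days + 1):
        # Advance to the last event that happened on or before this day.
        while idx + 1 < len(path) and path[idx + 1][0] <= day:
            idx += 1
        series.append(path[idx][2])
    return series


def make_training_set(n_sims, n_days, seed=0):
    """Draw parameters, simulate, and return (theta, time series) pairs."""
    rng = random.Random(seed)
    data = []
    for _ in range(n_sims):
        beta = rng.uniform(0.1, 2.0)   # hypothetical sampling range
        gamma = rng.uniform(0.1, 1.0)  # hypothetical sampling range
        path = simulate_sir(beta, gamma, s0=99, i0=1, t_max=n_days, rng=rng)
        data.append(((beta, gamma), daily_counts(path, n_days)))
    return data
```

Each pair returned by `make_training_set` couples a parameter vector with an integer-valued time series, which is the kind of input a conditional density estimator would be trained on. No step requires the simulated path to match an observation, which is what makes this kind of simulation simple to implement.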

The neural likelihood approximation is then used within a Bayesian inference framework to perform parameter estimation. This avoids the need to generate costly conditional simulations, as required by previous simulation-based inference methods.
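As an illustration of how an approximate likelihood can be plugged into Bayesian inference, the sketch below runs a random-walk Metropolis-Hastings sampler over a single parameter. The summary does not say which sampler the authors use, so the choice of Metropolis-Hastings is an assumption. The function `surrogate_log_lik` is a hypothetical stand-in for a trained network: it uses a simple Poisson log-likelihood so that the example runs on its own. The prior and the data are likewise made up for the example.

```python
import math
import random


def surrogate_log_lik(theta, data):
    """Stand-in for a trained neural log-likelihood log q(y | theta).

    Assumption for illustration only: independent Poisson counts with
    mean theta. A trained network would be evaluated here instead.
    """
    if theta <= 0:
        return -math.inf
    return sum(y * math.log(theta) - theta - math.lgamma(y + 1) for y in data)


def log_prior(theta):
    """Hypothetical Exponential(rate=0.1) prior on theta."""
    return -0.1 * theta if theta > 0 else -math.inf


def metropolis_hastings(log_lik, log_prior, data, theta0, n_steps, step_size, seed=0):
    """Random-walk Metropolis-Hastings using an approximate log-likelihood."""
    rng = random.Random(seed)
    theta = theta0
    log_post = log_lik(theta, data) + log_prior(theta)
    samples = []
    accepted = 0
    for _ in range(n_steps):
        proposal = theta + rng.gauss(0.0, step_size)
        log_post_prop = log_lik(proposal, data) + log_prior(proposal)
        # Accept with probability min(1, exp(log_post_prop - log_post)).
        if math.log(1.0 - rng.random()) < log_post_prop - log_post:
            theta, log_post = proposal, log_post_prop
            accepted += 1
        samples.append(theta)
    return samples, accepted / n_steps


observed = [3, 5, 4, 6, 2, 4, 5, 3]  # made-up integer-valued observations
samples, acceptance_rate = metropolis_hastings(
    surrogate_log_lik, log_prior, observed,
    theta0=1.0, n_steps=5000, step_size=0.8, seed=0,
)
posterior_mean = sum(samples[1000:]) / len(samples[1000:])  # drop burn-in
```

The sampler only ever evaluates the likelihood function it is given, so each step costs one evaluation of the surrogate and no model simulation. That is the sense in which a trained approximation removes the need for conditional simulation at inference time.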

The authors demonstrate their approach on several ecological and epidemiological case studies, including spatial models, showing that it can accurately capture the true posterior distribution of the model parameters while being significantly faster to run than existing techniques.

Critical Analysis

The authors acknowledge several limitations of their approach. Firstly, the neural network approximation may not be accurate in all cases, particularly for models with complex, high-dimensional state spaces. The quality of the approximation will depend on the expressiveness of the neural network and the amount of training data available.

Additionally, the authors note that their method, like other simulation-based inference techniques, can be sensitive to model misspecification. If the true underlying model differs from the assumed model, the inferred parameters may be biased.

Further research could explore ways to make the neural likelihood approximation more robust, such as by incorporating uncertainty quantification or adapting the network architecture to the problem at hand. Comparisons to other recent advancements in simulation-based inference, such as amortized variational inference, could also provide valuable insights.

Overall, the authors present an interesting and promising approach for efficient Bayesian inference on discrete-state stochastic models, which have important applications across the physical and life sciences.

Conclusion

This paper introduces a neural network-based method for approximating the likelihood function of stochastic models with discrete state spaces, which can significantly improve the efficiency of Bayesian parameter inference compared to existing simulation-based techniques.

By training the neural approximation on unconditional model simulations, the authors avoid the computationally expensive conditional simulations required by prior approaches. This allows for accurate posterior inference with substantial speedups, as demonstrated on several ecological and epidemiological models.

While the method has some limitations, it represents an important advancement in the field of simulation-based inference and could have wide-ranging impacts across the physical and biological sciences, where discrete-state stochastic models are ubiquitous.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Fast, accurate and lightweight sequential simulation-based inference using Gaussian locally linear mappings

Henrik Haggstrom, Pedro L. C. Rodrigues, Geoffroy Oudoumanessah, Florence Forbes, Umberto Picchini

Bayesian inference for complex models with an intractable likelihood can be tackled using algorithms performing many calls to computer simulators. These approaches are collectively known as simulation-based inference (SBI). Recent SBI methods have made use of neural networks (NN) to provide approximate, yet expressive constructs for the unavailable likelihood function and the posterior distribution. However, the trade-off between accuracy and computational demand leaves much space for improvement. In this work, we propose an alternative that provides both approximations to the likelihood and the posterior distribution, using structured mixtures of probability distributions. Our approach produces accurate posterior inference when compared to state-of-the-art NN-based SBI methods, even for multimodal posteriors, while exhibiting a much smaller computational footprint. We illustrate our results on several benchmark models from the SBI literature and on a biological model of the translation kinetics after mRNA transfection.

6/26/2024

stat.ML cs.LG

Sample-efficient neural likelihood-free Bayesian inference of implicit HMMs

Sanmitra Ghosh, Paul J. Birrell, Daniela De Angelis

Likelihood-free inference methods based on neural conditional density estimation were shown to drastically reduce the simulation burden in comparison to classical methods such as ABC. When applied in the context of any latent variable model, such as a Hidden Markov model (HMM), these methods are designed to only estimate the parameters, rather than the joint distribution of the parameters and the hidden states. Naive application of these methods to a HMM, ignoring the inference of this joint posterior distribution, will thus produce an inaccurate estimate of the posterior predictive distribution, in turn hampering the assessment of goodness-of-fit. To rectify this problem, we propose a novel, sample-efficient likelihood-free method for estimating the high-dimensional hidden states of an implicit HMM. Our approach relies on learning directly the intractable posterior distribution of the hidden states, using an autoregressive-flow, by exploiting the Markov property. Upon evaluating our approach on some implicit HMMs, we found that the quality of the estimates retrieved using our method is comparable to what can be achieved using a much more computationally expensive SMC algorithm.

5/6/2024

stat.ML cs.LG

Time Series Clustering with General State Space Models via Stochastic Variational Inference

Ryoichi Ishizuka, Takashi Imai, Kaoru Kawamoto

In this paper, we propose a novel method of model-based time series clustering with mixtures of general state space models (MSSMs). Each component of MSSMs is associated with each cluster. An advantage of the proposed method is that it enables the use of time series models appropriate to the specific time series. This not only improves clustering and prediction accuracy but also enhances the interpretability of the estimated parameters. The parameters of the MSSMs are estimated using stochastic variational inference, a subtype of variational inference. The proposed method estimates the latent variables of an arbitrary state space model by using neural networks with a normalizing flow as a variational estimator. The number of clusters can be estimated using the Bayesian information criterion. In addition, to prevent MSSMs from converging to the local optimum, we propose several optimization tricks, including an additional penalty term called entropy annealing. Experiments on simulated datasets show that the proposed method is effective for clustering, parameter estimation, and estimating the number of clusters.

7/2/2024

cs.LG cs.AI

Latent Variable Sequence Identification for Cognitive Models with Neural Bayes Estimation

Ti-Fen Pan, Jing-Jing Li, Bill Thompson, Anne Collins

Extracting time-varying latent variables from computational cognitive models is a key step in model-based neural analysis, which aims to understand the neural correlates of cognitive processes. However, existing methods only allow researchers to infer latent variables that explain subjects' behavior in a relatively small class of cognitive models. For example, a broad class of relevant cognitive models with analytically intractable likelihood is currently out of reach from standard techniques, based on Maximum a Posteriori parameter estimation. Here, we present an approach that extends neural Bayes estimation to learn a direct mapping between experimental data and the targeted latent variable space using recurrent neural networks and simulated datasets. We show that our approach achieves competitive performance in inferring latent variable sequences in both tractable and intractable models. Furthermore, the approach is generalizable across different computational models and is adaptable for both continuous and discrete latent spaces. We then demonstrate its applicability in real world datasets. Our work underscores that combining recurrent neural networks and simulation-based inference to identify latent variable sequences can enable researchers to access a wider class of cognitive models for model-based neural analyses, and thus test a broader set of theories.

6/24/2024

cs.LG stat.ML