A convergent scheme for the Bayesian filtering problem based on the Fokker--Planck equation and deep splitting

Read original: arXiv:2409.14585 - Published 9/24/2024 by Kasper B{aa}gmark, Adam Andersson, Stig Larsson, Filip Rydin

🤿

Overview

A numerical scheme is introduced to approximate the nonlinear filtering density.
The convergence rate of this scheme is established theoretically and empirically.
The scheme uses a deep splitting approach to approximate the Fokker-Planck equation for the prediction step, and Bayes' formula for the update step.
The algorithm employs a sampling-based Feynman-Kac approach to mitigate the curse of dimensionality.
The convergence proof relies on the Malliavin integration-by-parts formula.

Plain English Explanation

The provided paper introduces a new numerical scheme for approximating the nonlinear filtering density. This is a critical step in Bayesian filtering, which is used to estimate the state of a dynamic system from noisy and partial observations.

The key idea is to use a two-step process: first, the Fokker-Planck equation is approximated using a deep splitting scheme to handle the prediction step between observations. Then, Bayes' formula is used to perform an exact update step when a new observation becomes available.

This approach has several advantages. It can operate online, meaning it can process new observation sequences without the need for retraining. Additionally, it employs a sampling-based Feynman-Kac method to mitigate the curse of dimensionality - a common challenge in high-dimensional filtering problems.

The authors provide a rigorous theoretical analysis, proving the convergence of their scheme under certain assumptions. They also demonstrate its empirical performance on two example problems.

Technical Explanation

The paper introduces a numerical scheme for approximating the nonlinear filtering density and establishes its convergence rate, both theoretically and empirically.

For the prediction step, which occurs between the noisy and partial measurements at discrete times, the scheme approximates the Fokker-Planck equation using a deep splitting approach. This handles the evolution of the system's state distribution in the absence of new observations.

The update step then performs an exact calculation using Bayes' formula when a new observation becomes available. This results in a classical prediction-update filtering algorithm that can operate online for new observation sequences post-training.

To mitigate the curse of dimensionality, the algorithm employs a sampling-based Feynman-Kac approach.

The authors' convergence proof relies on the Malliavin integration-by-parts formula, a powerful tool from stochastic analysis. As a corollary, they also obtain the convergence rate for the approximation of the Fokker-Planck equation alone, independent of the filtering problem.

Critical Analysis

The paper presents a rigorous theoretical analysis of the proposed numerical scheme, which is a strength of the work. The authors establish convergence under a parabolic Hörmander condition, which is a technical assumption that may limit the generality of the results.

Additionally, the authors only provide empirical evaluation on two example problems. It would be helpful to see the scheme applied to a broader range of filtering problems to better understand its performance characteristics and limitations.

The paper also does not discuss potential challenges or caveats related to the use of the Feynman-Kac sampling approach, which is critical to the scheme's ability to handle high-dimensional problems. Further exploration of the practical aspects of implementing this algorithm would be valuable.

Conclusion

The provided paper introduces a novel numerical scheme for approximating the nonlinear filtering density, a key component of Bayesian filtering. The scheme combines a deep splitting approach for the prediction step and Bayes' formula for the update step, resulting in an online algorithm that can mitigate the curse of dimensionality using a Feynman-Kac sampling method.

The authors provide a strong theoretical foundation for their approach, proving convergence under certain assumptions. However, the practical implications and limitations of the scheme would benefit from further exploration and empirical evaluation on a wider range of filtering problems.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🤿

A convergent scheme for the Bayesian filtering problem based on the Fokker--Planck equation and deep splitting

Kasper B{aa}gmark, Adam Andersson, Stig Larsson, Filip Rydin

A numerical scheme for approximating the nonlinear filtering density is introduced and its convergence rate is established, theoretically under a parabolic H{o}rmander condition, and empirically for two examples. For the prediction step, between the noisy and partial measurements at discrete times, the scheme approximates the Fokker--Planck equation with a deep splitting scheme, and performs an exact update through Bayes' formula. This results in a classical prediction-update filtering algorithm that operates online for new observation sequences post-training. The algorithm employs a sampling-based Feynman--Kac approach, designed to mitigate the curse of dimensionality. Our convergence proof relies on the Malliavin integration-by-parts formula. As a corollary we obtain the convergence rate for the approximation of the Fokker--Planck equation alone, disconnected from the filtering problem.

9/24/2024

Deep Bayesian Filter for Bayes-faithful Data Assimilation

Yuta Tarumi, Keisuke Fukuda, Shin-ichi Maeda

State estimation for nonlinear state space models (SSMs) is a challenging task. Existing assimilation methodologies predominantly assume Gaussian posteriors on physical space, where true posteriors become inevitably non-Gaussian. We propose Deep Bayesian Filtering (DBF) for data assimilation on nonlinear SSMs. DBF constructs new latent variables $h_t$ in addition to the original physical variables $z_t$ and assimilates observations $o_t$. By (i) constraining the state transition on the new latent space to be linear and (ii) learning a Gaussian inverse observation operator $r(h_t|o_t)$, posteriors remain Gaussian. Notably, the structured design of test distributions enables an analytical formula for the recursive computation, eliminating the accumulation of Monte Carlo sampling errors across time steps. DBF trains the Gaussian inverse observation operators $r(h_t|o_t)$ and other latent SSM parameters (e.g., dynamics matrix) by maximizing the evidence lower bound. Experiments demonstrate that DBF outperforms model-based approaches and latent assimilation methods in tasks where the true posterior distribution on physical space is significantly non-Gaussian.

10/3/2024

🛠️

Convergence analysis of kernel learning FBSDE filter

Yunzheng Lyu, Feng Bao

Kernel learning forward backward SDE filter is an iterative and adaptive meshfree approach to solve the nonlinear filtering problem. It builds from forward backward SDE for Fokker-Planker equation, which defines evolving density for the state variable, and employs KDE to approximate density. This algorithm has shown more superior performance than mainstream particle filter method, in both convergence speed and efficiency of solving high dimension problems. However, this method has only been shown to converge empirically. In this paper, we present a rigorous analysis to demonstrate its local and global convergence, and provide theoretical support for its empirical results.

7/2/2024

Learning Optimal Filters Using Variational Inference

Enoch Luk, Eviatar Bach, Ricardo Baptista, Andrew Stuart

Filtering - the task of estimating the conditional distribution of states of a dynamical system given partial, noisy, observations - is important in many areas of science and engineering, including weather and climate prediction. However, the filtering distribution is generally intractable to obtain for high-dimensional, nonlinear systems. Filters used in practice, such as the ensemble Kalman filter (EnKF), are biased for nonlinear systems and have numerous tuning parameters. Here, we present a framework for learning a parameterized analysis map - the map that takes a forecast distribution and observations to the filtering distribution - using variational inference. We show that this methodology can be used to learn gain matrices for filtering linear and nonlinear dynamical systems, as well as inflation and localization parameters for an EnKF. Future work will apply this framework to learn new filtering algorithms.

8/14/2024