The Stochastic Occupation Kernel Method for System Identification

2406.15661

Published 6/26/2024 by Michael Wells, Kamel Lahouel, Bruno Jedynak

The Stochastic Occupation Kernel Method for System Identification

Abstract

The method of occupation kernels has been used to learn ordinary differential equations from data in a non-parametric way. We propose a two-step method for learning the drift and diffusion of a stochastic differential equation given snapshots of the process. In the first step, we learn the drift by applying the occupation kernel algorithm to the expected value of the process. In the second step, we learn the diffusion given the drift using a semi-definite program. Specifically, we learn the diffusion squared as a non-negative function in a RKHS associated with the square of a kernel. We present examples and simulations.

Create account to get full access

Overview

This paper introduces the Stochastic Occupation Kernel Method (SOKM) for system identification, which can be used to learn the dynamics of unknown stochastic dynamical systems.
SOKM leverages the Nyström kernel Stein discrepancy to learn the infinitesimal generator of the system's underlying stochastic process, without requiring a parametric model.
The method can handle partially observed systems and is demonstrated on both simulated and real-world examples, showing its effectiveness in capturing complex nonlinear dynamics.

Plain English Explanation

The Stochastic Occupation Kernel Method (SOKM) is a way to understand the behavior of complicated, uncertain systems without needing to know the exact mathematical model in advance. This is useful for applications like robotics, finance, and biology, where the underlying dynamics are often complex and not easily described by simple equations.

SOKM works by using a special mathematical technique called the Nyström kernel Stein discrepancy to learn the "infinitesimal generator" of the system's stochastic (random) process. This generator encapsulates how the system evolves over time, without requiring the researchers to specify a particular model structure.

The method can handle situations where only partial information about the system is available, which is common in real-world applications. SOKM has been tested on both simulated examples and real-world data, demonstrating its ability to capture complex nonlinear dynamics that would be difficult to model using traditional approaches.

Technical Explanation

The core idea behind the Stochastic Occupation Kernel Method (SOKM) is to learn the infinitesimal generator of the underlying stochastic process governing the dynamics of an unknown system, without assuming a parametric model. This is achieved by leveraging the Nyström kernel Stein discrepancy, which provides a way to estimate the generator from data.

The method first constructs a kernel-based approximation of the system's occupation measure, which describes the distribution of the system's state over time. It then uses the Nyström method to estimate the infinitesimal generator from this occupation measure, without requiring a parametric model of the dynamics.

The authors demonstrate the effectiveness of SOKM on both simulated examples and real-world applications, such as modeling an unknown stochastic dynamical system subject to partial observations. They show that SOKM can capture complex nonlinear dynamics and outperform existing techniques, especially in cases where the system's behavior is difficult to describe using a pre-specified parametric model.

Critical Analysis

The Stochastic Occupation Kernel Method presents a promising approach for learning the dynamics of unknown stochastic systems without relying on restrictive parametric assumptions. By leveraging the Nyström kernel Stein discrepancy, SOKM can effectively estimate the infinitesimal generator of the underlying stochastic process, which is a powerful way to characterize the system's behavior.

One potential limitation of the method is that it may be sensitive to the choice of kernel function and hyperparameters, which could impact the quality of the learned generator. The authors acknowledge this and suggest further research into kernel selection and optimization techniques to address this issue.

Additionally, while SOKM can handle partially observed systems, the performance may degrade as the amount of available data decreases. It would be valuable to explore the method's robustness to different levels of data sparsity and noise, as real-world applications often involve incomplete or noisy observations.

Finally, the authors mention the computational complexity of SOKM, which scales with the size of the dataset. Developing more efficient implementation strategies or leveraging advances in recurrent deep kernel learning for dynamical systems could help extend the method's applicability to larger-scale problems.

Conclusion

The Stochastic Occupation Kernel Method introduces a novel approach for learning the dynamics of unknown stochastic systems without relying on pre-specified parametric models. By leveraging the Nyström kernel Stein discrepancy, SOKM can effectively estimate the infinitesimal generator of the underlying stochastic process, enabling the characterization of complex nonlinear dynamics.

The method's ability to handle partially observed systems and its demonstrated performance on both simulated and real-world examples suggest its potential for a wide range of applications, from robotics and finance to biology and beyond. As with any new technique, further research is needed to address potential limitations, such as the sensitivity to kernel choice and the computational complexity. Overall, SOKM represents a significant advancement in the field of system identification and opens up new possibilities for understanding the dynamics of complex, uncertain systems.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

👀

Non-Parametric Learning of Stochastic Differential Equations with Non-asymptotic Fast Rates of Convergence

Riccardo Bonalli, Alessandro Rudi

We propose a novel non-parametric learning paradigm for the identification of drift and diffusion coefficients of multi-dimensional non-linear stochastic differential equations, which relies upon discrete-time observations of the state. The key idea essentially consists of fitting a RKHS-based approximation of the corresponding Fokker-Planck equation to such observations, yielding theoretical estimates of non-asymptotic learning rates which, unlike previous works, become increasingly tighter when the regularity of the unknown drift and diffusion coefficients becomes higher. Our method being kernel-based, offline pre-processing may be profitably leveraged to enable efficient numerical implementation, offering excellent balance between precision and computational complexity.

4/24/2024

cs.LG cs.SY eess.SY

🔄

Learning the Infinitesimal Generator of Stochastic Diffusion Processes

Vladimir R. Kostic, Karim Lounici, Helene Halconruy, Timothee Devergne, Massimiliano Pontil

We address data-driven learning of the infinitesimal generator of stochastic diffusion processes, essential for understanding numerical simulations of natural and physical systems. The unbounded nature of the generator poses significant challenges, rendering conventional analysis techniques for Hilbert-Schmidt operators ineffective. To overcome this, we introduce a novel framework based on the energy functional for these stochastic processes. Our approach integrates physical priors through an energy-based risk metric in both full and partial knowledge settings. We evaluate the statistical performance of a reduced-rank estimator in reproducing kernel Hilbert spaces (RKHS) in the partial knowledge setting. Notably, our approach provides learning bounds independent of the state space dimension and ensures non-spurious spectral estimation. Additionally, we elucidate how the distortion between the intrinsic energy-induced metric of the stochastic diffusion and the RKHS metric used for generator estimation impacts the spectral learning bounds.

5/22/2024

stat.ML cs.LG

👨‍🏫

System Identification for Continuous-time Linear Dynamical Systems

Peter Halmos, Jonathan Pillow, David A. Knowles

The problem of system identification for the Kalman filter, relying on the expectation-maximization (EM) procedure to learn the underlying parameters of a dynamical system, has largely been studied assuming that observations are sampled at equally-spaced time points. However, in many applications this is a restrictive and unrealistic assumption. This paper addresses system identification for the continuous-discrete filter, with the aim of generalizing learning for the Kalman filter by relying on a solution to a continuous-time It^o stochastic differential equation (SDE) for the latent state and covariance dynamics. We introduce a novel two-filter, analytical form for the posterior with a Bayesian derivation, which yields analytical updates which do not require the forward-pass to be pre-computed. Using this analytical and efficient computation of the posterior, we provide an EM procedure which estimates the parameters of the SDE, naturally incorporating irregularly sampled measurements. Generalizing the learning of latent linear dynamical systems (LDS) to continuous-time may extend the use of the hybrid Kalman filter to data which is not regularly sampled or has intermittent missing values, and can extend the power of non-linear system identification methods such as switching LDS (SLDS), which rely on EM for the linear discrete-time Kalman filter as a sub-unit for learning locally linearized behavior of a non-linear system. We apply the method by learning the parameters of a latent, multivariate Fokker-Planck SDE representing a toggle-switch genetic circuit using biologically realistic parameters, and compare the efficacy of learning relative to the discrete-time Kalman filter as the step-size irregularity and spectral-radius of the dynamics-matrix increases.

6/28/2024

cs.LG cs.SY eess.SY

Modeling Unknown Stochastic Dynamical System Subject to External Excitation

Yuan Chen, Dongbin Xiu

We present a numerical method for learning unknown nonautonomous stochastic dynamical system, i.e., stochastic system subject to time dependent excitation or control signals. Our basic assumption is that the governing equations for the stochastic system are unavailable. However, short bursts of input/output (I/O) data consisting of certain known excitation signals and their corresponding system responses are available. When a sufficient amount of such I/O data are available, our method is capable of learning the unknown dynamics and producing an accurate predictive model for the stochastic responses of the system subject to arbitrary excitation signals not in the training data. Our method has two key components: (1) a local approximation of the training I/O data to transfer the learning into a parameterized form; and (2) a generative model to approximate the underlying unknown stochastic flow map in distribution. After presenting the method in detail, we present a comprehensive set of numerical examples to demonstrate the performance of the proposed method, especially for long-term system predictions.

6/26/2024

cs.LG cs.SY eess.SY