Reinforcement learning-based estimation for partial differential equations

2302.01189

Published 4/5/2024 by Saviz Mowlavi, Mouhacine Benosman

🏅

Abstract

In systems governed by nonlinear partial differential equations such as fluid flows, the design of state estimators such as Kalman filters relies on a reduced-order model (ROM) that projects the original high-dimensional dynamics onto a computationally tractable low-dimensional space. However, ROMs are prone to large errors, which negatively affects the performance of the estimator. Here, we introduce the reinforcement learning reduced-order estimator (RL-ROE), a ROM-based estimator in which the correction term that takes in the measurements is given by a nonlinear policy trained through reinforcement learning. The nonlinearity of the policy enables the RL-ROE to compensate efficiently for errors of the ROM, while still taking advantage of the imperfect knowledge of the dynamics. Using examples involving the Burgers and Navier-Stokes equations, we show that in the limit of very few sensors, the trained RL-ROE outperforms a Kalman filter designed using the same ROM. Moreover, it yields accurate high-dimensional state estimates for trajectories corresponding to various physical parameter values, without direct knowledge of the latter.

Create account to get full access

Overview

Fluid dynamics and other nonlinear systems are modeled using partial differential equations, which are computationally complex
State estimators like Kalman filters rely on reduced-order models (ROMs) to make these systems tractable, but ROMs can have large errors
This paper introduces the reinforcement learning reduced-order estimator (RL-ROE), which uses a nonlinear policy trained via reinforcement learning to compensate for ROM errors

Plain English Explanation

Fluid flows and other complex systems are often described using sophisticated mathematical models called partial differential equations. However, these models can be very computationally intensive to work with. To make things more manageable, researchers often use reduced-order models (ROMs) - simplified versions of the original models that capture the most important dynamics.

One common application of ROMs is in state estimators like Kalman filters, which aim to infer the full state of a system from limited sensor measurements. But the simplifications made in ROMs can introduce significant errors, degrading the performance of these estimators.

To address this, the researchers developed the reinforcement learning reduced-order estimator (RL-ROE). This approach uses a nonlinear "policy" - a kind of mathematical function - that is trained using reinforcement learning techniques. The policy allows the RL-ROE to effectively compensate for the errors in the underlying ROM, while still taking advantage of the information it provides.

The key idea is that the nonlinearity of the trained policy enables the RL-ROE to make more accurate state estimates, especially when there are very few sensors available. This could be valuable in applications like weather forecasting, aerodynamics, or other areas where detailed measurements are difficult or expensive to obtain.

Technical Explanation

The paper introduces the reinforcement learning reduced-order estimator (RL-ROE), a state estimation framework for systems governed by nonlinear partial differential equations. These types of systems, which include fluid flows, are computationally complex to model directly.

Traditionally, state estimators like Kalman filters have relied on reduced-order models (ROMs) to make the estimation problem tractable. ROMs project the original high-dimensional dynamics onto a low-dimensional space. However, ROMs are prone to significant errors, which can negatively impact the performance of the estimator.

The RL-ROE addresses this issue by incorporating a nonlinear "policy" function, trained using reinforcement learning techniques. This policy takes in the sensor measurements and provides a correction term to the ROM-based estimate, allowing the RL-ROE to compensate for the ROM's errors.

The researchers demonstrate the RL-ROE's effectiveness using examples involving the Burgers and Navier-Stokes equations. They show that in the limit of very few sensors, the trained RL-ROE outperforms a Kalman filter designed using the same ROM. Additionally, the RL-ROE can provide accurate high-dimensional state estimates for a variety of physical parameter values, without requiring direct knowledge of those parameters.

Critical Analysis

The RL-ROE approach represents a promising advancement in state estimation for complex, nonlinear systems. By leveraging reinforcement learning to train a nonlinear policy, the method can effectively compensate for the errors inherent in reduced-order models (ROMs), which are crucial for making these systems computationally tractable.

However, the paper does not address several potential limitations and areas for further research. For example, it is unclear how the RL-ROE would scale to very high-dimensional systems, or how sensitive the method is to the quality of the underlying ROM. Additionally, the training process for the reinforcement learning policy may be computationally intensive, which could limit its real-world applicability.

Decentralized learning strategies could be an interesting area to explore, as they may help address scalability and computational challenges. It would also be valuable to see the RL-ROE tested on a wider range of nonlinear systems, beyond the Burgers and Navier-Stokes examples provided.

Overall, the RL-ROE represents an important step forward in state estimation for complex systems, but further research is needed to fully understand its capabilities, limitations, and potential applications.

Conclusion

This paper introduces the reinforcement learning reduced-order estimator (RL-ROE), a novel state estimation framework for systems governed by nonlinear partial differential equations. By incorporating a nonlinear policy trained through reinforcement learning, the RL-ROE can effectively compensate for the errors inherent in the reduced-order models (ROMs) typically used in state estimators like Kalman filters.

The key innovation is the ability of the RL-ROE to leverage the imperfect knowledge of the system dynamics provided by the ROM, while using the trained policy to correct for the ROM's shortcomings. This allows the RL-ROE to outperform a Kalman filter based on the same ROM, particularly in the challenging scenario of very few sensors.

The potential impact of this research is significant, as it could lead to more accurate state estimates in a wide range of applications, from fluid dynamics and aerodynamics to weather forecasting and beyond. By making it possible to infer high-dimensional system states from limited measurements, the RL-ROE could enable new breakthroughs in modeling, control, and decision-making for complex, nonlinear systems.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Koopman-based Deep Learning for Nonlinear System Estimation

Zexin Sun, Mingyu Chen, John Baillieul

Nonlinear differential equations are encountered as models of fluid flow, spiking neurons, and many other systems of interest in the real world. Common features of these systems are that their behaviors are difficult to describe exactly and invariably unmodeled dynamics present challenges in making precise predictions. In many cases the models exhibit extremely complicated behavior due to bifurcations and chaotic regimes. In this paper, we present a novel data-driven linear estimator that uses Koopman operator theory to extract finite-dimensional representations of complex nonlinear systems. The extracted model is used together with a deep reinforcement learning network that learns the optimal stepwise actions to predict future states of the original nonlinear system. Our estimator is also adaptive to a diffeomorphic transformation of the nonlinear system which enables transfer learning to compute state estimates of the transformed system without relearning from scratch.

5/2/2024

eess.SY cs.LG cs.SY

📊

Physics-constrained robust learning of open-form partial differential equations from limited and noisy data

Mengge Du, Yuntian Chen, Longfeng Nie, Siyu Lou, Dongxiao Zhang

Unveiling the underlying governing equations of nonlinear dynamic systems remains a significant challenge. Insufficient prior knowledge hinders the determination of an accurate candidate library, while noisy observations lead to imprecise evaluations, which in turn result in redundant function terms or erroneous equations. This study proposes a framework to robustly uncover open-form partial differential equations (PDEs) from limited and noisy data. The framework operates through two alternating update processes: discovering and embedding. The discovering phase employs symbolic representation and a novel reinforcement learning (RL)-guided hybrid PDE generator to efficiently produce diverse open-form PDEs with tree structures. A neural network-based predictive model fits the system response and serves as the reward evaluator for the generated PDEs. PDEs with higher rewards are utilized to iteratively optimize the generator via the RL strategy and the best-performing PDE is selected by a parameter-free stability metric. The embedding phase integrates the initially identified PDE from the discovering process as a physical constraint into the predictive model for robust training. The traversal of PDE trees automates the construction of the computational graph and the embedding process without human intervention. Numerical experiments demonstrate our framework's capability to uncover governing equations from nonlinear dynamic systems with limited and highly noisy data and outperform other physics-informed neural network-based discovery methods. This work opens new potential for exploring real-world systems with limited understanding.

4/30/2024

cs.LG cs.NA

🤿

PTPI-DL-ROMs: pre-trained physics-informed deep learning-based reduced order models for nonlinear parametrized PDEs

Simone Brivio, Stefania Fresca, Andrea Manzoni

The coupling of Proper Orthogonal Decomposition (POD) and deep learning-based ROMs (DL-ROMs) has proved to be a successful strategy to construct non-intrusive, highly accurate, surrogates for the real time solution of parametric nonlinear time-dependent PDEs. Inexpensive to evaluate, POD-DL-ROMs are also relatively fast to train, thanks to their limited complexity. However, POD-DL-ROMs account for the physical laws governing the problem at hand only through the training data, that are usually obtained through a full order model (FOM) relying on a high-fidelity discretization of the underlying equations. Moreover, the accuracy of POD-DL-ROMs strongly depends on the amount of available data. In this paper, we consider a major extension of POD-DL-ROMs by enforcing the fulfillment of the governing physical laws in the training process -- that is, by making them physics-informed -- to compensate for possible scarce and/or unavailable data and improve the overall reliability. To do that, we first complement POD-DL-ROMs with a trunk net architecture, endowing them with the ability to compute the problem's solution at every point in the spatial domain, and ultimately enabling a seamless computation of the physics-based loss by means of the strong continuous formulation. Then, we introduce an efficient training strategy that limits the notorious computational burden entailed by a physics-informed training phase. In particular, we take advantage of the few available data to develop a low-cost pre-training procedure; then, we fine-tune the architecture in order to further improve the prediction reliability. Accuracy and efficiency of the resulting pre-trained physics-informed DL-ROMs (PTPI-DL-ROMs) are then assessed on a set of test cases ranging from non-affinely parametrized advection-diffusion-reaction equations, to nonlinear problems like the Navier-Stokes equations for fluid flows.

5/15/2024

cs.LG cs.NA

Recurrent Deep Kernel Learning of Dynamical Systems

Nicol`o Botteghi, Paolo Motta, Andrea Manzoni, Paolo Zunino, Mengwu Guo

Digital twins require computationally-efficient reduced-order models (ROMs) that can accurately describe complex dynamics of physical assets. However, constructing ROMs from noisy high-dimensional data is challenging. In this work, we propose a data-driven, non-intrusive method that utilizes stochastic variational deep kernel learning (SVDKL) to discover low-dimensional latent spaces from data and a recurrent version of SVDKL for representing and predicting the evolution of latent dynamics. The proposed method is demonstrated with two challenging examples -- a double pendulum and a reaction-diffusion system. Results show that our framework is capable of (i) denoising and reconstructing measurements, (ii) learning compact representations of system states, (iii) predicting system evolution in low-dimensional latent spaces, and (iv) quantifying modeling uncertainties.

5/31/2024

cs.LG stat.ML