Lyapunov-stable Neural Control for State and Output Feedback: A Novel Formulation for Efficient Synthesis and Verification

2404.07956

Published 6/6/2024 by Lujie Yang, Hongkai Dai, Zhouxing Shi, Cho-Jui Hsieh, Russ Tedrake, Huan Zhang

Lyapunov-stable Neural Control for State and Output Feedback: A Novel Formulation for Efficient Synthesis and Verification

Abstract

Learning-based neural network (NN) control policies have shown impressive empirical performance in a wide range of tasks in robotics and control. However, formal (Lyapunov) stability guarantees over the region-of-attraction (ROA) for NN controllers with nonlinear dynamical systems are challenging to obtain, and most existing approaches rely on expensive solvers such as sums-of-squares (SOS), mixed-integer programming (MIP), or satisfiability modulo theories (SMT). In this paper, we demonstrate a new framework for learning NN controllers together with Lyapunov certificates using fast empirical falsification and strategic regularizations. We propose a novel formulation that defines a larger verifiable region-of-attraction (ROA) than shown in the literature, and refines the conventional restrictive constraints on Lyapunov derivatives to focus only on certifiable ROAs. The Lyapunov condition is rigorously verified post-hoc using branch-and-bound with scalable linear bound propagation-based NN verification techniques. The approach is efficient and flexible, and the full training and verification procedure is accelerated on GPUs without relying on expensive solvers for SOS, MIP, nor SMT. The flexibility and efficiency of our framework allow us to demonstrate Lyapunov-stable output feedback control with synthesized NN-based controllers and NN-based observers with formal stability guarantees, for the first time in literature. Source code at https://github.com/Verified-Intelligence/Lyapunov_Stable_NN_Controllers

Create account to get full access

Overview

Presents a novel approach for designing and verifying Lyapunov-stable neural network controllers for state and output feedback control problems
Aims to address the limitations of existing methods by introducing a more efficient synthesis and verification process
Demonstrates the effectiveness of the proposed approach through simulations and real-world experiments

Plain English Explanation

This research paper introduces a new way to design and verify neural network controllers that are Lyapunov-stable. Lyapunov stability is a crucial property that ensures the system's behavior remains within desired bounds, even in the presence of disturbances or uncertainties.

The researchers recognized that existing methods for designing Lyapunov-stable neural network controllers can be complex and time-consuming. To address this, they developed a novel formulation that streamlines the synthesis and verification process, making it more efficient.

The key idea is to directly incorporate Lyapunov stability constraints into the neural network training process, rather than having to verify stability as a separate step. This allows the researchers to efficiently obtain neural network controllers that are guaranteed to be Lyapunov-stable by design.

The researchers demonstrate the effectiveness of their approach through simulations and real-world experiments, showing that the resulting controllers can effectively control the system while maintaining Lyapunov stability. This is an important advancement, as Lyapunov-stable controllers are crucial for safety-critical applications where system stability must be ensured.

Technical Explanation

The researchers present a novel formulation for designing and verifying Lyapunov-stable neural network controllers for both state and output feedback control problems. This work builds upon previous research on using control Lyapunov functions to synthesize neural network controllers and using Lyapunov certificates to learn robust policies.

The key innovation of this work is the introduction of a direct Lyapunov stability constraint into the neural network training process. This allows the researchers to obtain neural network controllers that are guaranteed to be Lyapunov-stable by design, without the need for a separate verification step.

The researchers demonstrate the effectiveness of their approach through simulations and real-world experiments on several control problems, including a hybrid system example and a robust control problem with Hamilton-Jacobi-Isaacs analysis. The results show that the proposed method can efficiently synthesize neural network controllers that maintain Lyapunov stability while achieving desired control objectives.

Critical Analysis

The researchers acknowledge that their approach relies on the existence of a suitable Lyapunov function, which may not always be easy to find, especially for complex systems. They suggest that their framework could potentially be extended to use data-driven Lyapunov function learning, as explored in the stability-based abstraction framework for reach-avoid control.

Additionally, the researchers note that their approach may be computationally more expensive than some existing methods, due to the additional Lyapunov stability constraints. This could be a potential limitation for real-time applications with strict computational requirements.

Overall, the researchers have presented a novel and promising approach for designing Lyapunov-stable neural network controllers. While there are some potential limitations, the ability to directly incorporate stability constraints into the neural network training process is a significant advancement in the field of neural control.

Conclusion

This research paper introduces a new method for designing and verifying Lyapunov-stable neural network controllers for state and output feedback control problems. By directly incorporating Lyapunov stability constraints into the neural network training process, the researchers have developed a more efficient synthesis and verification approach compared to existing methods.

The proposed framework has been demonstrated to be effective through simulations and real-world experiments, showcasing its ability to generate neural network controllers that maintain Lyapunov stability while achieving desired control objectives. This is an important advancement, as Lyapunov-stable controllers are crucial for safety-critical applications where system stability must be ensured.

While the approach has some potential limitations, such as the need for a suitable Lyapunov function and the potential for increased computational complexity, the researchers have made a significant contribution to the field of neural control. Their work paves the way for further research and development of efficient, Lyapunov-stable neural network controllers for a wide range of applications.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Distributionally Robust Policy and Lyapunov-Certificate Learning

Kehan Long, Jorge Cortes, Nikolay Atanasov

This article presents novel methods for synthesizing distributionally robust stabilizing neural controllers and certificates for control systems under model uncertainty. A key challenge in designing controllers with stability guarantees for uncertain systems is the accurate determination of and adaptation to shifts in model parametric uncertainty during online deployment. We tackle this with a novel distributionally robust formulation of the Lyapunov derivative chance constraint ensuring a monotonic decrease of the Lyapunov certificate. To avoid the computational complexity involved in dealing with the space of probability measures, we identify a sufficient condition in the form of deterministic convex constraints that ensures the Lyapunov derivative constraint is satisfied. We integrate this condition into a loss function for training a neural network-based controller and show that, for the resulting closed-loop system, the global asymptotic stability of its equilibrium can be certified with high confidence, even with Out-of-Distribution (OoD) model uncertainties. To demonstrate the efficacy and efficiency of the proposed methodology, we compare it with an uncertainty-agnostic baseline approach and several reinforcement learning approaches in two control problems in simulation.

4/8/2024

eess.SY cs.LG cs.RO cs.SY

🧠

Learning-Based Verification of Stochastic Dynamical Systems with Neural Network Policies

Thom Badings, Wietze Koops, Sebastian Junges, Nils Jansen

We consider the verification of neural network policies for reach-avoid control tasks in stochastic dynamical systems. We use a verification procedure that trains another neural network, which acts as a certificate proving that the policy satisfies the task. For reach-avoid tasks, it suffices to show that this certificate network is a reach-avoid supermartingale (RASM). As our main contribution, we significantly accelerate algorithmic approaches for verifying that a neural network is indeed a RASM. The main bottleneck of these approaches is the discretization of the state space of the dynamical system. The following two key contributions allow us to use a coarser discretization than existing approaches. First, we present a novel and fast method to compute tight upper bounds on Lipschitz constants of neural networks based on weighted norms. We further improve these bounds on Lipschitz constants based on the characteristics of the certificate network. Second, we integrate an efficient local refinement scheme that dynamically refines the state space discretization where necessary. Our empirical evaluation shows the effectiveness of our approach for verifying neural network policies in several benchmarks and trained with different reinforcement learning algorithms.

6/4/2024

cs.LG cs.SY eess.SY

🏅

Verified Safe Reinforcement Learning for Neural Network Dynamic Models

Junlin Wu, Huan Zhang, Yevgeniy Vorobeychik

Learning reliably safe autonomous control is one of the core problems in trustworthy autonomy. However, training a controller that can be formally verified to be safe remains a major challenge. We introduce a novel approach for learning verified safe control policies in nonlinear neural dynamical systems while maximizing overall performance. Our approach aims to achieve safety in the sense of finite-horizon reachability proofs, and is comprised of three key parts. The first is a novel curriculum learning scheme that iteratively increases the verified safe horizon. The second leverages the iterative nature of gradient-based learning to leverage incremental verification, reusing information from prior verification runs. Finally, we learn multiple verified initial-state-dependent controllers, an idea that is especially valuable for more complex domains where learning a single universal verified safe controller is extremely challenging. Our experiments on five safe control problems demonstrate that our trained controllers can achieve verified safety over horizons that are as much as an order of magnitude longer than state-of-the-art baselines, while maintaining high reward, as well as a perfect safety record over entire episodes.

5/28/2024

cs.LG cs.AI

Ensuring Both Positivity and Stability Using Sector-Bounded Nonlinearity for Systems with Neural Network Controllers

Hamidreza Montazeri Hedesh, Milad Siami

This paper introduces a novel method for the stability analysis of positive feedback systems with a class of fully connected feedforward neural networks (FFNN) controllers. By establishing sector bounds for fully connected FFNNs without biases, we present a stability theorem that demonstrates the global exponential stability of linear systems under fully connected FFNN control. Utilizing principles from positive Lur'e systems and the positive Aizerman conjecture, our approach effectively addresses the challenge of ensuring stability in highly nonlinear systems. The crux of our method lies in maintaining sector bounds that preserve the positivity and Hurwitz property of the overall Lur'e system. We showcase the practical applicability of our methodology through its implementation in a linear system managed by a FFNN trained on output feedback controller data, highlighting its potential for enhancing stability in dynamic systems.

6/19/2024

eess.SY cs.AI cs.SY