Verified Safe Reinforcement Learning for Neural Network Dynamic Models

2405.15994

Published 5/28/2024 by Junlin Wu, Huan Zhang, Yevgeniy Vorobeychik

🏅

Abstract

Learning reliably safe autonomous control is one of the core problems in trustworthy autonomy. However, training a controller that can be formally verified to be safe remains a major challenge. We introduce a novel approach for learning verified safe control policies in nonlinear neural dynamical systems while maximizing overall performance. Our approach aims to achieve safety in the sense of finite-horizon reachability proofs, and is comprised of three key parts. The first is a novel curriculum learning scheme that iteratively increases the verified safe horizon. The second leverages the iterative nature of gradient-based learning to leverage incremental verification, reusing information from prior verification runs. Finally, we learn multiple verified initial-state-dependent controllers, an idea that is especially valuable for more complex domains where learning a single universal verified safe controller is extremely challenging. Our experiments on five safe control problems demonstrate that our trained controllers can achieve verified safety over horizons that are as much as an order of magnitude longer than state-of-the-art baselines, while maintaining high reward, as well as a perfect safety record over entire episodes.

Create account to get full access

Overview

This paper proposes a method for verifying the safety of reinforcement learning agents with neural network dynamic models.
The approach combines formal verification techniques with reinforcement learning to ensure the agent's actions remain within safe bounds, even in the presence of uncertainty or disturbances.
The authors demonstrate the effectiveness of their method on several benchmark control tasks, showing it can provide strong safety guarantees while maintaining good performance.

Plain English Explanation

The paper tackles the challenge of making reinforcement learning (RL) systems safe and reliable, even when they are controlling complex, unknown environments. RL is a powerful technique where an agent learns to take actions that maximize a reward signal, but it can be risky if the agent makes mistakes or encounters unanticipated situations.

The researchers developed a new method that combines RL with formal verification techniques. Formal verification involves mathematically proving that a system will behave correctly, no matter what inputs or disturbances it encounters. By integrating this into the RL process, the researchers were able to create agents that not only learn effective behaviors, but are also guaranteed to remain within safe operating bounds.

The key innovation is using a neural network to model the system dynamics, which provides a flexible way to capture complex environments, while still allowing for formal verification as described in this paper. This allows the agent to learn optimal policies while ensuring safety as explored in this related work.

The researchers demonstrated their approach on several benchmark control problems, showing it can provide strong safety guarantees while maintaining good performance. This represents an important step towards making RL systems safe enough for real-world deployment, where safety is critical.

Technical Explanation

The paper presents a method for verifying the safety of reinforcement learning (RL) agents with neural network dynamic models. The approach combines formal verification techniques with RL to ensure the agent's actions remain within safe bounds, even in the presence of uncertainty or disturbances.

The key components of the method are:

Neural network model: The system dynamics are modeled using a neural network, which can capture complex, nonlinear relationships.
Safety verification: Formal verification is used to mathematically prove that the agent will remain within a safe region of the state space, regardless of the inputs or disturbances it encounters. This is done by constructing a Lyapunov function that serves as a certificate of safety.
Safe RL: The safety verification is integrated into the RL process, allowing the agent to learn an optimal policy while respecting the safety constraints. This is achieved through a constrained optimization formulation.

The authors demonstrate the effectiveness of their method on several benchmark control tasks, including inverted pendulum, cartpole, and quadrotor control. They show that the approach can provide strong safety guarantees while maintaining good performance, outperforming baseline RL methods that do not consider safety.

Critical Analysis

The paper presents a promising approach for ensuring the safety of RL systems, which is a critical challenge in deploying these techniques in the real world. The integration of formal verification with RL is a clever way to overcome the limitations of each individual technique.

One potential limitation of the method is the reliance on a neural network model of the system dynamics. While this provides flexibility, it may also introduce modeling errors that could compromise the safety guarantees. The authors acknowledge this and suggest exploring ways to account for model uncertainty in the safety verification process.

Additionally, the method may be computationally intensive, as the safety verification step adds significant complexity to the RL optimization problem. The authors do not provide a detailed analysis of the computational requirements, which could be an important consideration for real-world deployment.

Finally, the paper focuses on single-agent scenarios, but many real-world systems involve multiple interacting agents. Extending the approach to handle multi-agent settings could be an important area for future research.

Conclusion

This paper presents a novel approach for verifying the safety of reinforcement learning agents with neural network dynamic models. By combining formal verification techniques with RL, the method can provide strong safety guarantees while maintaining good performance on benchmark control tasks.

The work represents an important step towards making RL systems safe enough for real-world deployment, where safety is critical. The integration of formal methods with learning-based approaches is a promising direction that could have significant impact across a range of applications, from autonomous vehicles to industrial control systems.

While the method has some limitations, such as the reliance on accurate dynamic models and potential computational complexity, the authors have demonstrated the feasibility and effectiveness of their approach. Further research to address these challenges could lead to even more robust and reliable RL systems that can be safely deployed in high-stakes environments.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

🧠

Provably Safe Neural Network Controllers via Differential Dynamic Logic

Samuel Teuber, Stefan Mitsch, Andr'e Platzer

While neural networks (NNs) have potential as autonomous controllers for Cyber-Physical Systems, verifying the safety of NN based control systems (NNCSs) poses significant challenges for the practical use of NNs, especially when safety is needed for unbounded time horizons. One reason is the intractability of analyzing NNs, ODEs and hybrid systems. To this end, we introduce VerSAILLE (Verifiably Safe AI via Logically Linked Envelopes): The first general approach that allows reusing control theory results for NNCS verification. By joining forces, we exploit the efficiency of NN verification tools while retaining the rigor of differential dynamic logic (dL). Based on provably safe control envelopes in dL, we derive specifications for the NN which is proven via NN verification. We show that a proof of the NN adhering to the specification is mirrored by a dL proof on the infinite-time safety of the NNCS. The NN verification properties resulting from hybrid systems typically contain nonlinear arithmetic and arbitrary logical structures while efficient NN verification merely supports linear constraints. To overcome this divide, we present Mosaic: An efficient, sound and complete verification approach for polynomial real arithmetic properties on piece-wise linear NNs. Mosaic partitions complex verification queries into simple queries and lifts off-the-shelf linear constraint tools to the nonlinear setting in a completeness-preserving manner by combining approximation with exact reasoning for counterexample regions. Our evaluation demonstrates the versatility of VerSAILLE and Mosaic: We prove infinite-time safety on the classical Vertical Airborne Collision Avoidance NNCS verification benchmark for two scenarios while (exhaustively) enumerating counterexample regions in unsafe scenarios. We also show that our approach significantly outperforms State-of-the-Art tools in closed-loop NNV.

6/17/2024

cs.SY cs.AI cs.LG cs.LO

Providing Safety Assurances for Systems with Unknown Dynamics

Hao Wang, Javier Borquez, Somil Bansal

As autonomous systems become more complex and integral in our society, the need to accurately model and safely control these systems has increased significantly. In the past decade, there has been tremendous success in using deep learning techniques to model and control systems that are difficult to model using first principles. However, providing safety assurances for such systems remains difficult, partially due to the uncertainty in the learned model. In this work, we aim to provide safety assurances for systems whose dynamics are not readily derived from first principles and, hence, are more advantageous to be learned using deep learning techniques. Given the system of interest and safety constraints, we learn an ensemble model of the system dynamics from data. Leveraging ensemble uncertainty as a measure of uncertainty in the learned dynamics model, we compute a maximal robust control invariant set, starting from which the system is guaranteed to satisfy the safety constraints under the condition that realized model uncertainties are contained in the predefined set of admissible model uncertainty. We demonstrate the effectiveness of our method using a simulated case study with an inverted pendulum and a hardware experiment with a TurtleBot. The experiments show that our method robustifies the control actions of the system against model uncertainty and generates safe behaviors without being overly restrictive. The codes and accompanying videos can be found on the project website.

5/7/2024

cs.RO cs.SY eess.SY

Formally Verifying Deep Reinforcement Learning Controllers with Lyapunov Barrier Certificates

Udayan Mandal, Guy Amir, Haoze Wu, Ieva Daukantas, Fletcher Lee Newell, Umberto J. Ravaioli, Baoluo Meng, Michael Durling, Milan Ganai, Tobey Shim, Guy Katz, Clark Barrett

Deep reinforcement learning (DRL) is a powerful machine learning paradigm for generating agents that control autonomous systems. However, the black box nature of DRL agents limits their deployment in real-world safety-critical applications. A promising approach for providing strong guarantees on an agent's behavior is to use Neural Lyapunov Barrier (NLB) certificates, which are learned functions over the system whose properties indirectly imply that an agent behaves as desired. However, NLB-based certificates are typically difficult to learn and even more difficult to verify, especially for complex systems. In this work, we present a novel method for training and verifying NLB-based certificates for discrete-time systems. Specifically, we introduce a technique for certificate composition, which simplifies the verification of highly-complex systems by strategically designing a sequence of certificates. When jointly verified with neural network verification engines, these certificates provide a formal guarantee that a DRL agent both achieves its goals and avoids unsafe behavior. Furthermore, we introduce a technique for certificate filtering, which significantly simplifies the process of producing formally verified certificates. We demonstrate the merits of our approach with a case study on providing safety and liveness guarantees for a DRL-controlled spacecraft.

5/24/2024

cs.AI cs.LG cs.SY eess.SY

🧠

Learning-Based Verification of Stochastic Dynamical Systems with Neural Network Policies

Thom Badings, Wietze Koops, Sebastian Junges, Nils Jansen

We consider the verification of neural network policies for reach-avoid control tasks in stochastic dynamical systems. We use a verification procedure that trains another neural network, which acts as a certificate proving that the policy satisfies the task. For reach-avoid tasks, it suffices to show that this certificate network is a reach-avoid supermartingale (RASM). As our main contribution, we significantly accelerate algorithmic approaches for verifying that a neural network is indeed a RASM. The main bottleneck of these approaches is the discretization of the state space of the dynamical system. The following two key contributions allow us to use a coarser discretization than existing approaches. First, we present a novel and fast method to compute tight upper bounds on Lipschitz constants of neural networks based on weighted norms. We further improve these bounds on Lipschitz constants based on the characteristics of the certificate network. Second, we integrate an efficient local refinement scheme that dynamically refines the state space discretization where necessary. Our empirical evaluation shows the effectiveness of our approach for verifying neural network policies in several benchmarks and trained with different reinforcement learning algorithms.

6/4/2024

cs.LG cs.SY eess.SY