ChatScene: Knowledge-Enabled Safety-Critical Scenario Generation for Autonomous Vehicles

arXiv:2405.14062

Published 5/24/2024 by Jiawei Zhang, Chejian Xu, Bo Li

Abstract

We present ChatScene, a Large Language Model (LLM)-based agent that leverages the capabilities of LLMs to generate safety-critical scenarios for autonomous vehicles. Given unstructured language instructions, the agent first generates textually described traffic scenarios using LLMs. These scenario descriptions are subsequently broken down into several sub-descriptions for specified details such as behaviors and locations of vehicles. The agent then distinctively transforms the textually described sub-scenarios into domain-specific languages, which then generate actual code for prediction and control in simulators, facilitating the creation of diverse and complex scenarios within the CARLA simulation environment. A key part of our agent is a comprehensive knowledge retrieval component, which efficiently translates specific textual descriptions into corresponding domain-specific code snippets by training a knowledge database containing the scenario description and code pairs. Extensive experimental results underscore the efficacy of ChatScene in improving the safety of autonomous vehicles. For instance, the scenarios generated by ChatScene show a 15% increase in collision rates compared to state-of-the-art baselines when tested against different reinforcement learning-based ego vehicles. Furthermore, we show that by using our generated safety-critical scenarios to fine-tune different RL-based autonomous driving models, they can achieve a 9% reduction in collision rates, surpassing current SOTA methods. ChatScene effectively bridges the gap between textual descriptions of traffic scenarios and practical CARLA simulations, providing a unified way to conveniently generate safety-critical scenarios for safety testing and improvement for AVs.

Overview

This paper presents ChatScene, a Large Language Model (LLM)-based agent that generates safety-critical scenarios for autonomous vehicles.
ChatScene leverages the capabilities of LLMs to create textually described traffic scenarios, which are then transformed into domain-specific languages and code for simulation in CARLA.
The agent uses a comprehensive knowledge retrieval component to efficiently translate textual descriptions into corresponding code snippets, enabling the generation of diverse and complex scenarios.
The research aims to improve the safety of autonomous vehicles by providing a unified way to generate safety-critical scenarios for testing and improvement.

Plain English Explanation

ChatScene is a system that uses powerful language models to create detailed descriptions of traffic scenarios that could be challenging for self-driving cars. These scenarios are then automatically translated into the specific computer code needed to simulate them in a virtual environment, like CARLA.

Imagine you're training a self-driving car to navigate the roads safely. You want to test it in a wide variety of situations, including some that might be rare or particularly hazardous. With ChatScene, you can simply describe these scenarios in plain language, like "A pedestrian suddenly steps out in front of the car on a rainy night." The system then takes that description and generates the necessary code to recreate that scenario in a simulation, allowing the self-driving car to be trained and tested on it.

This is useful because it's often difficult and time-consuming to manually create the diverse range of scenarios needed to thoroughly test autonomous vehicles. ChatScene streamlines this process by bridging the gap between high-level descriptions and the technical details required for simulation. By generating more challenging and safety-critical scenarios, the researchers hope to improve the overall safety and performance of self-driving cars.

Technical Explanation

The key components of ChatScene are:

Scenario Generation: The agent uses LLMs to generate textually described traffic scenarios based on unstructured language instructions.
Scenario Breakdown: The textual descriptions are broken down into sub-descriptions for specific details, such as the behaviors and locations of vehicles.
Scenario Translation: The textually described sub-scenarios are transformed into domain-specific languages, which are then used to generate code for prediction and control in the CARLA simulation environment.
Knowledge Retrieval: A key component is a knowledge retrieval system that maps each textual sub-description to a corresponding domain-specific code snippet. It is backed by a trained database of paired scenario descriptions and code.
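
The breakdown, retrieval, and translation steps listed above can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the knowledge base entries, the component names ("behavior", "position"), the placeholder snippet strings, and the helper functions are all hypothetical, and a simple bag-of-words cosine similarity stands in for whatever retrieval model the paper actually trains.

```python
import math
import re
from collections import Counter

# Hypothetical knowledge base: (sub-description, snippet) pairs per component.
# The snippets are placeholder strings, not real simulator or DSL code.
KNOWLEDGE_BASE = {
    "behavior": [
        ("the adversarial vehicle brakes suddenly", "behavior: sudden_brake"),
        ("the pedestrian crosses the road in front of the ego vehicle",
         "behavior: cross_road"),
    ],
    "position": [
        ("the adversary is in the lane ahead of the ego vehicle",
         "position: lane_ahead"),
        ("the adversary is on the sidewalk to the right",
         "position: right_sidewalk"),
    ],
}


def bag_of_words(text):
    """Lowercase word counts; a stand-in for a learned text embedding."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two word-count vectors."""
    dot = sum(a[token] * b[token] for token in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(component, query):
    """Return the snippet whose stored description best matches the query."""
    q = bag_of_words(query)
    _, snippet = max(
        KNOWLEDGE_BASE[component],
        key=lambda pair: cosine(q, bag_of_words(pair[0])),
    )
    return snippet


def assemble(sub_descriptions):
    """Retrieve one snippet per sub-description and join them into a script."""
    return "\n".join(
        retrieve(component, text) for component, text in sub_descriptions.items()
    )


# Sub-descriptions as an LLM might produce them from a scenario description.
script = assemble({
    "behavior": "a pedestrian suddenly crosses the road",
    "position": "the pedestrian waits on the right sidewalk",
})
print(script)
```

Keeping one lookup per component mirrors the idea of splitting a scenario into separate sub-descriptions, so that each detail is matched against descriptions of the same kind.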

Through extensive experiments, the researchers report that scenarios generated by ChatScene produce a 15% higher collision rate than state-of-the-art baselines when tested against different reinforcement learning-based ego vehicles, which indicates that the generated scenarios are more challenging. Fine-tuning those RL-based driving models on the generated safety-critical scenarios then reduces their collision rate by 9%, surpassing current state-of-the-art methods.
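
To make the collision-rate metric concrete, here is a small sketch of how such a comparison could be computed. The episode outcomes are made up for illustration and are not data from the paper. The summary also does not say whether the reported 15% and 9% figures are absolute or relative changes; the sketch assumes an absolute difference in percentage points.

```python
def collision_rate(outcomes):
    """Fraction of episodes that ended in a collision (True = collision)."""
    if not outcomes:
        raise ValueError("need at least one episode")
    return sum(outcomes) / len(outcomes)


# Made-up outcomes for ten episodes each; illustration only.
baseline_scenarios = [True, False, False, True, False,
                      False, True, False, False, False]
generated_scenarios = [True, True, False, True, False,
                       True, True, False, False, False]

baseline = collision_rate(baseline_scenarios)
generated = collision_rate(generated_scenarios)

# Assumed reading: the gap is an absolute difference in percentage points.
gap_points = (generated - baseline) * 100
print(f"baseline={baseline:.0%} generated={generated:.0%} gap={gap_points:.0f} points")
```

For a scenario generator, a higher collision rate against a fixed ego vehicle means harder scenarios; for an ego vehicle after fine-tuning, a lower rate on the same scenarios means safer driving.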

Critical Analysis

The research presented in this paper is a promising step towards improving the safety of autonomous vehicles through the generation of diverse and challenging traffic scenarios. By leveraging the power of large language models, ChatScene provides a convenient and efficient way to create scenarios that can be used for testing and improving self-driving car algorithms.

One potential limitation of the approach is the reliance on the accuracy and completeness of the knowledge retrieval component. If the system's database of scenario descriptions and corresponding code is incomplete or biased, the generated scenarios may not fully capture the complexity of real-world traffic situations. Ongoing research into enhancing autonomous vehicle training through language model integration and generating probabilistic scenario programs from natural language may help address this issue.

Additionally, while the researchers demonstrate the effectiveness of ChatScene in improving the performance of reinforcement learning-based autonomous driving models, it would be interesting to see how the system performs with other types of self-driving car algorithms, such as those based on object detection or collaborative driving. Expanding the evaluation to a wider range of autonomous driving approaches could provide a more comprehensive understanding of the system's capabilities and limitations.

Conclusion

ChatScene represents a significant advancement in the generation of safety-critical scenarios for autonomous vehicle testing and improvement. By bridging the gap between textual descriptions and simulation-ready code, the system provides a powerful tool for creating diverse and challenging traffic scenarios that can help enhance the safety and performance of self-driving cars. As research in this area continues to evolve, ChatScene and similar approaches may play an increasingly important role in ensuring the safe deployment of autonomous vehicles on our roads.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Chat2Scenario: Scenario Extraction From Dataset Through Utilization of Large Language Model

Yongqi Zhao, Wenbo Xiao, Tomislav Mihalj, Jia Hu, Arno Eichberger

The advent of Large Language Models (LLM) provides new insights to validate Automated Driving Systems (ADS). In the herein-introduced work, a novel approach to extracting scenarios from naturalistic driving datasets is presented. A framework called Chat2Scenario is proposed leveraging the advanced Natural Language Processing (NLP) capabilities of LLM to understand and identify different driving scenarios. By inputting descriptive texts of driving conditions and specifying the criticality metric thresholds, the framework efficiently searches for desired scenarios and converts them into ASAM OpenSCENARIO and IPG CarMaker text files. This methodology streamlines the scenario extraction process and enhances efficiency. Simulations are executed to validate the efficiency of the approach. The framework is presented based on a user-friendly web app and is accessible via the following link: https://github.com/ftgTUGraz/Chat2Scenario.

4/29/2024

cs.RO

Editable Scene Simulation for Autonomous Driving via Collaborative LLM-Agents

Yuxi Wei, Zi Wang, Yifan Lu, Chenxin Xu, Changxing Liu, Hao Zhao, Siheng Chen, Yanfeng Wang

Scene simulation in autonomous driving has gained significant attention because of its huge potential for generating customized data. However, existing editable scene simulation approaches face limitations in terms of user interaction efficiency, multi-camera photo-realistic rendering and external digital assets integration. To address these challenges, this paper introduces ChatSim, the first system that enables editable photo-realistic 3D driving scene simulations via natural language commands with external digital assets. To enable editing with high command flexibility, ChatSim leverages a large language model (LLM) agent collaboration framework. To generate photo-realistic outcomes, ChatSim employs a novel multi-camera neural radiance field method. Furthermore, to unleash the potential of extensive high-quality digital assets, ChatSim employs a novel multi-camera lighting estimation method to achieve scene-consistent assets' rendering. Our experiments on Waymo Open Dataset demonstrate that ChatSim can handle complex language commands and generate corresponding photo-realistic scene videos.

6/27/2024

cs.CV

Enhancing Autonomous Vehicle Training with Language Model Integration and Critical Scenario Generation

Hanlin Tian, Kethan Reddy, Yuxiang Feng, Mohammed Quddus, Yiannis Demiris, Panagiotis Angeloudis

This paper introduces CRITICAL, a novel closed-loop framework for autonomous vehicle (AV) training and testing. CRITICAL stands out for its ability to generate diverse scenarios, focusing on critical driving situations that target specific learning and performance gaps identified in the Reinforcement Learning (RL) agent. The framework achieves this by integrating real-world traffic dynamics, driving behavior analysis, surrogate safety measures, and an optional Large Language Model (LLM) component. It is proven that the establishment of a closed feedback loop between the data generation pipeline and the training process can enhance the learning rate during training, elevate overall system performance, and augment safety resilience. Our evaluations, conducted using the Proximal Policy Optimization (PPO) and the HighwayEnv simulation environment, demonstrate noticeable performance improvements with the integration of critical case generation and LLM analysis, indicating CRITICAL's potential to improve the robustness of AV systems and streamline the generation of critical scenarios. This ultimately serves to hasten the development of AV agents, expand the general scope of RL training, and ameliorate validation efforts for AV safety.

4/15/2024

cs.RO cs.AI cs.LG

Scene-Extrapolation: Generating Interactive Traffic Scenarios

Maximilian Zipfl, Barbara Schutt, J. Marius Zollner

Verifying highly automated driving functions can be challenging, requiring identifying relevant test scenarios. Scenario-based testing will likely play a significant role in verifying these systems, predominantly occurring within simulation. In our approach, we use traffic scenes as a starting point (seed-scene) to address the individuality of various highly automated driving functions and to avoid the problems associated with a predefined test traffic scenario. Different highly autonomous driving functions, or their distinct iterations, may display different behaviors under the same operating conditions. To make a generalizable statement about a seed-scene, we simulate possible outcomes based on various behavior profiles. We utilize our lightweight simulation environment and populate it with rule-based and machine learning behavior models for individual actors in the scenario. We analyze resulting scenarios using a variety of criticality metrics. The density distributions of the resulting criticality values enable us to make a profound statement about the significance of a particular scene, considering various eventualities.

4/29/2024

cs.RO