RealGen: Retrieval Augmented Generation for Controllable Traffic Scenarios

Read original: arXiv:2312.13303 - Published 8/14/2024 by Wenhao Ding, Yulong Cao, Ding Zhao, Chaowei Xiao, Marco Pavone

RealGen: Retrieval Augmented Generation for Controllable Traffic Scenarios

Overview

The paper proposes a novel approach called RealGen for generating realistic and controllable traffic scenarios.
RealGen leverages retrieval-augmented generation, where a retrieval module is used to complement a language model for scenario generation.
The system allows users to control various aspects of the traffic scene, such as the number of vehicles, their types, and their behaviors.

Plain English Explanation

The paper presents a new system called [RealGen](link to section) that can generate realistic and customizable traffic scenarios. The key idea is to combine a [language model](link to section) with a [retrieval module](link to section) to produce these scenarios.

The language model is responsible for the overall structure and flow of the traffic scene, while the retrieval module pulls in relevant details from a database of real-world traffic data. This allows the system to create scenarios that are both coherent and grounded in reality.

One of the main advantages of RealGen is that it gives users a high degree of control over the traffic scenes. [Users can specify](link to section) things like the number of vehicles, the types of vehicles, and how the vehicles behave. This makes the system useful for a variety of applications, such as testing self-driving car algorithms or training traffic simulation models.

Technical Explanation

The paper introduces a system called RealGen that generates realistic and controllable traffic scenarios. The core of the approach is a [retrieval-augmented generation](link to section) framework, which combines a [language model](link to section) with a [retrieval module](link to section).

The language model is responsible for the overall structure and flow of the traffic scene, determining high-level aspects like the number of vehicles, their types, and their behaviors. The retrieval module then complements this by pulling in relevant details from a database of real-world traffic data, ensuring the scenarios have a grounded, realistic feel.

A key innovation is the ability for [users to control](link to section) various aspects of the generated traffic scenes. Through a set of input parameters, users can specify details like the number of vehicles, the distribution of vehicle types, and the behaviors of the vehicles. This gives the system a high degree of controllability, making it useful for a range of applications.

The paper evaluates RealGen through both quantitative and qualitative experiments. The authors demonstrate that the system can generate diverse, realistic traffic scenarios that match user specifications. They also show that the retrieval-augmented approach outperforms purely generative baselines in terms of realism and controllability.

Critical Analysis

The paper makes a compelling case for the value of RealGen in generating realistic and controllable traffic scenarios. The [retrieval-augmented generation](link to section) approach is a well-designed solution to the challenges of balancing coherence and realism.

One potential limitation is the reliance on a fixed database of traffic data. While this ensures the scenarios are grounded in reality, it may limit the system's ability to generate truly novel or unexpected traffic patterns. [Expanding the data sources](link to section) or incorporating more advanced generative techniques could help address this.

Additionally, the paper does not delve deeply into the specific algorithms or architectural details of the retrieval module and its integration with the language model. [More technical insight](link to section) into these components would be helpful for readers interested in replicating or building upon the work.

Overall, RealGen represents a valuable contribution to the field of traffic scenario generation. The [high degree of user control](link to section) and the demonstrated performance improvements over baselines make it a promising tool for a variety of applications. Continued research and refinement of the approach could further enhance its capabilities and impact.

Conclusion

The RealGen system presented in this paper offers a novel approach to generating realistic and controllable traffic scenarios. By [combining a language model with a retrieval module](link to section), the system is able to produce coherent and grounded traffic scenes that can be customized to user specifications.

The [high degree of controllability](link to section) provided by RealGen makes it a valuable tool for applications such as testing self-driving car algorithms, training traffic simulation models, and exploring traffic safety scenarios. The retrieval-augmented approach demonstrated in this paper represents an important step forward in the field of traffic scenario generation.

As the research and development of autonomous vehicles and smart transportation systems continues to advance, tools like RealGen will become increasingly important for ensuring the safety and reliability of these technologies. The insights and techniques presented in this paper lay the groundwork for further advancements in this critical area.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

RealGen: Retrieval Augmented Generation for Controllable Traffic Scenarios

Wenhao Ding, Yulong Cao, Ding Zhao, Chaowei Xiao, Marco Pavone

Simulation plays a crucial role in the development of autonomous vehicles (AVs) due to the potential risks associated with real-world testing. Although significant progress has been made in the visual aspects of simulators, generating complex behavior among agents remains a formidable challenge. It is not only imperative to ensure realism in the scenarios generated but also essential to incorporate preferences and conditions to facilitate controllable generation for AV training and evaluation. Traditional methods, mainly relying on memorizing the distribution of training datasets, often fall short in generating unseen scenarios. Inspired by the success of retrieval augmented generation in large language models, we present RealGen, a novel retrieval-based in-context learning framework for traffic scenario generation. RealGen synthesizes new scenarios by combining behaviors from multiple retrieved examples in a gradient-free way, which may originate from templates or tagged scenarios. This in-context learning framework endows versatile generative capabilities, including the ability to edit scenarios, compose various behaviors, and produce critical scenarios. Evaluations show that RealGen offers considerable flexibility and controllability, marking a new direction in the field of controllable traffic scenario generation. Check our project website for more information: https://realgen.github.io.

8/14/2024

ReGentS: Real-World Safety-Critical Driving Scenario Generation Made Stable

Yuan Yin, Pegah Khayatan, 'Eloi Zablocki, Alexandre Boulch, Matthieu Cord

Machine learning based autonomous driving systems often face challenges with safety-critical scenarios that are rare in real-world data, hindering their large-scale deployment. While increasing real-world training data coverage could address this issue, it is costly and dangerous. This work explores generating safety-critical driving scenarios by modifying complex real-world regular scenarios through trajectory optimization. We propose ReGentS, which stabilizes generated trajectories and introduces heuristics to avoid obvious collisions and optimization problems. Our approach addresses unrealistic diverging trajectories and unavoidable collision scenarios that are not useful for training robust planner. We also extend the scenario generation framework to handle real-world data with up to 32 agents. Additionally, by using a differentiable simulator, our approach simplifies gradient descent-based optimization involving a simulator, paving the way for future advancements. The code is available at https://github.com/valeoai/ReGentS.

9/14/2024

SimGen: Simulator-conditioned Driving Scene Generation

Yunsong Zhou, Michael Simon, Zhenghao Peng, Sicheng Mo, Hongzi Zhu, Minyi Guo, Bolei Zhou

Controllable synthetic data generation can substantially lower the annotation cost of training data in autonomous driving research and development. Prior works use diffusion models to generate driving images conditioned on the 3D object layout. However, those models are trained on small-scale datasets like nuScenes, which lack appearance and layout diversity. Moreover, the trained models can only generate images based on the real-world layout data from the validation set of the same dataset, where overfitting might happen. In this work, we introduce a simulator-conditioned scene generation framework called SimGen that can learn to generate diverse driving scenes by mixing data from the simulator and the real world. It uses a novel cascade diffusion pipeline to address challenging sim-to-real gaps and multi-condition conflicts. A driving video dataset DIVA is collected to enhance the generative diversity of SimGen, which contains over 147.5 hours of real-world driving videos from 73 locations worldwide and simulated driving data from the MetaDrive simulator. SimGen achieves superior generation quality and diversity while preserving controllability based on the text prompt and the layout pulled from a simulator. We further demonstrate the improvements brought by SimGen for synthetic data augmentation on the BEV detection and segmentation task and showcase its capability in safety-critical data generation. Code, data, and models will be made available.

6/14/2024

Multimodal Large Language Model Driven Scenario Testing for Autonomous Vehicles

Qiujing Lu, Xuanhan Wang, Yiwei Jiang, Guangming Zhao, Mingyue Ma, Shuo Feng

The generation of corner cases has become increasingly crucial for efficiently testing autonomous vehicles prior to road deployment. However, existing methods struggle to accommodate diverse testing requirements and often lack the ability to generalize to unseen situations, thereby reducing the convenience and usability of the generated scenarios. A method that facilitates easily controllable scenario generation for efficient autonomous vehicles (AV) testing with realistic and challenging situations is greatly needed. To address this, we proposed OmniTester: a multimodal Large Language Model (LLM) based framework that fully leverages the extensive world knowledge and reasoning capabilities of LLMs. OmniTester is designed to generate realistic and diverse scenarios within a simulation environment, offering a robust solution for testing and evaluating AVs. In addition to prompt engineering, we employ tools from Simulation of Urban Mobility to simplify the complexity of codes generated by LLMs. Furthermore, we incorporate Retrieval-Augmented Generation and a self-improvement mechanism to enhance the LLM's understanding of scenarios, thereby increasing its ability to produce more realistic scenes. In the experiments, we demonstrated the controllability and realism of our approaches in generating three types of challenging and complex scenarios. Additionally, we showcased its effectiveness in reconstructing new scenarios described in crash report, driven by the generalization capability of LLMs.

9/11/2024