What Matters to Enhance Traffic Rule Compliance of Imitation Learning for End-to-End Autonomous Driving

Read original: arXiv:2309.07808 - Published 9/14/2024 by Hongkuan Zhou, Wei Cao, Aifen Sui, Zhenshan Bing

🤿

Overview

Autonomous driving is an active area of research, with end-to-end approaches gaining attention for their simpler structure and faster inference.
However, these end-to-end models can struggle with adhering to traffic rules, posing safety concerns.
The proposed P-CSG approach aims to address this issue through a penalty-based imitation learning method and cross-modal sensor fusion.

Plain English Explanation

The paper describes a new approach called P-CSG for end-to-end autonomous driving. In traditional autonomous driving systems, there are many different components (e.g., perception, planning, control) that work together to navigate the vehicle. The appeal of end-to-end approaches is that they replace this entire pipeline with a single neural network, simplifying the system and enabling faster decision-making.

However, a key challenge with end-to-end models is that they don't always follow traffic rules, which can lead to safety issues. To address this, the P-CSG method introduces three types of penalties to make the model more sensitive to red lights, stop signs, and speed limits around curves. Additionally, the cross-semantics generation component helps the model better integrate information from different sensor inputs, like cameras and lidar.

The researchers tested P-CSG on benchmark driving scenarios and found it achieved significant performance improvements over baseline models. They also evaluated its robustness to adversarial attacks, where small perturbations are added to the input to try to confuse the model. P-CSG showed substantial increases in robustness compared to other approaches.

Technical Explanation

The core of the P-CSG approach is a penalty-based imitation learning framework that aims to improve traffic rule compliance in end-to-end autonomous driving. The three penalties introduced are:

Red Light Penalty: Penalizes the agent for going through red lights.
Stop Sign Penalty: Penalizes the agent for not stopping at stop signs.
Curvature Speed Penalty: Penalizes the agent for exceeding safe speeds around curves.

These penalties are incorporated into the loss function during training to encourage the model to learn driving behaviors that adhere to traffic rules.

Additionally, the paper proposes a "cross-semantics generation" module that fuses information from different sensor modalities, such as camera images and lidar point clouds. This helps the model better align the shared information across these inputs, improving its overall understanding of the driving environment.

The researchers evaluated P-CSG on the CARLA Leaderboard - Town 05 Long Benchmark and Longest6 Benchmark, where it achieved 8.5% and 2.0% improvements in driving score compared to baseline methods. They also conducted robustness tests against FGSM and Dot adversarial attacks, demonstrating substantial improvements in the model's resilience to these perturbations.

Critical Analysis

The paper presents a promising approach to improving the safety and reliability of end-to-end autonomous driving systems. By incorporating penalties for specific traffic violations, the model is encouraged to learn driving behaviors that are more compliant with traffic rules. The cross-semantics generation component also appears to be an effective way to enhance the model's understanding of the driving environment.

However, the paper does not discuss potential limitations or areas for further research. For example, it would be interesting to explore how the penalty-based approach might handle more nuanced situations, such as navigating complex intersections or yielding to pedestrians. Additionally, the robustness tests focused on specific types of adversarial attacks, and it would be valuable to investigate the model's performance against a wider range of perturbations.

Overall, the P-CSG method represents a meaningful step forward in addressing the safety concerns associated with end-to-end autonomous driving. Continued research and refinement of these techniques could lead to significant advancements in the field.

Conclusion

This paper proposes the P-CSG approach, which combines penalty-based imitation learning and cross-semantics generation sensor fusion to enhance the performance and safety of end-to-end autonomous driving systems. By introducing penalties for violating traffic rules and leveraging multi-modal sensor information, the model demonstrates significant improvements in driving score and robustness to adversarial attacks compared to baseline methods.

The research highlights the importance of addressing safety and compliance concerns in the development of autonomous driving technologies. The P-CSG framework offers a promising solution that could help pave the way for more reliable and trustworthy end-to-end autonomous driving systems.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🤿

What Matters to Enhance Traffic Rule Compliance of Imitation Learning for End-to-End Autonomous Driving

Hongkuan Zhou, Wei Cao, Aifen Sui, Zhenshan Bing

End-to-end autonomous driving, where the entire driving pipeline is replaced with a single neural network, has recently gained research attention because of its simpler structure and faster inference time. Despite this appealing approach largely reducing the complexity in the driving pipeline, it also leads to safety issues because the trained policy is not always compliant with the traffic rules. In this paper, we proposed P-CSG, a penalty-based imitation learning approach with contrastive-based cross semantics generation sensor fusion technologies to increase the overall performance of end-to-end autonomous driving. In this method, we introduce three penalties - red light, stop sign, and curvature speed penalty to make the agent more sensitive to traffic rules. The proposed cross semantics generation helps to align the shared information of different input modalities. We assessed our model's performance using the CARLA Leaderboard - Town 05 Long Benchmark and Longest6 Benchmark, achieving 8.5% and 2.0% driving score improvement compared to the baselines. Furthermore, we conducted robustness evaluations against adversarial attacks like FGSM and Dot attacks, revealing a substantial increase in robustness compared to other baseline models. More detailed information can be found at https://hk-zh.github.io/p-csg-plus.

9/14/2024

Guiding Attention in End-to-End Driving Models

Diego Porres, Yi Xiao, Gabriel Villalonga, Alexandre Levy, Antonio M. L'opez

Vision-based end-to-end driving models trained by imitation learning can lead to affordable solutions for autonomous driving. However, training these well-performing models usually requires a huge amount of data, while still lacking explicit and intuitive activation maps to reveal the inner workings of these models while driving. In this paper, we study how to guide the attention of these models to improve their driving quality and obtain more intuitive activation maps by adding a loss term during training using salient semantic maps. In contrast to previous work, our method does not require these salient semantic maps to be available during testing time, as well as removing the need to modify the model's architecture to which it is applied. We perform tests using perfect and noisy salient semantic maps with encouraging results in both, the latter of which is inspired by possible errors encountered with real data. Using CIL++ as a representative state-of-the-art model and the CARLA simulator with its standard benchmarks, we conduct experiments that show the effectiveness of our method in training better autonomous driving models, especially when data and computational resources are scarce.

5/2/2024

LASIL: Learner-Aware Supervised Imitation Learning For Long-term Microscopic Traffic Simulation

Ke Guo, Zhenwei Miao, Wei Jing, Weiwei Liu, Weizi Li, Dayang Hao, Jia Pan

Microscopic traffic simulation plays a crucial role in transportation engineering by providing insights into individual vehicle behavior and overall traffic flow. However, creating a realistic simulator that accurately replicates human driving behaviors in various traffic conditions presents significant challenges. Traditional simulators relying on heuristic models often fail to deliver accurate simulations due to the complexity of real-world traffic environments. Due to the covariate shift issue, existing imitation learning-based simulators often fail to generate stable long-term simulations. In this paper, we propose a novel approach called learner-aware supervised imitation learning to address the covariate shift problem in multi-agent imitation learning. By leveraging a variational autoencoder simultaneously modeling the expert and learner state distribution, our approach augments expert states such that the augmented state is aware of learner state distribution. Our method, applied to urban traffic simulation, demonstrates significant improvements over existing state-of-the-art baselines in both short-term microscopic and long-term macroscopic realism when evaluated on the real-world dataset pNEUMA.

5/24/2024

🛸

Adversarial Safety-Critical Scenario Generation using Naturalistic Human Driving Priors

Kunkun Hao, Yonggang Luo, Wen Cui, Yuqiao Bai, Jucheng Yang, Songyang Yan, Yuxi Pan, Zijiang Yang

Evaluating the decision-making system is indispensable in developing autonomous vehicles, while realistic and challenging safety-critical test scenarios play a crucial role. Obtaining these scenarios is non-trivial, thanks to the long-tailed distribution, sparsity, and rarity in real-world data sets. To tackle this problem, in this paper, we introduce a natural adversarial scenario generation solution using naturalistic human driving priors and reinforcement learning techniques. By doing this, we can obtain large-scale test scenarios that are both diverse and realistic. Specifically, we build a simulation environment that mimics natural traffic interaction scenarios. Informed by this environment, we implement a two-stage procedure. The first stage incorporates conventional rule-based models, e.g., IDM~(Intelligent Driver Model) and MOBIL~(Minimizing Overall Braking Induced by Lane changes) model, to coarsely and discretely capture and calibrate key control parameters from the real-world dataset. Next, we leverage GAIL~(Generative Adversarial Imitation Learning) to represent driver behaviors continuously. The derived GAIL can be further used to design a PPO~(Proximal Policy Optimization)-based actor-critic network framework to fine-tune the reward function, and then optimizes our natural adversarial scenario generation solution. Extensive experiments have been conducted in the NGSIM dataset including the trajectory of 3,000 vehicles. Essential traffic parameters were measured in comparison with the baseline model, e.g., the collision rate, accelerations, steering, and the number of lane changes. Our findings demonstrate that the proposed model can generate realistic safety-critical test scenarios covering both naturalness and adversariality, which can be a cornerstone for the development of autonomous vehicles.

8/9/2024