An Autonomous Driving Model Integrated with BEV-V2X Perception, Fusion Prediction of Motion and Occupancy, and Driving Planning, in Complex Traffic Intersections

2312.05104

Published 4/23/2024 by Fukang Li, Wenlin Ou, Kunpeng Gao, Yuwen Pang, Yifei Li, Henry Fan

📈

Abstract

The comprehensiveness of vehicle-to-everything (V2X) recognition enriches and holistically shapes the global Birds-Eye-View (BEV) perception, incorporating rich semantics and integrating driving scene information, thereby serving features of vehicle state prediction, decision-making and driving planning. Utilizing V2X message sets to form BEV map proves to be an effective perception method for connected and automated vehicles (CAVs). Specifically, Map Msg. (MAP), Signal Phase And Timing (SPAT) and Roadside Information (RSI) contributes to the achievement of road connectivity, synchronized traffic signal navigation and obstacle warning. Moreover, harnessing time-sequential Basic Safety Msg. (BSM) data from multiple vehicles allows for the real-time perception and future state prediction. Therefore, this paper develops a comprehensive autonomous driving model that relies on BEV-V2X perception, Interacting Multiple model Unscented Kalman Filter (IMM-UKF)-based fusion prediction, and deep reinforcement learning (DRL)-based decision making and planning. We integrated them into a DRL environment to develop an optimal set of unified driving behaviors that encompass obstacle avoidance, lane changes, overtaking, turning maneuver, and synchronized traffic signal navigation. Consequently, a complex traffic intersection scenario was simulated, and the well-trained model was applied for driving planning. The observed driving behavior closely resembled that of an experienced driver, exhibiting anticipatory actions and revealing notable operational highlights of driving policy.

Create account to get full access

Overview

Explores a comprehensive autonomous driving model that leverages vehicle-to-everything (V2X) communication and perception, Interacting Multiple Model Unscented Kalman Filter (IMM-UKF) fusion prediction, and deep reinforcement learning (DRL)-based decision making and planning.
Aims to achieve features like obstacle avoidance, lane changes, overtaking, turning maneuvers, and synchronized traffic signal navigation.
Validated the model's performance in a complex traffic intersection scenario, demonstrating anticipatory actions and an experienced driver-like driving policy.

Plain English Explanation

The paper discusses a comprehensive autonomous driving system that combines several key technologies to enable advanced driving capabilities. At the core of this system is the use of V2X communication, where vehicles exchange information with their surroundings, including other vehicles, traffic signals, and infrastructure. This V2X data allows the system to build a detailed "birds-eye-view" of the driving environment, incorporating rich semantics and information about the state of the road, traffic signals, and obstacles.

To make sense of this complex data and predict the future behavior of other vehicles, the system employs an Interacting Multiple Model Unscented Kalman Filter (IMM-UKF). This fusion algorithm combines information from multiple sources to provide a robust and accurate understanding of the current and future state of the driving scene.

Finally, the system uses deep reinforcement learning (DRL) to make decisions and plan the vehicle's actions. The DRL model is trained in a simulated environment to learn how to navigate complex traffic scenarios, including obstacle avoidance, lane changes, overtaking, turning maneuvers, and coordinating with traffic signals.

The researchers demonstrated the effectiveness of this comprehensive approach by simulating a complex traffic intersection scenario. The well-trained model exhibited anticipatory actions and an experienced driver-like driving policy, suggesting that this integrated system can enable highly capable and safe autonomous driving.

Technical Explanation

The paper presents a comprehensive autonomous driving model that leverages vehicle-to-everything (V2X) communication and perception, Interacting Multiple Model Unscented Kalman Filter (IMM-UKF)-based fusion prediction, and deep reinforcement learning (DRL)-based decision making and planning.

The researchers utilize various V2X message sets, such as Map Msg. (MAP), Signal Phase And Timing (SPAT), and Roadside Information (RSI), to form a comprehensive "birds-eye-view" (BEV) perception of the driving scene. This perception incorporates rich semantics and integrates information about the road connectivity, traffic signal status, and potential obstacles. Additionally, the system leverages time-sequential Basic Safety Msg. (BSM) data from multiple vehicles to enable real-time perception and future state prediction.

The IMM-UKF-based fusion prediction module combines the V2X-derived information with other sensor data to provide a robust and accurate understanding of the current and future state of the driving environment. This includes predicting the behavior of other vehicles and potential obstacles.

The DRL-based decision-making and planning component is trained in a simulated environment to learn optimal driving behaviors that encompass obstacle avoidance, lane changes, overtaking, turning maneuvers, and synchronized traffic signal navigation. The researchers integrated all these components into a unified DRL environment to develop an optimal set of driving behaviors.

The model's performance was evaluated in a complex traffic intersection scenario, where the well-trained system exhibited anticipatory actions and an experienced driver-like driving policy, demonstrating its effectiveness in handling challenging driving situations.

Critical Analysis

The paper presents a comprehensive and integrated approach to autonomous driving, leveraging various state-of-the-art technologies, including V2X communication, sensor fusion, and deep reinforcement learning. The researchers have addressed several critical aspects of autonomous driving, such as perception, prediction, decision-making, and planning, in a holistic manner.

One potential limitation of the research is the reliance on a simulated environment for training and evaluation. While the simulation is designed to mimic real-world traffic conditions, there may be discrepancies between the simulated and actual driving scenarios. It would be valuable to further validate the model's performance in real-world experiments or in-depth case studies to assess its robustness and generalization capabilities.

Additionally, the paper does not provide extensive details on the specific algorithms or architectures used for the IMM-UKF fusion and the DRL-based decision-making. A more in-depth technical explanation of these components would help readers gain a deeper understanding of the underlying mechanisms and potentially identify areas for further optimization or innovation.

Furthermore, the paper does not address the potential challenges or limitations of the V2X communication infrastructure, such as network reliability, latency, or security considerations. These aspects may have a significant impact on the practical implementation and deployment of such a comprehensive autonomous driving system.

Overall, the research presented in the paper is a promising step towards developing highly capable and safe autonomous driving systems. However, further research and real-world validation would be valuable to fully assess the viability and scalability of the proposed approach.

Conclusion

This paper proposes a comprehensive autonomous driving model that leverages vehicle-to-everything (V2X) communication and perception, Interacting Multiple Model Unscented Kalman Filter (IMM-UKF)-based fusion prediction, and deep reinforcement learning (DRL)-based decision making and planning. The integration of these technologies enables advanced autonomous driving features, such as obstacle avoidance, lane changes, overtaking, turning maneuvers, and synchronized traffic signal navigation.

The researchers have demonstrated the effectiveness of their approach by simulating a complex traffic intersection scenario, where the well-trained model exhibited anticipatory actions and an experienced driver-like driving policy. This comprehensive system holds the potential to significantly enhance the safety and capabilities of autonomous vehicles, paving the way for their widespread adoption.

However, further research and real-world validation are necessary to address the potential limitations, such as the reliance on simulation, the need for more detailed technical explanations, and the consideration of practical V2X infrastructure challenges. Addressing these aspects will be crucial for the successful deployment and widespread acceptance of such advanced autonomous driving systems.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Unified End-to-End V2X Cooperative Autonomous Driving

Zhiwei Li, Bozhen Zhang, Lei Yang, Tianyu Shen, Nuo Xu, Ruosen Hao, Weiting Li, Tao Yan, Huaping Liu

V2X cooperation, through the integration of sensor data from both vehicles and infrastructure, is considered a pivotal approach to advancing autonomous driving technology. Current research primarily focuses on enhancing perception accuracy, often overlooking the systematic improvement of accident prediction accuracy through end-to-end learning, leading to insufficient attention to the safety issues of autonomous driving. To address this challenge, this paper introduces the UniE2EV2X framework, a V2X-integrated end-to-end autonomous driving system that consolidates key driving modules within a unified network. The framework employs a deformable attention-based data fusion strategy, effectively facilitating cooperation between vehicles and infrastructure. The main advantages include: 1) significantly enhancing agents' perception and motion prediction capabilities, thereby improving the accuracy of accident predictions; 2) ensuring high reliability in the data fusion process; 3) superior end-to-end perception compared to modular approaches. Furthermore, We implement the UniE2EV2X framework on the challenging DeepAccident, a simulation dataset designed for V2X cooperative driving.

5/8/2024

cs.CV cs.MA

End-to-End Autonomous Driving through V2X Cooperation

Haibao Yu, Wenxian Yang, Jiaru Zhong, Zhenwei Yang, Siqi Fan, Ping Luo, Zaiqing Nie

Cooperatively utilizing both ego-vehicle and infrastructure sensor data via V2X communication has emerged as a promising approach for advanced autonomous driving. However, current research mainly focuses on improving individual modules, rather than taking end-to-end learning to optimize final planning performance, resulting in underutilized data potential. In this paper, we introduce UniV2X, a pioneering cooperative autonomous driving framework that seamlessly integrates all key driving modules across diverse views into a unified network. We propose a sparse-dense hybrid data transmission and fusion mechanism for effective vehicle-infrastructure cooperation, offering three advantages: 1) Effective for simultaneously enhancing agent perception, online mapping, and occupancy prediction, ultimately improving planning performance. 2) Transmission-friendly for practical and limited communication conditions. 3) Reliable data fusion with interpretability of this hybrid data. We implement UniV2X, as well as reproducing several benchmark methods, on the challenging DAIR-V2X, the real-world cooperative driving dataset. Experimental results demonstrate the effectiveness of UniV2X in significantly enhancing planning performance, as well as all intermediate output performance. Code is at https://github.com/AIR-THU/UniV2X.

4/23/2024

cs.RO cs.CV cs.MA

Towards Collaborative Autonomous Driving: Simulation Platform and End-to-End System

Genjia Liu, Yue Hu, Chenxin Xu, Weibo Mao, Junhao Ge, Zhengxiang Huang, Yifan Lu, Yinda Xu, Junkai Xia, Yafei Wang, Siheng Chen

Vehicle-to-everything-aided autonomous driving (V2X-AD) has a huge potential to provide a safer driving solution. Despite extensive researches in transportation and communication to support V2X-AD, the actual utilization of these infrastructures and communication resources in enhancing driving performances remains largely unexplored. This highlights the necessity of collaborative autonomous driving: a machine learning approach that optimizes the information sharing strategy to improve the driving performance of each vehicle. This effort necessitates two key foundations: a platform capable of generating data to facilitate the training and testing of V2X-AD, and a comprehensive system that integrates full driving-related functionalities with mechanisms for information sharing. From the platform perspective, we present V2Xverse, a comprehensive simulation platform for collaborative autonomous driving. This platform provides a complete pipeline for collaborative driving. From the system perspective, we introduce CoDriving, a novel end-to-end collaborative driving system that properly integrates V2X communication over the entire autonomous pipeline, promoting driving with shared perceptual information. The core idea is a novel driving-oriented communication strategy. Leveraging this strategy, CoDriving improves driving performance while optimizing communication efficiency. We make comprehensive benchmarks with V2Xverse, analyzing both modular performance and closed-loop driving performance. Experimental results show that CoDriving: i) significantly improves the driving score by 62.49% and drastically reduces the pedestrian collision rate by 53.50% compared to the SOTA end-to-end driving method, and ii) achieves sustaining driving performance superiority over dynamic constraint communication conditions.

4/16/2024

cs.CV

📶

V2X Cooperative Perception for Autonomous Driving: Recent Advances and Challenges

Tao Huang, Jianan Liu, Xi Zhou, Dinh C. Nguyen, Mostafa Rahimi Azghadi, Yuxuan Xia, Qing-Long Han, Sumei Sun

Accurate perception is essential for advancing autonomous driving and addressing safety challenges in modern transportation systems. Despite significant advancements in computer vision for object recognition, current perception methods still face difficulties in complex real-world traffic environments. Challenges such as physical occlusion and limited sensor field of view persist for individual vehicle systems. Cooperative Perception (CP) with Vehicle-to-Everything (V2X) technologies has emerged as a solution to overcome these obstacles and enhance driving automation systems. While some research has explored CP's fundamental architecture and critical components, there remains a lack of comprehensive summaries of the latest innovations, particularly in the context of V2X communication technologies. To address this gap, this paper provides a comprehensive overview of the evolution of CP technologies, spanning from early explorations to recent developments, including advancements in V2X communication technologies. Additionally, a contemporary generic framework is also proposed to illustrate the V2X-based CP workflow, aiding in the structured understanding of CP system components. Furthermore, this paper categorizes prevailing V2X-based CP methodologies based on the critical issues they address. An extensive literature review is conducted within this taxonomy, evaluating existing datasets and simulators. Finally, open challenges and future directions in CP for autonomous driving are discussed by considering both perception and V2X communication advancements.

5/10/2024

cs.CV