V2X Cooperative Perception for Autonomous Driving: Recent Advances and Challenges

2310.03525

Published 5/10/2024 by Tao Huang, Jianan Liu, Xi Zhou, Dinh C. Nguyen, Mostafa Rahimi Azghadi, Yuxuan Xia, Qing-Long Han, Sumei Sun

cs.CV

📶

Abstract

Accurate perception is essential for advancing autonomous driving and addressing safety challenges in modern transportation systems. Despite significant advancements in computer vision for object recognition, current perception methods still face difficulties in complex real-world traffic environments. Challenges such as physical occlusion and limited sensor field of view persist for individual vehicle systems. Cooperative Perception (CP) with Vehicle-to-Everything (V2X) technologies has emerged as a solution to overcome these obstacles and enhance driving automation systems. While some research has explored CP's fundamental architecture and critical components, there remains a lack of comprehensive summaries of the latest innovations, particularly in the context of V2X communication technologies. To address this gap, this paper provides a comprehensive overview of the evolution of CP technologies, spanning from early explorations to recent developments, including advancements in V2X communication technologies. Additionally, a contemporary generic framework is also proposed to illustrate the V2X-based CP workflow, aiding in the structured understanding of CP system components. Furthermore, this paper categorizes prevailing V2X-based CP methodologies based on the critical issues they address. An extensive literature review is conducted within this taxonomy, evaluating existing datasets and simulators. Finally, open challenges and future directions in CP for autonomous driving are discussed by considering both perception and V2X communication advancements.

Create account to get full access

Overview

Autonomous driving systems face challenges in complex real-world traffic environments, such as physical occlusion and limited sensor field of view.
Cooperative Perception (CP) using Vehicle-to-Everything (V2X) communication technologies has emerged as a solution to overcome these obstacles and enhance driving automation systems.
This paper provides a comprehensive overview of the evolution of CP technologies, spanning from early explorations to recent developments, including advancements in V2X communication technologies.
The paper proposes a contemporary generic framework to illustrate the V2X-based CP workflow and categorizes prevailing V2X-based CP methodologies based on the critical issues they address.
The paper also reviews existing datasets and simulators and discusses open challenges and future directions in CP for autonomous driving, considering both perception and V2X communication advancements.

Plain English Explanation

Autonomous driving systems, like self-driving cars, rely on computer vision and sensors to recognize objects around the vehicle. However, these systems can still face difficulties in complex real-world traffic situations, such as when vehicles or other objects are partially blocked from view or when the sensors have a limited field of view.

To address these challenges, researchers have developed a technique called Cooperative Perception (CP). CP uses Vehicle-to-Everything (V2X) communication technologies to allow vehicles to share information with each other and with infrastructure like traffic lights. This allows the vehicles to "see" beyond their own sensors and get a more complete understanding of the surrounding environment.

This paper provides a comprehensive overview of how CP and V2X technologies have evolved over time, from early experiments to the latest advancements. The paper also proposes a general framework to help understand how V2X-based CP systems work and categorizes different approaches based on the specific challenges they aim to address.

Additionally, the paper reviews the existing datasets and simulation tools used to test and develop these technologies. Finally, the paper discusses the remaining challenges and future directions for improving CP and V2X to support more reliable and advanced autonomous driving systems.

Technical Explanation

The paper begins by highlighting the importance of accurate perception for advancing autonomous driving and addressing safety challenges in modern transportation systems. While computer vision has made significant progress in object recognition, current perception methods still face difficulties in complex real-world traffic environments.

To overcome the limitations of individual vehicle sensors, the paper explores the use of Cooperative Perception (CP) enabled by Vehicle-to-Everything (V2X) communication technologies. The paper provides a comprehensive overview of the evolution of CP technologies, from early explorations to recent developments, including advancements in V2X communication.

The paper proposes a contemporary generic framework to illustrate the V2X-based CP workflow, which includes components such as data acquisition, data fusion, and decision-making. This framework helps to provide a structured understanding of the different system components involved in V2X-based CP.

Furthermore, the paper categorizes prevailing V2X-based CP methodologies based on the critical issues they address, such as object detection and tracking, localization and mapping, and collaborative perception. An extensive literature review is conducted within this taxonomy, evaluating existing datasets and simulators used for testing and development.

Finally, the paper discusses open challenges and future directions in CP for autonomous driving, considering both perception and V2X communication advancements. This includes exploring techniques to handle sensor imperfections, improving data fusion algorithms, and enhancing the resilience and reliability of V2X communication.

Critical Analysis

The paper provides a comprehensive and well-structured overview of the evolution of Cooperative Perception (CP) technologies for autonomous driving, highlighting the importance of overcoming the limitations of individual vehicle sensors through the use of Vehicle-to-Everything (V2X) communication.

The proposed generic framework for V2X-based CP is a valuable contribution, as it helps to establish a common understanding of the different system components and their interactions. This framework can serve as a foundation for further research and development in this area.

However, the paper acknowledges that there are still significant challenges and open questions to be addressed, such as handling sensor imperfections, improving data fusion algorithms, and enhancing the reliability of V2X communication. These are crucial areas that will require further research and innovation to fully realize the potential of CP in autonomous driving.

Additionally, while the paper provides a thorough review of existing datasets and simulators, it would be beneficial to have a more in-depth discussion on the limitations and shortcomings of these tools. This could help identify areas where new or improved evaluation resources are needed to support the development and testing of advanced CP systems.

Overall, the paper offers a valuable and comprehensive overview of the state of the art in Cooperative Perception for autonomous driving, and it highlights the importance of continued research and development in this rapidly evolving field.

Conclusion

This paper provides a comprehensive overview of the evolution of Cooperative Perception (CP) technologies for autonomous driving, focusing on the advancements in Vehicle-to-Everything (V2X) communication. The paper highlights the importance of overcoming the limitations of individual vehicle sensors through the use of CP and V2X, which can enhance the perception capabilities of autonomous driving systems.

The proposed generic framework for V2X-based CP and the categorization of prevailing methodologies offer a structured understanding of the different system components and the critical issues they address. The extensive literature review and evaluation of existing datasets and simulators further contribute to the understanding of the current state of the art in this field.

While significant progress has been made, the paper also identifies open challenges and future directions, such as handling sensor imperfections, improving data fusion algorithms, and enhancing the reliability of V2X communication. Addressing these challenges will be crucial for the continued advancement of Cooperative Perception and its integration into autonomous driving systems, ultimately leading to safer and more robust transportation solutions.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Enhanced Cooperative Perception for Autonomous Vehicles Using Imperfect Communication

Ahmad Sarlak, Hazim Alzorgan, Sayed Pedram Haeri Boroujeni, Abolfazl Razi, Rahul Amin

Sharing and joint processing of camera feeds and sensor measurements, known as Cooperative Perception (CP), has emerged as a new technique to achieve higher perception qualities. CP can enhance the safety of Autonomous Vehicles (AVs) where their individual visual perception quality is compromised by adverse weather conditions (haze as foggy weather), low illumination, winding roads, and crowded traffic. To cover the limitations of former methods, in this paper, we propose a novel approach to realize an optimized CP under constrained communications. At the core of our approach is recruiting the best helper from the available list of front vehicles to augment the visual range and enhance the Object Detection (OD) accuracy of the ego vehicle. In this two-step process, we first select the helper vehicles that contribute the most to CP based on their visual range and lowest motion blur. Next, we implement a radio block optimization among the candidate vehicles to further improve communication efficiency. We specifically focus on pedestrian detection as an exemplary scenario. To validate our approach, we used the CARLA simulator to create a dataset of annotated videos for different driving scenarios where pedestrian detection is challenging for an AV with compromised vision. Our results demonstrate the efficacy of our two-step optimization process in improving the overall performance of cooperative perception in challenging scenarios, substantially improving driving safety under adverse conditions. Finally, we note that the networking assumptions are adopted from LTE Release 14 Mode 4 side-link communication, commonly used for Vehicle-to-Vehicle (V2V) communication. Nonetheless, our method is flexible and applicable to arbitrary V2V communications.

4/15/2024

cs.CV cs.AI cs.LG

Towards Collaborative Autonomous Driving: Simulation Platform and End-to-End System

Genjia Liu, Yue Hu, Chenxin Xu, Weibo Mao, Junhao Ge, Zhengxiang Huang, Yifan Lu, Yinda Xu, Junkai Xia, Yafei Wang, Siheng Chen

Vehicle-to-everything-aided autonomous driving (V2X-AD) has a huge potential to provide a safer driving solution. Despite extensive researches in transportation and communication to support V2X-AD, the actual utilization of these infrastructures and communication resources in enhancing driving performances remains largely unexplored. This highlights the necessity of collaborative autonomous driving: a machine learning approach that optimizes the information sharing strategy to improve the driving performance of each vehicle. This effort necessitates two key foundations: a platform capable of generating data to facilitate the training and testing of V2X-AD, and a comprehensive system that integrates full driving-related functionalities with mechanisms for information sharing. From the platform perspective, we present V2Xverse, a comprehensive simulation platform for collaborative autonomous driving. This platform provides a complete pipeline for collaborative driving. From the system perspective, we introduce CoDriving, a novel end-to-end collaborative driving system that properly integrates V2X communication over the entire autonomous pipeline, promoting driving with shared perceptual information. The core idea is a novel driving-oriented communication strategy. Leveraging this strategy, CoDriving improves driving performance while optimizing communication efficiency. We make comprehensive benchmarks with V2Xverse, analyzing both modular performance and closed-loop driving performance. Experimental results show that CoDriving: i) significantly improves the driving score by 62.49% and drastically reduces the pedestrian collision rate by 53.50% compared to the SOTA end-to-end driving method, and ii) achieves sustaining driving performance superiority over dynamic constraint communication conditions.

4/16/2024

cs.CV

End-to-End Autonomous Driving through V2X Cooperation

Haibao Yu, Wenxian Yang, Jiaru Zhong, Zhenwei Yang, Siqi Fan, Ping Luo, Zaiqing Nie

Cooperatively utilizing both ego-vehicle and infrastructure sensor data via V2X communication has emerged as a promising approach for advanced autonomous driving. However, current research mainly focuses on improving individual modules, rather than taking end-to-end learning to optimize final planning performance, resulting in underutilized data potential. In this paper, we introduce UniV2X, a pioneering cooperative autonomous driving framework that seamlessly integrates all key driving modules across diverse views into a unified network. We propose a sparse-dense hybrid data transmission and fusion mechanism for effective vehicle-infrastructure cooperation, offering three advantages: 1) Effective for simultaneously enhancing agent perception, online mapping, and occupancy prediction, ultimately improving planning performance. 2) Transmission-friendly for practical and limited communication conditions. 3) Reliable data fusion with interpretability of this hybrid data. We implement UniV2X, as well as reproducing several benchmark methods, on the challenging DAIR-V2X, the real-world cooperative driving dataset. Experimental results demonstrate the effectiveness of UniV2X in significantly enhancing planning performance, as well as all intermediate output performance. Code is at https://github.com/AIR-THU/UniV2X.

4/23/2024

cs.RO cs.CV cs.MA

Unified End-to-End V2X Cooperative Autonomous Driving

Zhiwei Li, Bozhen Zhang, Lei Yang, Tianyu Shen, Nuo Xu, Ruosen Hao, Weiting Li, Tao Yan, Huaping Liu

V2X cooperation, through the integration of sensor data from both vehicles and infrastructure, is considered a pivotal approach to advancing autonomous driving technology. Current research primarily focuses on enhancing perception accuracy, often overlooking the systematic improvement of accident prediction accuracy through end-to-end learning, leading to insufficient attention to the safety issues of autonomous driving. To address this challenge, this paper introduces the UniE2EV2X framework, a V2X-integrated end-to-end autonomous driving system that consolidates key driving modules within a unified network. The framework employs a deformable attention-based data fusion strategy, effectively facilitating cooperation between vehicles and infrastructure. The main advantages include: 1) significantly enhancing agents' perception and motion prediction capabilities, thereby improving the accuracy of accident predictions; 2) ensuring high reliability in the data fusion process; 3) superior end-to-end perception compared to modular approaches. Furthermore, We implement the UniE2EV2X framework on the challenging DeepAccident, a simulation dataset designed for V2X cooperative driving.

5/8/2024

cs.CV cs.MA