Multi-V2X: A Large Scale Multi-modal Multi-penetration-rate Dataset for Cooperative Perception

Read original: arXiv:2409.04980 - Published 9/10/2024 by Rongsong Li, Xin Pei

Multi-V2X: A Large Scale Multi-modal Multi-penetration-rate Dataset for Cooperative Perception

Overview

The paper presents a large-scale dataset called Multi-V2X for multi-modal and multi-penetration-rate cooperative perception in autonomous driving.
The dataset includes sensor data from multiple vehicles at different penetration rates, enabling research on V2X (vehicle-to-everything) communication and collaborative perception.
The dataset is designed to advance the development of cooperative perception algorithms for autonomous driving.

Plain English Explanation

The researchers have created a new dataset called Multi-V2X that can be used to develop and test cooperative perception technologies for self-driving cars. Cooperative perception is when vehicles share sensor data with each other to get a better understanding of their surroundings.

The Multi-V2X dataset includes data from sensors on multiple vehicles, like cameras, radar, and lidar. Importantly, the dataset models different penetration rates, which means not all vehicles have the same sensor equipment. This realistic scenario is important for testing how cooperative perception systems perform when some vehicles have more advanced sensors than others.

By providing this diverse and large-scale dataset, the researchers hope to accelerate progress in cooperative perception for autonomous driving. Being able to share sensor data between cars can help self-driving vehicles better understand their environment, improving safety and capabilities.

Technical Explanation

The Multi-V2X dataset captures multi-modal sensor data, including camera, radar, and lidar, from multiple vehicles driving in a variety of real-world environments. Crucially, the dataset models different penetration rates of V2X (vehicle-to-everything) communication technology, where not all vehicles have the same sensor suites.

This heterogeneous setup allows researchers to develop and evaluate cooperative perception algorithms that can handle partial sensor coverage and uneven communication capabilities across a fleet of vehicles. The dataset provides ground truth annotations for object detection, tracking, and other perception tasks to enable comprehensive benchmarking.

The scale and diversity of the Multi-V2X dataset is intended to spur progress in multi-modal, multi-agent collaborative perception for autonomous driving. By capturing realistic variations in sensor modalities and communication penetration, the dataset aims to bridge the gap between laboratory settings and real-world deployment of cooperative perception systems.

Critical Analysis

The Multi-V2X dataset provides a valuable contribution to the field of autonomous driving by addressing the need for diverse, realistic datasets to develop and evaluate cooperative perception algorithms. The modeling of varied sensor suites and communication penetration rates is a key strength that reflects real-world deployment challenges.

However, the dataset is limited to a specific geographic region, and the authors acknowledge that further diversification across environmental conditions, traffic situations, and vehicle types would be beneficial. Additionally, the paper does not provide details on the quality and fidelity of the sensor data, which could impact the relevance and applicability of the dataset.

Researchers using the Multi-V2X dataset should also be mindful of potential biases or blind spots in the data, and consider complementing it with other datasets to gain a more comprehensive understanding of cooperative perception challenges.

Conclusion

The Multi-V2X dataset represents a significant step forward in providing a large-scale, multi-modal, and multi-penetration-rate dataset for cooperative perception research in autonomous driving. By capturing the heterogeneity of real-world vehicle sensor suites and communication capabilities, the dataset enables the development and evaluation of advanced cooperative perception algorithms that can handle partial sensor coverage and uneven communication.

The dataset's potential to accelerate progress in cooperative perception for autonomous driving is substantial, as it addresses a critical need for realistic and diverse datasets in this rapidly evolving field. As researchers continue to explore multi-modal, multi-agent collaborative perception techniques, the Multi-V2X dataset will serve as an invaluable resource for advancing the state of the art in autonomous driving technology.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Multi-V2X: A Large Scale Multi-modal Multi-penetration-rate Dataset for Cooperative Perception

Rongsong Li, Xin Pei

Cooperative perception through vehicle-to-everything (V2X) has garnered significant attention in recent years due to its potential to overcome occlusions and enhance long-distance perception. Great achievements have been made in both datasets and algorithms. However, existing real-world datasets are limited by the presence of few communicable agents, while synthetic datasets typically cover only vehicles. More importantly, the penetration rate of connected and autonomous vehicles (CAVs) , a critical factor for the deployment of cooperative perception technologies, has not been adequately addressed. To tackle these issues, we introduce Multi-V2X, a large-scale, multi-modal, multi-penetration-rate dataset for V2X perception. By co-simulating SUMO and CARLA, we equip a substantial number of cars and roadside units (RSUs) in simulated towns with sensor suites, and collect comprehensive sensing data. Datasets with specified CAV penetration rates can be obtained by masking some equipped cars as normal vehicles. In total, our Multi-V2X dataset comprises 549k RGB frames, 146k LiDAR frames, and 4,219k annotated 3D bounding boxes across six categories. The highest possible CAV penetration rate reaches 86.21%, with up to 31 agents in communication range, posing new challenges in selecting agents to collaborate with. We provide comprehensive benchmarks for cooperative 3D object detection tasks. Our data and code are available at https://github.com/RadetzkyLi/Multi-V2X .

9/10/2024

DeepSense-V2V: A Vehicle-to-Vehicle Multi-Modal Sensing, Localization, and Communications Dataset

Joao Morais, Gouranga Charan, Nikhil Srinivas, Ahmed Alkhateeb

High data rate and low-latency vehicle-to-vehicle (V2V) communication are essential for future intelligent transport systems to enable coordination, enhance safety, and support distributed computing and intelligence requirements. Developing effective communication strategies, however, demands realistic test scenarios and datasets. This is important at the high-frequency bands where more spectrum is available, yet harvesting this bandwidth is challenged by the need for direction transmission and the sensitivity of signal propagation to blockages. This work presents the first large-scale multi-modal dataset for studying mmWave vehicle-to-vehicle communications. It presents a two-vehicle testbed that comprises data from a 360-degree camera, four radars, four 60 GHz phased arrays, a 3D lidar, and two precise GPSs. The dataset contains vehicles driving during the day and night for 120 km in intercity and rural settings, with speeds up to 100 km per hour. More than one million objects were detected across all images, from trucks to bicycles. This work further includes detailed dataset statistics that prove the coverage of various situations and highlights how this dataset can enable novel machine-learning applications.

6/27/2024

Collaborative Perception Datasets in Autonomous Driving: A Survey

Melih Yazgan, Mythra Varun Akkanapragada, J. Marius Zoellner

This survey offers a comprehensive examination of collaborative perception datasets in the context of Vehicle-to-Infrastructure (V2I), Vehicle-to-Vehicle (V2V), and Vehicle-to-Everything (V2X). It highlights the latest developments in large-scale benchmarks that accelerate advancements in perception tasks for autonomous vehicles. The paper systematically analyzes a variety of datasets, comparing them based on aspects such as diversity, sensor setup, quality, public availability, and their applicability to downstream tasks. It also highlights the key challenges such as domain shift, sensor setup limitations, and gaps in dataset diversity and availability. The importance of addressing privacy and security concerns in the development of datasets is emphasized, regarding data sharing and dataset creation. The conclusion underscores the necessity for comprehensive, globally accessible datasets and collaborative efforts from both technological and research communities to overcome these challenges and fully harness the potential of autonomous driving.

4/23/2024

End-to-End Autonomous Driving through V2X Cooperation

Haibao Yu, Wenxian Yang, Jiaru Zhong, Zhenwei Yang, Siqi Fan, Ping Luo, Zaiqing Nie

Cooperatively utilizing both ego-vehicle and infrastructure sensor data via V2X communication has emerged as a promising approach for advanced autonomous driving. However, current research mainly focuses on improving individual modules, rather than taking end-to-end learning to optimize final planning performance, resulting in underutilized data potential. In this paper, we introduce UniV2X, a pioneering cooperative autonomous driving framework that seamlessly integrates all key driving modules across diverse views into a unified network. We propose a sparse-dense hybrid data transmission and fusion mechanism for effective vehicle-infrastructure cooperation, offering three advantages: 1) Effective for simultaneously enhancing agent perception, online mapping, and occupancy prediction, ultimately improving planning performance. 2) Transmission-friendly for practical and limited communication conditions. 3) Reliable data fusion with interpretability of this hybrid data. We implement UniV2X, as well as reproducing several benchmark methods, on the challenging DAIR-V2X, the real-world cooperative driving dataset. Experimental results demonstrate the effectiveness of UniV2X in significantly enhancing planning performance, as well as all intermediate output performance. Code is at https://github.com/AIR-THU/UniV2X.

4/23/2024