SCaRL- A Synthetic Multi-Modal Dataset for Autonomous Driving

Read original: arXiv:2405.17030 - Published 5/28/2024 by Avinash Nittur Ramesh, Aitor Correas-Serrano, María González-Huici

Overview

The researchers present a novel synthetically generated multi-modal dataset called SCaRL for training and validating autonomous driving solutions.
Multi-modal datasets that combine data from different sensors are essential for developing robust and accurate autonomous systems.
Existing datasets for autonomous driving either lack synchronized data from a complete sensor suite or are based on real-world data, which can be challenging to obtain and annotate.

Plain English Explanation

The researchers have created a new dataset called SCaRL that combines data from different types of sensors, including cameras, radar, and lidar. This kind of multi-modal dataset is important for training and testing autonomous driving systems, which need to be able to accurately detect, classify, and track objects in the environment using information from various sensors.

Existing datasets for autonomous driving either don't have all the different sensor data synchronized and aligned, or they're based on real-world data, which can be difficult and time-consuming to collect and annotate. The SCaRL dataset, on the other hand, is synthetically generated using the CARLA simulator, which allows for the creation of diverse and dynamic driving scenarios. This makes it easier to obtain the large amounts of labeled data needed to train and validate autonomous driving systems.

Importantly, SCaRL is the first dataset to include synchronized data from coherent lidar and MIMO (multiple-input, multiple-output) radar sensors, which provide detailed 3D point cloud and range-Doppler-azimuth/elevation information, respectively. This rich sensor data can help autonomous driving systems better understand their surroundings and make more informed decisions.

Technical Explanation

The SCaRL dataset is based on the CARLA simulator, which allows for the generation of diverse and dynamic driving scenarios. The dataset includes synchronized data from the following sensors:

RGB, semantic/instance, and depth cameras
Range-Doppler-Azimuth/Elevation maps and raw data from MIMO radar
3D point clouds and 2D maps of semantic, depth, and Doppler data from coherent lidar

This multi-modal data can be used to train and validate models for autonomous driving tasks such as object detection, classification, and tracking. The authors describe SCaRL as a large dataset covering diverse, dynamic scenarios and traffic conditions, and as the first to include synthetic synchronized data from coherent lidar and MIMO radar sensors.
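To make the relationship between raw radar data and the Range-Doppler maps listed above more concrete, here is a minimal sketch of the standard two-dimensional FFT processing used for FMCW-style radars. It is an illustration only: the function name, the (chirps, samples) array layout, and the assumption that the raw data is a complex cube for a single antenna channel are hypothetical and are not taken from the SCaRL documentation.

```python
import numpy as np


def range_doppler_map(cube: np.ndarray) -> np.ndarray:
    """Compute a range-Doppler magnitude map in dB.

    cube: complex array of shape (num_chirps, num_samples) for one
    antenna channel (hypothetical layout, not the SCaRL file format).
    """
    num_chirps, num_samples = cube.shape

    # Hann windows reduce sidelobes along both dimensions.
    win_doppler = np.hanning(num_chirps)[:, None]
    win_range = np.hanning(num_samples)[None, :]
    windowed = cube * win_doppler * win_range

    # FFT over fast-time samples (axis 1) resolves range.
    range_fft = np.fft.fft(windowed, axis=1)

    # FFT over chirps (axis 0) resolves Doppler; fftshift centres
    # zero velocity in the middle of the Doppler axis.
    rd = np.fft.fftshift(np.fft.fft(range_fft, axis=0), axes=0)

    # Small offset avoids log(0).
    return 20.0 * np.log10(np.abs(rd) + 1e-12)


if __name__ == "__main__":
    # Synthetic single target placed at range bin 20 and Doppler bin 5.
    M, N = 64, 128
    m = np.arange(M)[:, None]
    n = np.arange(N)[None, :]
    demo = np.exp(2j * np.pi * (20 * n / N + 5 * m / M))
    rd_map = range_doppler_map(demo)
    print(np.unravel_index(np.argmax(rd_map), rd_map.shape))
```

In a MIMO radar, the same processing would typically be repeated per virtual antenna channel, with a further FFT or beamforming step across channels to estimate azimuth and elevation; that step is omitted here.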

Critical Analysis

The researchers acknowledge that while SCaRL provides a rich and diverse dataset for training and validating autonomous driving solutions, it is still a synthetic dataset. As such, there may be differences between the simulated data and real-world conditions that could affect the performance of models trained on SCaRL when deployed in the real world.

Additionally, the paper does not provide detailed information on the specific scenarios, traffic conditions, and environmental factors included in the dataset. It would be helpful for future users to have a better understanding of the dataset's coverage and capability to assess its suitability for their particular use cases.

Further research could also explore ways to bridge the gap between synthetic and real-world data for autonomous driving, such as through domain adaptation or other techniques, to ensure that models trained on SCaRL can generalize well to real-world environments.

Conclusion

The SCaRL dataset provides a novel and comprehensive multi-modal dataset for training and validating autonomous driving solutions. By combining synchronized data from cameras, radar, and lidar sensors in diverse and dynamic simulated environments, SCaRL offers a valuable resource for the development of robust and accurate autonomous driving systems. While the dataset is synthetic, it represents an important step forward in addressing the data challenges faced by the autonomous driving community.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

SCaRL- A Synthetic Multi-Modal Dataset for Autonomous Driving

Avinash Nittur Ramesh, Aitor Correas-Serrano, María González-Huici

We present a novel synthetically generated multi-modal dataset, SCaRL, to enable the training and validation of autonomous driving solutions. Multi-modal datasets are essential to attain the robustness and high accuracy required by autonomous systems in applications such as autonomous driving. As deep learning-based solutions are becoming more prevalent for object detection, classification, and tracking tasks, there is great demand for datasets combining camera, lidar, and radar sensors. Existing real/synthetic datasets for autonomous driving lack synchronized data collection from a complete sensor suite. SCaRL provides synchronized Synthetic data from RGB, semantic/instance, and depth Cameras; Range-Doppler-Azimuth/Elevation maps and raw data from Radar; and 3D point clouds/2D maps of semantic, depth and Doppler data from coherent Lidar. SCaRL is a large dataset based on the CARLA Simulator, which provides data for diverse, dynamic scenarios and traffic conditions. SCaRL is the first dataset to include synthetic synchronized data from coherent Lidar and MIMO radar sensors. The dataset can be accessed here: https://fhr-ihs-sva.pages.fraunhofer.de/asp/scarl/

5/28/2024

CARLA-Loc: Synthetic SLAM Dataset with Full-stack Sensor Setup in Challenging Weather and Dynamic Environments

Yuhang Han, Zhengtao Liu, Shuo Sun, Dongen Li, Jiawei Sun, Chengran Yuan, Marcelo H. Ang Jr

The robustness of SLAM (Simultaneous Localization and Mapping) algorithms under challenging environmental conditions is critical for the success of autonomous driving. However, the real-world impact of such conditions remains largely unexplored due to the difficulty of altering environmental parameters in a controlled manner. To address this, we introduce CARLA-Loc, a synthetic dataset designed for challenging and dynamic environments, created using the CARLA simulator. Our dataset integrates a variety of sensors, including cameras, event cameras, LiDAR, radar, and IMU, etc. with tuned parameters and modifications to ensure the realism of the generated data. CARLA-Loc comprises 7 maps and 42 sequences, each varying in dynamics and weather conditions. Additionally, a pipeline script is provided that allows users to generate custom sequences conveniently. We evaluated 5 visual-based and 4 LiDAR-based SLAM algorithms across different sequences, analyzing how various challenging environmental factors influence localization accuracy. Our findings demonstrate the utility of the CARLA-Loc dataset in validating the efficacy of SLAM algorithms under diverse conditions.

4/19/2024

SCOPE: A Synthetic Multi-Modal Dataset for Collective Perception Including Physical-Correct Weather Conditions

Jörg Gamerdinger, Sven Teufel, Patrick Schulz, Stephan Amann, Jan-Patrick Kirchner, Oliver Bringmann

Collective perception has received considerable attention as a promising approach to overcome occlusions and limited sensing ranges of vehicle-local perception in autonomous driving. In order to develop and test novel collective perception technologies, appropriate datasets are required. These datasets must include not only different environmental conditions, as they strongly influence the perception capabilities, but also a wide range of scenarios with different road users as well as realistic sensor models. Therefore, we propose the Synthetic COllective PErception (SCOPE) dataset. SCOPE is the first synthetic multi-modal dataset that incorporates realistic camera and LiDAR models as well as parameterized and physically accurate weather simulations for both sensor types. The dataset contains 17,600 frames from over 40 diverse scenarios with up to 24 collaborative agents, infrastructure sensors, and passive traffic, including cyclists and pedestrians. In addition, recordings from two novel digital-twin maps from Karlsruhe and Tübingen are included. The dataset is available at https://ekut-es.github.io/scope

8/7/2024

SemanticSpray++: A Multimodal Dataset for Autonomous Driving in Wet Surface Conditions

Aldi Piroli, Vinzenz Dallabetta, Johannes Kopp, Marc Walessa, Daniel Meissner, Klaus Dietmayer

Autonomous vehicles rely on camera, LiDAR, and radar sensors to navigate the environment. Adverse weather conditions like snow, rain, and fog are known to be problematic for both camera and LiDAR-based perception systems. Currently, it is difficult to evaluate the performance of these methods due to the lack of publicly available datasets containing multimodal labeled data. To address this limitation, we propose the SemanticSpray++ dataset, which provides labels for camera, LiDAR, and radar data of highway-like scenarios in wet surface conditions. In particular, we provide 2D bounding boxes for the camera image, 3D bounding boxes for the LiDAR point cloud, and semantic labels for the radar targets. By labeling all three sensor modalities, the SemanticSpray++ dataset offers a comprehensive test bed for analyzing the performance of different perception methods when vehicles travel on wet surface conditions. Together with comprehensive label statistics, we also evaluate multiple baseline methods across different tasks and analyze their performances. The dataset will be available at https://semantic-spray-dataset.github.io.

6/17/2024