PLT-D3: A High-fidelity Dynamic Driving Simulation Dataset for Stereo Depth and Scene Flow

Read original: arXiv:2406.07667 - Published 6/13/2024 by Joshua Tokarsky, Ibrahim Abdulhafiz, Satya Ayyalasomayajula, Mostafa Mohsen, Navya G. Rao, Adam Forbes
Total Score

0

PLT-D3: A High-fidelity Dynamic Driving Simulation Dataset for Stereo Depth and Scene Flow

Sign in to get full access

or

If you already have an account, we'll log you in

Overview

• This paper introduces PLT-D3, a high-fidelity dynamic driving simulation dataset designed for stereo depth and scene flow research. • The dataset provides diverse driving scenarios, including urban and highway environments, with detailed annotated information such as 3D bounding boxes, instance segmentation, and scene flow. • PLT-D3 aims to accelerate the development of robust computer vision and robotics algorithms for autonomous driving applications.

Plain English Explanation

PLT-D3 is a new dataset that can be used to train and test computer vision and robotics systems for autonomous driving. It provides a highly realistic simulated driving environment with detailed information about the 3D structure of the scene, the movement of objects, and the depth of objects from the camera.

The dataset includes a wide range of driving scenarios, from urban city streets to highways, which can help algorithms learn to handle the diverse situations that a self-driving car might encounter. The detailed annotations, such as 3D bounding boxes and instance segmentation, give researchers a lot of information to work with when developing and evaluating their models.

By making this dataset available, the researchers hope to accelerate progress in areas like 3D SLAM for autonomous vehicles, autonomous decision-making, and scenario-driven dataset development. Having a high-quality, realistic dataset can help researchers test their algorithms more effectively and develop new techniques for perceiving and understanding dynamic driving environments.

Technical Explanation

The PLT-D3 dataset is a high-fidelity simulation of dynamic driving scenarios that provides detailed annotations for tasks like stereo depth estimation and scene flow prediction. The dataset contains over 60,000 stereo image pairs across a variety of urban and highway environments, with associated 3D bounding boxes, instance segmentation, and scene flow ground truth.

The researchers used the CARLA simulation engine to create the driving scenarios, which include a range of weather conditions, traffic patterns, and other dynamic elements. The annotation process leveraged the simulation's internal state to produce accurate 3D bounding boxes, instance segmentation, and scene flow ground truth.

Compared to existing autonomous driving datasets like 3D-RealCar-Wild and D2E, PLT-D3 offers a significantly larger scale, higher fidelity, and more comprehensive annotations, making it a valuable resource for developing and evaluating computer vision and robotics algorithms for autonomous driving.

Critical Analysis

The authors acknowledge that while PLT-D3 provides high-fidelity simulations, there may still be some differences between the virtual and real-world environments that could affect the generalization of algorithms trained on the dataset. They suggest that combining synthetic and real-world data, as in ParisLuCo3D, may be a promising direction for future research.

Additionally, the dataset is currently limited to a single sensor configuration (stereo cameras) and could benefit from the inclusion of other modalities, such as LiDAR, to better represent the sensor suites used in autonomous vehicles. Expanding the range of weather conditions and traffic scenarios may also enhance the dataset's utility for testing the robustness of perception algorithms.

Conclusion

The PLT-D3 dataset provides a high-fidelity, large-scale driving simulation environment with comprehensive annotations for stereo depth and scene flow estimation. By making this dataset publicly available, the researchers aim to accelerate progress in computer vision and robotics for autonomous driving applications. The detailed annotations and diverse driving scenarios in PLT-D3 can help researchers develop and evaluate more robust and capable perception systems for self-driving cars.



This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

PLT-D3: A High-fidelity Dynamic Driving Simulation Dataset for Stereo Depth and Scene Flow
Total Score

0

PLT-D3: A High-fidelity Dynamic Driving Simulation Dataset for Stereo Depth and Scene Flow

Joshua Tokarsky, Ibrahim Abdulhafiz, Satya Ayyalasomayajula, Mostafa Mohsen, Navya G. Rao, Adam Forbes

Autonomous driving has experienced remarkable progress, bolstered by innovations in computational hardware and sophisticated deep learning methodologies. The foundation of these advancements rests on the availability and quality of datasets, which are crucial for the development and refinement of dependable and versatile autonomous driving algorithms. While numerous datasets have been developed to support the evolution of autonomous driving perception technologies, few offer the diversity required to thoroughly test and enhance system robustness under varied weather conditions. Many public datasets lack the comprehensive coverage of challenging weather scenarios and detailed, high-resolution data, which are critical for training and validating advanced autonomous-driving perception models. In this paper, we introduce PLT-D3; a Dynamic-weather Driving Dataset, designed specifically to enhance autonomous driving systems' adaptability to diverse weather conditions. PLT-D3 provides high-fidelity stereo depth and scene flow ground truth data generated using Unreal Engine 5. In particular, this dataset includes synchronized high-resolution stereo image sequences that replicate a wide array of dynamic weather scenarios including rain, snow, fog, and diverse lighting conditions, offering an unprecedented level of realism in simulation-based testing. The primary aim of PLT-D3 is to address the scarcity of comprehensive training and testing resources that can simulate real-world weather variations. Benchmarks have been established for several critical autonomous driving tasks using PLT-D3, such as depth estimation, optical flow and scene-flow to measure and enhance the performance of state-of-the-art models.

Read more

6/13/2024

SID: Stereo Image Dataset for Autonomous Driving in Adverse Conditions
Total Score

0

SID: Stereo Image Dataset for Autonomous Driving in Adverse Conditions

Zaid A. El-Shair, Abdalmalek Abu-raddaha, Aaron Cofield, Hisham Alawneh, Mohamed Aladem, Yazan Hamzeh, Samir A. Rawashdeh

Robust perception is critical for autonomous driving, especially under adverse weather and lighting conditions that commonly occur in real-world environments. In this paper, we introduce the Stereo Image Dataset (SID), a large-scale stereo-image dataset that captures a wide spectrum of challenging real-world environmental scenarios. Recorded at a rate of 20 Hz using a ZED stereo camera mounted on a vehicle, SID consists of 27 sequences totaling over 178k stereo image pairs that showcase conditions from clear skies to heavy snow, captured during the day, dusk, and night. The dataset includes detailed sequence-level annotations for weather conditions, time of day, location, and road conditions, along with instances of camera lens soiling, offering a realistic representation of the challenges in autonomous navigation. Our work aims to address a notable gap in research for autonomous driving systems by presenting high-fidelity stereo images essential for the development and testing of advanced perception algorithms. These algorithms support consistent and reliable operation across variable weather and lighting conditions, even when handling challenging situations like lens soiling. SID is publicly available at: https://doi.org/10.7302/esz6-nv83.

Read more

7/9/2024

ISETHDR: A Physics-based Synthetic Radiance Dataset for High Dynamic Range Driving Scenes
Total Score

0

ISETHDR: A Physics-based Synthetic Radiance Dataset for High Dynamic Range Driving Scenes

Zhenyi Liu, Devesh Shah, Brian Wandell

This paper describes a physics-based end-to-end software simulation for image systems. We use the software to explore sensors designed to enhance performance in high dynamic range (HDR) environments, such as driving through daytime tunnels and under nighttime conditions. We synthesize physically realistic HDR spectral radiance images and use them as the input to digital twins that model the optics and sensors of different systems. This paper makes three main contributions: (a) We create a labeled (instance segmentation and depth), synthetic radiance dataset of HDR driving scenes. (b) We describe the development and validation of the end-to-end simulation framework. (c) We present a comparative analysis of two single-shot sensors designed for HDR. We open-source both the dataset and the software.

Read more

8/23/2024

3DRealCar: An In-the-wild RGB-D Car Dataset with 360-degree Views
Total Score

0

3DRealCar: An In-the-wild RGB-D Car Dataset with 360-degree Views

Xiaobiao Du, Haiyang Sun, Shuyun Wang, Zhuojie Wu, Hongwei Sheng, Jiaying Ying, Ming Lu, Tianqing Zhu, Kun Zhan, Xin Yu

3D cars are commonly used in self-driving systems, virtual/augmented reality, and games. However, existing 3D car datasets are either synthetic or low-quality, presenting a significant gap toward the high-quality real-world 3D car datasets and limiting their applications in practical scenarios. In this paper, we propose the first large-scale 3D real car dataset, termed 3DRealCar, offering three distinctive features. (1) textbf{High-Volume}: 2,500 cars are meticulously scanned by 3D scanners, obtaining car images and point clouds with real-world dimensions; (2) textbf{High-Quality}: Each car is captured in an average of 200 dense, high-resolution 360-degree RGB-D views, enabling high-fidelity 3D reconstruction; (3) textbf{High-Diversity}: The dataset contains various cars from over 100 brands, collected under three distinct lighting conditions, including reflective, standard, and dark. Additionally, we offer detailed car parsing maps for each instance to promote research in car parsing tasks. Moreover, we remove background point clouds and standardize the car orientation to a unified axis for the reconstruction only on cars without background and controllable rendering. We benchmark 3D reconstruction results with state-of-the-art methods across each lighting condition in 3DRealCar. Extensive experiments demonstrate that the standard lighting condition part of 3DRealCar can be used to produce a large number of high-quality 3D cars, improving various 2D and 3D tasks related to cars. Notably, our dataset brings insight into the fact that recent 3D reconstruction methods face challenges in reconstructing high-quality 3D cars under reflective and dark lighting conditions. textcolor{red}{href{https://xiaobiaodu.github.io/3drealcar/}{Our dataset is available here.}}

Read more

6/10/2024