Dance of the ADS: Orchestrating Failures through Historically-Informed Scenario Fuzzing

Read original: arXiv:2407.04359 - Published 7/8/2024 by Tong Wang, Taotao Gu, Huan Deng, Hu Li, Xiaohui Kuang, Gang Zhao

Dance of the ADS: Orchestrating Failures through Historically-Informed Scenario Fuzzing

Overview

Autonomous driving systems (ADS) must be thoroughly tested to ensure safety
Scenario-based testing is a key approach, but can miss rare or complex failure modes
The paper "Dance of the ADS: Orchestrating Failures through Historically-Informed Scenario Fuzzing" proposes a new technique for generating diverse testing scenarios

Plain English Explanation

The paper discusses the challenge of ensuring the safety of autonomous driving systems (ADS). These systems must be extensively tested to identify potential failure modes before deployment. One common testing approach is scenario-based testing, where the ADS is evaluated in a variety of simulated driving situations.

However, the authors argue that scenario-based testing can miss rare or complex failure modes. To address this, they propose a new technique called "historically-informed scenario fuzzing." This approach uses historical data on real-world crashes and incidents to generate a diverse set of testing scenarios that are more likely to uncover system weaknesses.

By basing the test scenarios on real-world events, the authors believe their method can better anticipate the types of challenges an ADS might face in the real world. This can help developers identify and fix issues before the system is deployed, ultimately enhancing the safety and reliability of autonomous vehicles.

Technical Explanation

The paper introduces a novel framework for scenario-based testing of autonomous driving systems. The key innovation is the use of "historically-informed scenario fuzzing" to generate diverse testing scenarios.

The framework first collects a database of historical traffic incidents and accidents. It then uses this data to inform the generation of new test scenarios, ensuring they are representative of real-world challenges an ADS might encounter. This is done through a combination of parameter randomization and rule-based scenario extrapolation.

The authors evaluate their approach using a simulation-based testing environment. They demonstrate that the historically-informed scenarios are more effective at uncovering system failures compared to traditional scenario generation methods. The paper also provides insights into the types of failures that were observed and how they relate to known real-world incidents.

Critical Analysis

The paper presents a promising approach to improve the safety and reliability of autonomous driving systems. By basing test scenarios on real-world data, the authors aim to create a more comprehensive and representative set of challenges for the ADS to navigate.

One potential limitation is the reliance on historical data, which may not fully capture the evolving nature of traffic environments and driver behavior. The authors acknowledge this and suggest incorporating additional data sources, such as open-source tools for automated performance evaluation, to further enhance the scenario generation process.

Additionally, the paper focuses on simulation-based testing and does not address the challenges of translating these findings to real-world deployment. Further research is needed to understand how the historically-informed scenarios might perform in physical testing and how they can be integrated into the broader ADS development and validation process.

Conclusion

The "Dance of the ADS" paper introduces a novel approach to scenario-based testing for autonomous driving systems. By leveraging historical data on real-world incidents, the authors aim to create a more comprehensive and diverse set of testing scenarios that can better uncover potential system failures.

This research represents an important step towards improving the safety and reliability of autonomous vehicles. By incorporating real-world insights into the testing process, developers can better anticipate the challenges an ADS might face in the real world and address them before deployment. As the field of autonomous driving continues to evolve, techniques like scene extrapolation for generating interactive traffic scenarios will play a crucial role in ensuring the safety and widespread adoption of these transformative technologies.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Dance of the ADS: Orchestrating Failures through Historically-Informed Scenario Fuzzing

Tong Wang, Taotao Gu, Huan Deng, Hu Li, Xiaohui Kuang, Gang Zhao

As autonomous driving systems (ADS) advance towards higher levels of autonomy, orchestrating their safety verification becomes increasingly intricate. This paper unveils ScenarioFuzz, a pioneering scenario-based fuzz testing methodology. Designed like a choreographer who understands the past performances, it uncovers vulnerabilities in ADS without the crutch of predefined scenarios. Leveraging map road networks, such as OPENDRIVE, we extract essential data to form a foundational scenario seed corpus. This corpus, enriched with pertinent information, provides the necessary boundaries for fuzz testing in the absence of starting scenarios. Our approach integrates specialized mutators and mutation techniques, combined with a graph neural network model, to predict and filter out high-risk scenario seeds, optimizing the fuzzing process using historical test data. Compared to other methods, our approach reduces the time cost by an average of 60.3%, while the number of error scenarios discovered per unit of time increases by 103%. Furthermore, we propose a self-supervised collision trajectory clustering method, which aids in identifying and summarizing 54 high-risk scenario categories prone to inducing ADS faults. Our experiments have successfully uncovered 58 bugs across six tested systems, emphasizing the critical safety concerns of ADS.

7/8/2024

🗣️

Coverage Metrics for a Scenario Database for the Scenario-Based Assessment of Automated Driving Systems

Erwin de Gelder, Maren Buermann, Olaf Op den Camp

Automated Driving Systems (ADSs) have the potential to make mobility services available and safe for all. A multi-pillar Safety Assessment Framework (SAF) has been proposed for the type-approval process of ADSs. The SAF requires that the test scenarios for the ADS adequately covers the Operational Design Domain (ODD) of the ADS. A common method for generating test scenarios involves basing them on scenarios identified and characterized from driving data. This work addresses two questions when collecting scenarios from driving data. First, do the collected scenarios cover all relevant aspects of the ADS' ODD? Second, do the collected scenarios cover all relevant aspects that are in the driving data, such that no potentially important situations are missed? This work proposes coverage metrics that provide a quantitative answer to these questions. The proposed coverage metrics are illustrated by means of an experiment in which over 200000 scenarios from 10 different scenario categories are collected from the HighD data set. The experiment demonstrates that a coverage of 100 % can be achieved under certain conditions, and it also identifies which data and scenarios could be added to enhance the coverage outcomes in case a 100 % coverage has not been achieved. Whereas this work presents metrics for the quantification of the coverage of driving data and the identified scenarios, this paper concludes with future research directions, including the quantification of the completeness of driving data and the identified scenarios.

9/4/2024

ROCAS: Root Cause Analysis of Autonomous Driving Accidents via Cyber-Physical Co-mutation

Shiwei Feng, Yapeng Ye, Qingkai Shi, Zhiyuan Cheng, Xiangzhe Xu, Siyuan Cheng, Hongjun Choi, Xiangyu Zhang

As Autonomous driving systems (ADS) have transformed our daily life, safety of ADS is of growing significance. While various testing approaches have emerged to enhance the ADS reliability, a crucial gap remains in understanding the accidents causes. Such post-accident analysis is paramount and beneficial for enhancing ADS safety and reliability. Existing cyber-physical system (CPS) root cause analysis techniques are mainly designed for drones and cannot handle the unique challenges introduced by more complex physical environments and deep learning models deployed in ADS. In this paper, we address the gap by offering a formal definition of ADS root cause analysis problem and introducing ROCAS, a novel ADS root cause analysis framework featuring cyber-physical co-mutation. Our technique uniquely leverages both physical and cyber mutation that can precisely identify the accident-trigger entity and pinpoint the misconfiguration of the target ADS responsible for an accident. We further design a differential analysis to identify the responsible module to reduce search space for the misconfiguration. We study 12 categories of ADS accidents and demonstrate the effectiveness and efficiency of ROCAS in narrowing down search space and pinpointing the misconfiguration. We also show detailed case studies on how the identified misconfiguration helps understand rationale behind accidents.

9/17/2024

📉

Characterization and Mitigation of Insufficiencies in Automated Driving Systems

Yuting Fu, Jochen Seemann, Caspar Hanselaar, Tim Beurskens, Andrei Terechko, Emilia Silvas, Maurice Heemels

Automated Driving (AD) systems have the potential to increase safety, comfort and energy efficiency. Recently, major automotive companies have started testing and validating AD systems (ADS) on public roads. Nevertheless, the commercial deployment and wide adoption of ADS have been moderate, partially due to system functional insufficiencies (FI) that undermine passenger safety and lead to hazardous situations on the road. FIs are defined in ISO 21448 Safety Of The Intended Functionality (SOTIF). FIs are insufficiencies in sensors, actuators and algorithm implementations, including neural networks and probabilistic calculations. Examples of FIs in ADS include inaccurate ego-vehicle localization on the road, incorrect prediction of a cyclist maneuver, unreliable detection of a pedestrian, etc. The main goal of our study is to formulate a generic architectural design pattern, which is compatible with existing methods and ADS, to improve FI mitigation and enable faster commercial deployment of ADS. First, we studied the 2021 autonomous vehicles disengagement reports published by the California Department of Motor Vehicles (DMV). The data clearly show that disengagements are five times more often caused by FIs rather than by system faults. We then made a comprehensive list of insufficiencies and their characteristics by analyzing over 10 hours of publicly available road test videos. In particular, we identified insufficiency types in four major categories: world model, motion plan, traffic rule, and operational design domain. The insufficiency characterization helps making the SOTIF analyses of triggering conditions more systematic and comprehensive. Based on our FI characterization, simulation experiments and literature survey, we define a novel generic architectural design pattern Daruma to dynamically select the channel that is least likely to have a FI at the moment.

4/16/2024