Predicting Safety Misbehaviours in Autonomous Driving Systems using Uncertainty Quantification

2404.18573

Published 4/30/2024 by Ruben Grewal, Paolo Tonella, Andrea Stocco

🏅

Abstract

The automated real-time recognition of unexpected situations plays a crucial role in the safety of autonomous vehicles, especially in unsupported and unpredictable scenarios. This paper evaluates different Bayesian uncertainty quantification methods from the deep learning domain for the anticipatory testing of safety-critical misbehaviours during system-level simulation-based testing. Specifically, we compute uncertainty scores as the vehicle executes, following the intuition that high uncertainty scores are indicative of unsupported runtime conditions that can be used to distinguish safe from failure-inducing driving behaviors. In our study, we conducted an evaluation of the effectiveness and computational overhead associated with two Bayesian uncertainty quantification methods, namely MC- Dropout and Deep Ensembles, for misbehaviour avoidance. Overall, for three benchmarks from the Udacity simulator comprising both out-of-distribution and unsafe conditions introduced via mutation testing, both methods successfully detected a high number of out-of-bounds episodes providing early warnings several seconds in advance, outperforming two state-of-the-art misbehaviour prediction methods based on autoencoders and attention maps in terms of effectiveness and efficiency. Notably, Deep Ensembles detected most misbehaviours without any false alarms and did so even when employing a relatively small number of models, making them computationally feasible for real-time detection. Our findings suggest that incorporating uncertainty quantification methods is a viable approach for building fail-safe mechanisms in deep neural network-based autonomous vehicles.

Create account to get full access

Overview

The paper evaluates different Bayesian uncertainty quantification methods from deep learning to detect unexpected, unsafe situations during autonomous vehicle testing.
The methods, MC-Dropout and Deep Ensembles, are used to compute uncertainty scores as the vehicle executes, with high scores indicating unsupported conditions that could lead to failures.
The effectiveness and efficiency of these methods are compared against other state-of-the-art misbehavior prediction approaches.

Plain English Explanation

The paper looks at ways to automatically detect unexpected, unsafe situations that could occur during testing of autonomous vehicles. This is crucial for ensuring the safety of self-driving cars, especially in unpredictable real-world scenarios.

The researchers evaluated two Bayesian uncertainty quantification methods from deep learning - MC-Dropout and Deep Ensembles. These methods calculate "uncertainty scores" as the autonomous vehicle is driving. The idea is that high uncertainty scores indicate the vehicle is in an unsupported situation that could lead to dangerous behavior.

The researchers tested these uncertainty quantification methods on several simulated driving scenarios, including cases where unsafe conditions were intentionally introduced. They found that both methods were effective at detecting these out-of-bounds situations and providing early warnings, outperforming other state-of-the-art misbehavior prediction approaches.

Notably, the Deep Ensembles method was able to detect most issues without any false alarms, even when using a relatively small number of models. This makes it computationally feasible for real-time use in autonomous vehicles.

Overall, the results suggest that incorporating Bayesian uncertainty quantification is a promising approach for building robust, fail-safe mechanisms in deep neural network-based autonomous driving systems.

Technical Explanation

The paper investigates the use of Bayesian uncertainty quantification methods from deep learning to enhance the safety of autonomous vehicles, particularly in unsupported and unpredictable scenarios.

The authors evaluate two prominent techniques - MC-Dropout and Deep Ensembles - in the context of simulation-based testing for safety-critical misbehaviors. The core idea is to compute uncertainty scores as the vehicle executes, where high uncertainty is indicative of unsupported runtime conditions that could lead to failures.

Through experiments on three benchmarks from the Udacity simulator, the researchers assess the effectiveness and computational overhead of these uncertainty quantification methods. They introduce both out-of-distribution and unsafe conditions via mutation testing to stress the systems.

The results show that both MC-Dropout and Deep Ensembles successfully detected a high number of out-of-bounds episodes, providing early warnings several seconds in advance. Importantly, the Deep Ensembles approach was able to achieve this without any false alarms, even when using a relatively small number of models, making it computationally feasible for real-time deployment.

These uncertainty-aware methods outperformed two state-of-the-art misbehavior prediction techniques based on autoencoders and attention maps in terms of both effectiveness and efficiency.

Critical Analysis

The paper provides a compelling case for the use of Bayesian uncertainty quantification in the context of autonomous vehicle safety testing. By explicitly modeling the uncertainty of deep neural networks, the proposed methods can identify situations that are outside the norm and potentially unsafe.

However, the paper does not delve into the limitations of these approaches. For example, the researchers acknowledge that the simulation-based evaluation may not fully capture the complexity of real-world driving scenarios. Further validation on physical test platforms or in real-world driving conditions would be necessary to assess the robustness of these techniques.

Additionally, the paper does not discuss the potential for adversarial attacks to exploit the uncertainty quantification models, nor does it address how these methods might be integrated into the broader autonomous vehicle software stack. Addressing these types of concerns would be important for transitioning the research into practical deployments.

Overall, the paper presents a promising approach, but more work is needed to fully understand the limitations and explore the real-world implications of applying Bayesian uncertainty quantification to autonomous vehicle safety.

Conclusion

This paper highlights the potential of Bayesian uncertainty quantification methods, such as MC-Dropout and Deep Ensembles, to enhance the safety of autonomous vehicles during testing and deployment. By explicitly modeling the uncertainty of deep neural networks, these techniques can detect unexpected, potentially unsafe situations in a computationally efficient manner.

The authors' findings suggest that incorporating uncertainty quantification is a viable approach for building robust, fail-safe mechanisms in self-driving car systems. This is a crucial step towards ensuring the safe operation of autonomous vehicles, particularly in unsupported and unpredictable real-world scenarios.

While further research is needed to address the limitations and challenges of these methods, the paper's results are an important contribution to the ongoing efforts to improve the safety and reliability of autonomous driving technologies.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

DADEE: Well-calibrated uncertainty quantification in neural networks for barriers-based robot safety

Masoud Ataei, Vikas Dhiman

Uncertainty-aware controllers that guarantee safety are critical for safety critical applications. Among such controllers, Control Barrier Functions (CBFs) based approaches are popular because they are fast, yet safe. However, most such works depend on Gaussian Processes (GPs) or MC-Dropout for learning and uncertainty estimation, and both approaches come with drawbacks: GPs are non-parametric methods that are slow, while MC-Dropout does not capture aleatoric uncertainty. On the other hand, modern Bayesian learning algorithms have shown promise in uncertainty quantification. The application of modern Bayesian learning methods to CBF-based controllers has not yet been studied. We aim to fill this gap by surveying uncertainty quantification algorithms and evaluating them on CBF-based safe controllers. We find that model variance-based algorithms (for example, Deep ensembles, MC-dropout, etc.) and direct estimation-based algorithms (such as DEUP) have complementary strengths. Algorithms in the former category can only estimate uncertainty accurately out-of-domain, while those in the latter category can only do so in-domain. We combine the two approaches to obtain more accurate uncertainty estimates both in- and out-of-domain. As measured by the failure rate of a simulated robot, this results in a safer CBF-based robot controller.

7/2/2024

cs.RO cs.LG cs.SY eess.SY

🤿

A Comprehensive Survey on Uncertainty Quantification for Deep Learning

Wenchong He, Zhe Jiang

Deep neural networks (DNNs) have achieved tremendous success in making accurate predictions for computer vision, natural language processing, as well as science and engineering domains. However, it is also well-recognized that DNNs sometimes make unexpected, incorrect, but overconfident predictions. This can cause serious consequences in high-stake applications, such as autonomous driving, medical diagnosis, and disaster response. Uncertainty quantification (UQ) aims to estimate the confidence of DNN predictions beyond prediction accuracy. In recent years, many UQ methods have been developed for DNNs. It is of great practical value to systematically categorize these UQ methods and compare their advantages and disadvantages. However, existing surveys mostly focus on categorizing UQ methodologies from a neural network architecture perspective or a Bayesian perspective and ignore the source of uncertainty that each methodology can incorporate, making it difficult to select an appropriate UQ method in practice. To fill the gap, this paper presents a systematic taxonomy of UQ methods for DNNs based on the types of uncertainty sources (data uncertainty versus model uncertainty). We summarize the advantages and disadvantages of methods in each category. We show how our taxonomy of UQ methodologies can potentially help guide the choice of UQ method in different machine learning problems (e.g., active learning, robustness, and reinforcement learning). We also identify current research gaps and propose several future research directions.

4/11/2024

cs.LG stat.ML

🔮

Probabilistic Uncertainty Quantification of Prediction Models with Application to Visual Localization

Junan Chen, Josephine Monica, Wei-Lun Chao, Mark Campbell

The uncertainty quantification of prediction models (e.g., neural networks) is crucial for their adoption in many robotics applications. This is arguably as important as making accurate predictions, especially for safety-critical applications such as self-driving cars. This paper proposes our approach to uncertainty quantification in the context of visual localization for autonomous driving, where we predict locations from images. Our proposed framework estimates probabilistic uncertainty by creating a sensor error model that maps an internal output of the prediction model to the uncertainty. The sensor error model is created using multiple image databases of visual localization, each with ground-truth location. We demonstrate the accuracy of our uncertainty prediction framework using the Ithaca365 dataset, which includes variations in lighting, weather (sunny, snowy, night), and alignment errors between databases. We analyze both the predicted uncertainty and its incorporation into a Kalman-based localization filter. Our results show that prediction error variations increase with poor weather and lighting condition, leading to greater uncertainty and outliers, which can be predicted by our proposed uncertainty model. Additionally, our probabilistic error model enables the filter to remove ad hoc sensor gating, as the uncertainty automatically adjusts the model to the input data

4/9/2024

cs.RO

Uncertainty Quantification for Bird's Eye View Semantic Segmentation: Methods and Benchmarks

Linlin Yu, Bowen Yang, Tianhao Wang, Kangshuo Li, Feng Chen

The fusion of raw features from multiple sensors on an autonomous vehicle to create a Bird's Eye View (BEV) representation is crucial for planning and control systems. There is growing interest in using deep learning models for BEV semantic segmentation. Anticipating segmentation errors and improving the explainability of DNNs is essential for autonomous driving, yet it is under-studied. This paper introduces a benchmark for predictive uncertainty quantification in BEV segmentation. The benchmark assesses various approaches across three popular datasets using two representative backbones and focuses on the effectiveness of predicted uncertainty in identifying misclassified and out-of-distribution (OOD) pixels, as well as calibration. Empirical findings highlight the challenges in uncertainty quantification. Our results find that evidential deep learning based approaches show the most promise by efficiently quantifying aleatoric and epistemic uncertainty. We propose the Uncertainty-Focal-Cross-Entropy (UFCE) loss, designed for highly imbalanced data, which consistently improves the segmentation quality and calibration. Additionally, we introduce a vacuity-scaled regularization term that enhances the model's focus on high uncertainty pixels, improving epistemic uncertainty quantification.

6/3/2024

cs.LG cs.CV