Diversifying Deep Ensembles: A Saliency Map Approach for Enhanced OOD Detection, Calibration, and Accuracy

2305.11616

Published 6/17/2024 by Stanislav Dereka, Ivan Karpukhin, Maksim Zhdanov, Sergey Kolesnikov

🤿

Abstract

Deep ensembles are capable of achieving state-of-the-art results in classification and out-of-distribution (OOD) detection. However, their effectiveness is limited due to the homogeneity of learned patterns within ensembles. To overcome this issue, our study introduces Saliency Diversified Deep Ensemble (SDDE), a novel approach that promotes diversity among ensemble members by leveraging saliency maps. Through incorporating saliency map diversification, our method outperforms conventional ensemble techniques and improves calibration in multiple classification and OOD detection tasks. In particular, the proposed method achieves state-of-the-art OOD detection quality, calibration, and accuracy on multiple benchmarks, including CIFAR10/100 and large-scale ImageNet datasets.

Create account to get full access

Overview

Deep ensembles can achieve state-of-the-art results in classification and out-of-distribution (OOD) detection tasks.
However, their effectiveness is limited due to the homogeneity of learned patterns within the ensemble members.
The study introduces Saliency Diversified Deep Ensemble (SDDE), a novel approach that promotes diversity among ensemble members by leveraging saliency maps.

Plain English Explanation

Deep learning models can be very good at tasks like classifying images or detecting when an image doesn't belong to the normal set of images they were trained on (out-of-distribution detection). One way to make these models even better is to use an ensemble, which combines the predictions of multiple models to get a more reliable result.

However, the models in a typical ensemble often end up learning very similar patterns, which limits how much the ensemble can improve on the individual models. To address this, the researchers developed a new approach called Saliency Diversified Deep Ensemble (SDDE). SDDE uses saliency maps to identify the key features each model is focusing on, and then encourages the models in the ensemble to focus on different features. This increases the diversity of the ensemble, allowing it to outperform traditional ensembles on both classification accuracy and out-of-distribution detection.

The saliency map diversification technique used in SDDE is a clever way to address the homogeneity problem that often plagues deep learning ensembles. By promoting diversity, SDDE is able to achieve state-of-the-art results on a variety of classification and OOD detection benchmarks, including the popular CIFAR and ImageNet datasets.

Technical Explanation

The core idea behind the Saliency Diversified Deep Ensemble (SDDE) approach is to increase the diversity of the ensemble members by encouraging them to focus on different features of the input data. This is achieved by incorporating a saliency map diversification mechanism into the ensemble training process.

Saliency maps are visual representations that highlight the important regions of an input image that a neural network model is focusing on to make its prediction. By diversifying the saliency maps of the ensemble members, the researchers ensure that each model is attending to different features, leading to more diverse and complementary predictions.

The SDDE training process involves several key steps:

Training the base models independently on the task at hand, such as image classification or OOD detection.
Generating saliency maps for each base model's predictions on the training data.
Introducing a diversification penalty that encourages the saliency maps of the ensemble members to be as different as possible from each other.
Fine-tuning the base models using this diversification penalty, along with the original task-specific loss.

By promoting saliency map diversity, the SDDE approach is able to create an ensemble where each member focuses on different aspects of the input, leading to improved overall performance on both classification and OOD detection tasks. The researchers demonstrate the effectiveness of SDDE on several benchmark datasets, showcasing its state-of-the-art results.

Critical Analysis

The paper introduces a novel and promising approach to improving the effectiveness of deep ensembles. By leveraging saliency map diversification, the SDDE method addresses the homogeneity issue that often plagues ensemble models, leading to significant performance gains.

However, the paper does not explore the potential limitations or caveats of the SDDE approach. For example, it would be interesting to understand how the method scales with the number of ensemble members, or how sensitive it is to the choice of hyperparameters controlling the diversification penalty.

Additionally, the paper focuses solely on image-based tasks, and it's unclear how well the SDDE approach would generalize to other domains, such as natural language processing or time series analysis. Further research is needed to understand the broader applicability of this technique.

While the results are impressive, the paper could benefit from a more critical examination of the potential drawbacks or edge cases of the SDDE method. Encouraging readers to think critically about the research and form their own opinions is important for advancing the field.

Conclusion

The Saliency Diversified Deep Ensemble (SDDE) approach introduced in this paper represents a significant advancement in the field of deep ensemble learning. By promoting diversity among ensemble members through saliency map diversification, SDDE is able to outperform traditional ensemble techniques on both classification and out-of-distribution detection tasks.

The state-of-the-art results demonstrated on popular benchmarks, such as CIFAR and ImageNet, highlight the practical value of this innovation. As deep learning models become increasingly ubiquitous in real-world applications, techniques like SDDE that can improve model robustness and reliability will be increasingly important.

While the paper does not explore all the potential limitations of the SDDE method, it provides a solid foundation for future research in this area. Exploring the scalability, generalizability, and edge cases of saliency-based ensemble diversification could lead to further advancements in the field of ensemble learning and out-of-distribution detection.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Split-Ensemble: Efficient OOD-aware Ensemble via Task and Model Splitting

Anthony Chen, Huanrui Yang, Yulu Gan, Denis A Gudovskiy, Zhen Dong, Haofan Wang, Tomoyuki Okuno, Yohei Nakata, Kurt Keutzer, Shanghang Zhang

Uncertainty estimation is crucial for machine learning models to detect out-of-distribution (OOD) inputs. However, the conventional discriminative deep learning classifiers produce uncalibrated closed-set predictions for OOD data. A more robust classifiers with the uncertainty estimation typically require a potentially unavailable OOD dataset for outlier exposure training, or a considerable amount of additional memory and compute to build ensemble models. In this work, we improve on uncertainty estimation without extra OOD data or additional inference costs using an alternative Split-Ensemble method. Specifically, we propose a novel subtask-splitting ensemble training objective, where a common multiclass classification task is split into several complementary subtasks. Then, each subtask's training data can be considered as OOD to the other subtasks. Diverse submodels can therefore be trained on each subtask with OOD-aware objectives. The subtask-splitting objective enables us to share low-level features across submodels to avoid parameter and computational overheads. In particular, we build a tree-like Split-Ensemble architecture by performing iterative splitting and pruning from a shared backbone model, where each branch serves as a submodel corresponding to a subtask. This leads to improved accuracy and uncertainty estimation across submodels under a fixed ensemble computation budget. Empirical study with ResNet-18 backbone shows Split-Ensemble, without additional computation cost, improves accuracy over a single model by 0.8%, 1.8%, and 25.5% on CIFAR-10, CIFAR-100, and Tiny-ImageNet, respectively. OOD detection for the same backbone and in-distribution datasets surpasses a single model baseline by, correspondingly, 2.2%, 8.1%, and 29.6% mean AUROC.

5/28/2024

cs.LG cs.CV

🔎

MultiOOD: Scaling Out-of-Distribution Detection for Multiple Modalities

Hao Dong, Yue Zhao, Eleni Chatzi, Olga Fink

Detecting out-of-distribution (OOD) samples is important for deploying machine learning models in safety-critical applications such as autonomous driving and robot-assisted surgery. Existing research has mainly focused on unimodal scenarios on image data. However, real-world applications are inherently multimodal, which makes it essential to leverage information from multiple modalities to enhance the efficacy of OOD detection. To establish a foundation for more realistic Multimodal OOD Detection, we introduce the first-of-its-kind benchmark, MultiOOD, characterized by diverse dataset sizes and varying modality combinations. We first evaluate existing unimodal OOD detection algorithms on MultiOOD, observing that the mere inclusion of additional modalities yields substantial improvements. This underscores the importance of utilizing multiple modalities for OOD detection. Based on the observation of Modality Prediction Discrepancy between in-distribution (ID) and OOD data, and its strong correlation with OOD performance, we propose the Agree-to-Disagree (A2D) algorithm to encourage such discrepancy during training. Moreover, we introduce a novel outlier synthesis method, NP-Mix, which explores broader feature spaces by leveraging the information from nearest neighbor classes and complements A2D to strengthen OOD detection performance. Extensive experiments on MultiOOD demonstrate that training with A2D and NP-Mix improves existing OOD detection algorithms by a large margin. Our source code and MultiOOD benchmark are available at https://github.com/donghao51/MultiOOD.

5/28/2024

cs.CV cs.AI cs.LG

Unraveling the Key Components of OOD Generalization via Diversification

Harold Benoit, Liangze Jiang, Andrei Atanov, Ou{g}uzhan Fatih Kar, Mattia Rigotti, Amir Zamir

Supervised learning datasets may contain multiple cues that explain the training set equally well, i.e., learning any of them would lead to the correct predictions on the training data. However, many of them can be spurious, i.e., lose their predictive power under a distribution shift and consequently fail to generalize to out-of-distribution (OOD) data. Recently developed diversification methods (Lee et al., 2023; Pagliardini et al., 2023) approach this problem by finding multiple diverse hypotheses that rely on different features. This paper aims to study this class of methods and identify the key components contributing to their OOD generalization abilities. We show that (1) diversification methods are highly sensitive to the distribution of the unlabeled data used for diversification and can underperform significantly when away from a method-specific sweet spot. (2) Diversification alone is insufficient for OOD generalization. The choice of the used learning algorithm, e.g., the model's architecture and pretraining, is crucial. In standard experiments (classification on Waterbirds and Office-Home datasets), using the second-best choice leads to an up to 20% absolute drop in accuracy. (3) The optimal choice of learning algorithm depends on the unlabeled data and vice versa i.e. they are co-dependent. (4) Finally, we show that, in practice, the above pitfalls cannot be alleviated by increasing the number of diverse hypotheses, the major feature of diversification methods. These findings provide a clearer understanding of the critical design factors influencing the OOD generalization abilities of diversification methods. They can guide practitioners in how to use the existing methods best and guide researchers in developing new, better ones.

4/23/2024

cs.LG

🤷

Unified Unsupervised Salient Object Detection via Knowledge Transfer

Yao Yuan, Wutao Liu, Pan Gao, Qun Dai, Jie Qin

Recently, unsupervised salient object detection (USOD) has gained increasing attention due to its annotation-free nature. However, current methods mainly focus on specific tasks such as RGB and RGB-D, neglecting the potential for task migration. In this paper, we propose a unified USOD framework for generic USOD tasks. Firstly, we propose a Progressive Curriculum Learning-based Saliency Distilling (PCL-SD) mechanism to extract saliency cues from a pre-trained deep network. This mechanism starts with easy samples and progressively moves towards harder ones, to avoid initial interference caused by hard samples. Afterwards, the obtained saliency cues are utilized to train a saliency detector, and we employ a Self-rectify Pseudo-label Refinement (SPR) mechanism to improve the quality of pseudo-labels. Finally, an adapter-tuning method is devised to transfer the acquired saliency knowledge, leveraging shared knowledge to attain superior transferring performance on the target tasks. Extensive experiments on five representative SOD tasks confirm the effectiveness and feasibility of our proposed method. Code and supplement materials are available at https://github.com/I2-Multimedia-Lab/A2S-v3.

4/24/2024

cs.CV