Nuisances via Negativa: Adjusting for Spurious Correlations via Data Augmentation

Read original: arXiv:2210.01302 - Published 7/4/2024 by Aahlad Puli, Nitish Joshi, Yoav Wald, He He, Rajesh Ranganath

Overview

In prediction tasks, there are features that are consistently related to the label, called semantic features or semantics, and features that have varying relationships to the label, called nuisances.
Models that exploit nuisance-label relationships face performance degradation when these relationships change; building models robust to such changes requires additional knowledge beyond samples of the features and labels.
Existing work uses annotations of nuisances or assumes that models trained with empirical risk minimization (ERM) depend on nuisances. This paper instead uses knowledge about the semantics, which enlarges the settings where robust models can be built.

Plain English Explanation

When trying to predict something, like whether an image contains a cow, there are certain features that are consistently related to the thing you're trying to predict (the label) - in this case, the shape of the cow's head. These are called semantic features. However, there are also features that have a more variable relationship to the label, like the background of the image. These are called nuisances.

The problem is that models that rely on these nuisance-label relationships can struggle when those relationships change. For example, if cows are often shown in images with grass backgrounds, but then you try to use the model on images of cows with different backgrounds, the model's performance will degrade.
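To make the failure mode concrete, here is a minimal toy sketch. It is an illustration only, not the paper's setup: the function `make_split`, the binary nuisance, and the agreement rates 0.9 and 0.1 are all assumptions chosen for the example. A "model" that predicts the label straight from the nuisance looks accurate while the nuisance agrees with the label, and collapses once that relationship flips.

```python
import numpy as np

def make_split(n, p_agree, rng):
    """Toy data: a binary label y and a binary nuisance z that equals y with probability p_agree."""
    y = rng.integers(0, 2, size=n)
    agree = rng.random(n) < p_agree
    z = np.where(agree, y, 1 - y)
    return z, y

rng = np.random.default_rng(0)
# Hypothetical rates: the nuisance (think "grass background") matches the label
# 90% of the time in training but only 10% of the time at test time.
z_train, y_train = make_split(10_000, 0.9, rng)
z_test, y_test = make_split(10_000, 0.1, rng)

# A predictor that simply outputs the nuisance as its guess for the label.
train_acc = float(np.mean(z_train == y_train))
test_acc = float(np.mean(z_test == y_test))
print(f"train accuracy: {train_acc:.2f}, test accuracy: {test_acc:.2f}")
```

Collecting more training samples from the first split would not help here, since every additional sample carries the same nuisance-label relationship.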

Simply using more samples of the features and labels isn't enough to solve this problem. Instead, researchers have explored using additional knowledge, like annotations of the nuisance features or assumptions about how the model is using those nuisances.

This paper introduces a new approach that uses knowledge about the semantic features: it corrupts those features in the training data. A model trained on the corrupted data can no longer lean on the semantics, so what it learns exposes the correlations between nuisances and the label. Once those correlations are identified, they can be used to adjust for the cases where nuisances drive predictions, which makes the final model more robust to changes in those relationships.
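One way to picture a semantic corruption for images is to shuffle square patches, which scrambles global shape (for example, the outline of a head) while leaving colors and local texture, such as a grassy background, largely in place. The sketch below is a hedged illustration: this summary does not spell out the corruption procedure, and the function name `patch_randomize` and the patch size are assumptions made for the example.

```python
import numpy as np

def patch_randomize(image, patch_size, rng):
    """Split an (H, W, C) image into square patches and shuffle their positions."""
    h, w, c = image.shape
    if h % patch_size or w % patch_size:
        raise ValueError("image height and width must be divisible by patch_size")
    gh, gw = h // patch_size, w // patch_size
    # Cut into a (gh, gw) grid of patches, then flatten the grid to one axis.
    patches = image.reshape(gh, patch_size, gw, patch_size, c).transpose(0, 2, 1, 3, 4)
    patches = patches.reshape(gh * gw, patch_size, patch_size, c)
    # Shuffle which grid cell each patch lands in.
    patches = patches[rng.permutation(gh * gw)]
    # Reassemble the shuffled patches into an image of the original shape.
    out = patches.reshape(gh, gw, patch_size, patch_size, c).transpose(0, 2, 1, 3, 4)
    return out.reshape(h, w, c)

rng = np.random.default_rng(0)
image = rng.integers(0, 256, size=(8, 8, 3), dtype=np.uint8)  # stand-in for a real photo
corrupted = patch_randomize(image, patch_size=4, rng=rng)
print(corrupted.shape)
```

Every pixel survives the shuffle, so per-channel color statistics are unchanged; only the spatial arrangement, where shape information lives, is disturbed.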

The researchers test this approach on a few different prediction tasks, like classifying waterbirds, doing natural language inference, and detecting cardiomegaly in chest X-rays.

Technical Explanation

The key idea in this paper is to use knowledge about the semantic features - the features that are consistently related to the label - to build models that are robust to changes in the relationships between nuisance features and the label.

The researchers develop an approach that corrupts the semantic features in the data and then uses the corrupted data to produce models that identify correlations between nuisances and the label. Once identified, these correlations can be used to adjust for where nuisances drive predictions. This expands the settings where robust models can be built, compared to previous approaches that relied on annotations of nuisances or assumed that ERM-trained models depend on them.
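The two-stage idea can be sketched on toy data. This is a sketch under stated assumptions, not the authors' implementation: the data generator, the shuffle-based corruption, the helper `fit_logistic`, and the choice of importance reweighting as the adjustment are all illustrative, since this summary does not name the specific adjustment methods. Stage one fits a model on semantically corrupted inputs, so it can only learn the nuisance-label correlation. Stage two uses that model's predictions to reweight examples so the nuisance no longer predicts the label.

```python
import numpy as np

def fit_logistic(x, y, lr=0.5, steps=500):
    """Plain gradient descent for logistic regression; returns (weights, bias)."""
    w = np.zeros(x.shape[1])
    b = 0.0
    for _ in range(steps):
        p = 1.0 / (1.0 + np.exp(-(x @ w + b)))
        w -= lr * (x.T @ (p - y)) / len(y)
        b -= lr * float(np.mean(p - y))
    return w, b

rng = np.random.default_rng(0)
n = 5000
y = rng.integers(0, 2, size=n)
semantic = (2 * y - 1) + 0.5 * rng.standard_normal(n)   # reliably tied to the label
agree = rng.random(n) < 0.9                              # assumed 90% agreement rate
nuisance = np.where(agree, 2 * y - 1, 1 - 2 * y).astype(float)

# Stage 1: corrupt the semantics (here, shuffle them across examples) and fit a
# model on the corrupted inputs. Only the nuisance still carries label signal.
corrupted_semantic = rng.permutation(semantic)
x_corrupted = np.column_stack([corrupted_semantic, nuisance])
w, b = fit_logistic(x_corrupted, y)

# Stage 2: weight each example by p(y) / p_biased(y | corrupted x).
p1 = 1.0 / (1.0 + np.exp(-(x_corrupted @ w + b)))
p_label = np.where(y == 1, p1, 1.0 - p1)
prior = np.where(y == 1, y.mean(), 1.0 - y.mean())
weights = prior / p_label

matches = ((nuisance > 0) == (y == 1)).astype(float)
raw_agreement = float(np.mean(matches))
weighted_agreement = float(np.average(matches, weights=weights))
print(f"nuisance-label agreement: raw {raw_agreement:.2f}, reweighted {weighted_agreement:.2f}")
```

Under the weights, the nuisance agrees with the label about as often as a coin flip, so a final model trained on the reweighted, uncorrupted data has little incentive to rely on it.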

The researchers test their approach on a variety of out-of-distribution (OOD) tasks, including classifying waterbirds, doing natural language inference, and detecting cardiomegaly in chest X-rays. Their results show that corrupting the semantic features can effectively power different methods for avoiding spurious correlations and improving model robustness.
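For a text task such as natural language inference, an analogous corruption could shuffle word chunks: the meaning of a sentence depends on word order, while the mere presence of certain words can survive as a nuisance signal. The sketch below is an assumption made for illustration; the function `ngram_randomize`, the chunk size, and the example sentence are hypothetical and not taken from this summary.

```python
import random

def ngram_randomize(sentence, n, rng):
    """Split a sentence into consecutive n-word chunks and shuffle the chunks."""
    words = sentence.split()
    chunks = [words[i:i + n] for i in range(0, len(words), n)]
    rng.shuffle(chunks)
    return " ".join(word for chunk in chunks for word in chunk)

rng = random.Random(0)
hypothesis = "the man is not sleeping on the couch"  # made-up example sentence
corrupted = ngram_randomize(hypothesis, n=1, rng=rng)
print(corrupted)
```

The corrupted sentence keeps exactly the same words, so a model trained on it can still pick up on word-presence cues, but it has much less access to what the sentence actually says.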

Critical Analysis

The paper provides a novel approach for building robust models that generalize beyond the specific nuisance-label relationships seen in the training data. Corrupting the semantic features isolates those nuisance-label correlations, which can break down in real-world settings, so that downstream methods can adjust for them.

However, the paper does not address how to determine which features are truly semantic versus nuisance, as this distinction may not always be clear-cut. Additionally, the specific corruption techniques used may not be generalizable to all types of semantic features, and their effectiveness could depend on the task and data.

Further research could explore more automated or adaptive methods for identifying semantic and nuisance features, as well as corruption techniques that can work across a wider range of prediction tasks and data modalities. Noise correction for subjective datasets may also be a fruitful area to investigate in the context of this work.

Overall, this paper presents a promising direction for building more robust and generalizable machine learning models, but there is still work to be done to address the limitations and expand the applicability of these techniques.

Conclusion

This paper introduces a novel approach for building machine learning models that are robust to changes in the relationships between nuisance features and the target label. Corrupting the semantic features in the training data yields models that identify these potentially fragile nuisance-label correlations, which can break down in real-world deployment, so that predictions can be adjusted to rely less on them.

The researchers demonstrate the effectiveness of this approach on a variety of out-of-distribution prediction tasks, showing that it can power different methods for avoiding spurious correlations and improving model robustness. While there are still some limitations to address, this work represents an important step forward in developing more generalizable and reliable machine learning systems.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Nuisances via Negativa: Adjusting for Spurious Correlations via Data Augmentation

Aahlad Puli, Nitish Joshi, Yoav Wald, He He, Rajesh Ranganath

In prediction tasks, there exist features that are related to the label in the same way across different settings for that task; these are semantic features or semantics. Features with varying relationships to the label are nuisances. For example, in detecting cows from natural images, the shape of the head is semantic but because images of cows often have grass backgrounds but not always, the background is a nuisance. Models that exploit nuisance-label relationships face performance degradation when these relationships change. Building models robust to such changes requires additional knowledge beyond samples of the features and labels. For example, existing work uses annotations of nuisances or assumes ERM-trained models depend on nuisances. Approaches to integrate new kinds of additional knowledge enlarge the settings where robust models can be built. We develop an approach to use knowledge about the semantics by corrupting them in data, and then using the corrupted data to produce models which identify correlations between nuisances and the label. Once these correlations are identified, they can be used to adjust for where nuisances drive predictions. We study semantic corruptions in powering different spurious-correlation avoiding methods on multiple out-of-distribution (OOD) tasks like classifying waterbirds, natural language inference (NLI), and detecting cardiomegaly in chest X-rays.

7/4/2024

Out of spuriousity: Improving robustness to spurious correlations without group annotations

Phuong Quynh Le, Jorg Schlotterer, Christin Seifert

Machine learning models are known to learn spurious correlations, i.e., features having strong relations with class labels but no causal relation. Relying on those correlations leads to poor performance in the data groups without these correlations and poor generalization ability. To improve the robustness of machine learning models to spurious correlations, we propose an approach to extract a subnetwork from a fully trained network that does not rely on spurious correlations. The subnetwork is found by the assumption that data points with the same spurious attribute will be close to each other in the representation space when training with ERM, then we employ supervised contrastive loss in a novel way to force models to unlearn the spurious connections. The increase in the worst-group performance of our approach contributes to strengthening the hypothesis that there exists a subnetwork in a fully trained dense network that is responsible for using only invariant features in classification tasks, therefore erasing the influence of spurious features even in the setup of multi spurious attributes and no prior knowledge of attributes labels.

7/23/2024

Robust Classification by Coupling Data Mollification with Label Smoothing

Markus Heinonen, Ba-Hien Tran, Michael Kampffmeyer, Maurizio Filippone

Introducing training-time augmentations is a key technique to enhance generalization and prepare deep neural networks against test-time corruptions. Inspired by the success of generative diffusion models, we propose a novel approach coupling data augmentation, in the form of image noising and blurring, with label smoothing to align predicted label confidences with image degradation. The method is simple to implement, introduces negligible overheads, and can be combined with existing augmentations. We demonstrate improved robustness and uncertainty quantification on the corrupted image benchmarks of the CIFAR and TinyImageNet datasets.

6/4/2024

Learning Robust Classifiers with Self-Guided Spurious Correlation Mitigation

Guangtao Zheng, Wenqian Ye, Aidong Zhang

Deep neural classifiers tend to rely on spurious correlations between spurious attributes of inputs and targets to make predictions, which could jeopardize their generalization capability. Training classifiers robust to spurious correlations typically relies on annotations of spurious correlations in data, which are often expensive to get. In this paper, we tackle an annotation-free setting and propose a self-guided spurious correlation mitigation framework. Our framework automatically constructs fine-grained training labels tailored for a classifier obtained with empirical risk minimization to improve its robustness against spurious correlations. The fine-grained training labels are formulated with different prediction behaviors of the classifier identified in a novel spuriousness embedding space. We construct the space with automatically detected conceptual attributes and a novel spuriousness metric which measures how likely a class-attribute correlation is exploited for predictions. We demonstrate that training the classifier to distinguish different prediction behaviors reduces its reliance on spurious correlations without knowing them a priori and outperforms prior methods on five real-world datasets.

5/7/2024