The Group Robustness is in the Details: Revisiting Finetuning under Spurious Correlations

Read original: arXiv:2407.13957 - Published 7/22/2024 by Tyler LaBonte, John C. Hill, Xinchen Zhang, Vidya Muthukumar, Abhishek Kumar

Overview

Examines how finetuned models behave under spurious correlations, measured by worst-group accuracy (WGA)
Shows that common class-balancing techniques can reduce WGA as training proceeds, and proposes a mixture method
Reports experiments on four well-established benchmarks across vision and language tasks

Plain English Explanation

Machine learning models can sometimes identify patterns in data that are not actually meaningful or causal. These spurious correlations can lead to models that perform well on the training data but generalize poorly to new situations.

This paper revisits how finetuned models behave under spurious correlations. Its main focus is a close look at details that are often taken for granted: how the classes are balanced during finetuning, how large the pretrained model is, and how the learned features differ between groups. These choices interact in surprising ways and can determine how well a model serves minority groups.

The experiments cover four well-established benchmarks across vision and language tasks and measure group robustness: the ability to perform well on every subgroup in the data, including the worst-performing one. Two common class-balancing techniques, mini-batch upsampling and loss upweighting, can cause worst-group accuracy to drop as training proceeds, ending up no better than training without class-balancing. Larger pretrained models generally help, but only in conjunction with appropriate class-balancing.
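The abstract measures group robustness by worst-group accuracy (WGA). The sketch below shows one way to compute it, assuming that a group is a (class label, spurious attribute) pair, which is a common convention on these benchmarks but is not spelled out in this summary. The function name `worst_group_accuracy` and the toy arrays are hypothetical and are not taken from the authors' code.

```python
import numpy as np

def worst_group_accuracy(preds, labels, spurious):
    """Return (WGA, per-group accuracies).

    Assumption: a group is a (class label, spurious attribute) pair.
    """
    preds, labels, spurious = map(np.asarray, (preds, labels, spurious))
    correct = preds == labels
    per_group = {}
    for y in np.unique(labels):
        for s in np.unique(spurious):
            mask = (labels == y) & (spurious == s)
            if mask.any():  # skip empty groups
                per_group[(int(y), int(s))] = float(correct[mask].mean())
    return min(per_group.values()), per_group

# Toy data (made up for illustration): each class has a 4-example
# majority group and a 2-example minority group.
labels   = np.array([0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 1, 1])
spurious = np.array([0, 0, 0, 0, 1, 1, 1, 1, 1, 1, 0, 0])
preds    = np.array([0, 0, 0, 0, 0, 1, 1, 1, 1, 1, 1, 0])

wga, per_group = worst_group_accuracy(preds, labels, spurious)
average = float((preds == labels).mean())
print(f"average accuracy = {average:.3f}, worst-group accuracy = {wga:.3f}")
```

In this toy example the average accuracy looks healthy while the minority groups are right only half the time, which is the kind of gap WGA is meant to expose.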

Technical Explanation

The paper reports three main findings about finetuning under spurious correlations:

Class-Balancing: Mini-batch upsampling and loss upweighting can induce a decrease in worst-group accuracy (WGA) with training epochs, leading to performance no better than without class-balancing. Removing data to create a class-balanced subset is more effective in some scenarios, depending on group structure, and the authors propose a mixture method that can outperform both techniques.
Model Scaling: Scaling pretrained models is generally beneficial for WGA, but only in conjunction with appropriate class-balancing.
Spectral Imbalance: Minority group covariance matrices of the finetuning features incur a larger spectral norm than those of majority groups once conditioned on the classes, which the authors identify as a potential source of group disparities.
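The paper's abstract names three class-balancing techniques: removing data to create a class-balanced subset, mini-batch upsampling, and loss upweighting. The sketch below illustrates a generic version of each on a label array using NumPy. The function names, the inverse-frequency normalization, and the toy labels are assumptions made for illustration; the authors' exact implementation and their mixture method are not reproduced here.

```python
import numpy as np

def class_balanced_subset(labels, rng):
    """Subsetting: keep the same number of examples from every class."""
    labels = np.asarray(labels)
    classes, counts = np.unique(labels, return_counts=True)
    n_keep = counts.min()
    kept = [rng.choice(np.flatnonzero(labels == c), size=n_keep, replace=False)
            for c in classes]
    return np.sort(np.concatenate(kept))

def upsampling_probabilities(labels):
    """Upsampling: per-example sampling probabilities under which
    every class is equally likely to appear in a mini-batch."""
    labels = np.asarray(labels)
    classes, inverse, counts = np.unique(
        labels, return_inverse=True, return_counts=True)
    return 1.0 / (len(classes) * counts[inverse])

def upweighted_loss(per_example_loss, labels):
    """Upweighting: weight each example inversely to its class frequency
    (normalized here so the weights average to 1, an assumed convention)."""
    labels = np.asarray(labels)
    classes, inverse, counts = np.unique(
        labels, return_inverse=True, return_counts=True)
    weights = len(labels) / (len(classes) * counts[inverse])
    return float(np.mean(weights * np.asarray(per_example_loss)))

# Toy labels (made up): 8 examples of class 0 and 2 of class 1.
toy_labels = np.array([0] * 8 + [1] * 2)
rng = np.random.default_rng(0)

subset = class_balanced_subset(toy_labels, rng)
probs = upsampling_probabilities(toy_labels)
batch = rng.choice(len(toy_labels), size=4, p=probs)  # one upsampled mini-batch
loss = upweighted_loss(np.ones(len(toy_labels)), toy_labels)
```

Subsetting discards majority-class data, while upsampling and upweighting keep all of it and instead change how often or how strongly each example counts. The abstract reports that these choices behave differently over the course of training.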

The authors evaluate on four well-established benchmarks spanning vision and language tasks, including CelebA and Waterbirds. The results show more nuanced interactions between modern finetuned models and group robustness than was previously known, with worst-group accuracy depending on details such as the class-balancing technique and the scale of the pretrained model.
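The abstract also points to spectral imbalance in the finetuning features as a potential source of group disparities: minority group covariance matrices have a larger spectral norm than majority groups once conditioned on the classes. The sketch below shows one reading of that diagnostic, assuming features are rows of a matrix and groups are (class label, spurious attribute) pairs compared within the same class. The function name `group_spectral_norms` and the synthetic features are hypothetical and are not results from the paper.

```python
import numpy as np

def group_spectral_norms(features, labels, spurious):
    """Spectral norm of the feature covariance matrix of each
    (class, spurious attribute) group with at least two examples."""
    features = np.asarray(features, dtype=float)
    labels, spurious = np.asarray(labels), np.asarray(spurious)
    norms = {}
    for y in np.unique(labels):
        for s in np.unique(spurious):
            mask = (labels == y) & (spurious == s)
            if mask.sum() < 2:  # covariance needs at least two examples
                continue
            cov = np.atleast_2d(np.cov(features[mask], rowvar=False))
            # ord=2 gives the largest singular value, i.e. the spectral norm
            norms[(int(y), int(s))] = float(np.linalg.norm(cov, ord=2))
    return norms

# Synthetic features (made up): within class 0, the majority group is
# tightly spread and the minority group is widely spread.
majority = np.array([[1.0, 0.0], [-1.0, 0.0]] * 3)   # 6 examples
minority = np.array([[3.0, 0.0], [-3.0, 0.0]])       # 2 examples
features = np.vstack([majority, minority])
labels = np.zeros(len(features), dtype=int)
spurious = np.array([0] * len(majority) + [1] * len(minority))

norms = group_spectral_norms(features, labels, spurious)
print(norms)
```

Comparing the values for groups that share a class label is what "conditioned on the classes" is taken to mean here.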

Critical Analysis

The paper provides a thorough empirical evaluation across vision and language benchmarks. Its explanation for group disparities, spectral imbalance in the finetuning features, is offered as a potential source rather than an established cause. Further research could explore formal guarantees and how this imbalance relates to the underlying causes of spurious correlations.

Additionally, the paper focuses on supervised learning tasks, but it would be interesting to see how these techniques could be extended to unsupervised settings where spurious correlations may also be a concern.

Conclusion

This paper shows that group robustness under finetuning depends on details that are easy to overlook. The choice of class-balancing technique, the scale of the pretrained model, and the spectral properties of the finetuning features all affect worst-group accuracy, and they interact in more nuanced ways than was previously known. As machine learning systems become increasingly prevalent in high-stakes applications, attention to these details will be important for building reliable and trustworthy AI.

Related Papers

The Group Robustness is in the Details: Revisiting Finetuning under Spurious Correlations

Tyler LaBonte, John C. Hill, Xinchen Zhang, Vidya Muthukumar, Abhishek Kumar

Modern machine learning models are prone to over-reliance on spurious correlations, which can often lead to poor performance on minority groups. In this paper, we identify surprising and nuanced behavior of finetuned models on worst-group accuracy via comprehensive experiments on four well-established benchmarks across vision and language tasks. We first show that the commonly used class-balancing techniques of mini-batch upsampling and loss upweighting can induce a decrease in worst-group accuracy (WGA) with training epochs, leading to performance no better than without class-balancing. While in some scenarios, removing data to create a class-balanced subset is more effective, we show this depends on group structure and propose a mixture method which can outperform both techniques. Next, we show that scaling pretrained models is generally beneficial for worst-group accuracy, but only in conjunction with appropriate class-balancing. Finally, we identify spectral imbalance in finetuning features as a potential source of group disparities -- minority group covariance matrices incur a larger spectral norm than majority groups once conditioned on the classes. Our results show more nuanced interactions of modern finetuned models with group robustness than was previously known. Our code is available at https://github.com/tmlabonte/revisiting-finetuning.

7/22/2024

Bias Amplification Enhances Minority Group Performance

Gaotang Li, Jiarui Liu, Wei Hu

Neural networks produced by standard training are known to suffer from poor accuracy on rare subgroups despite achieving high accuracy on average, due to the correlations between certain spurious features and labels. Previous approaches based on worst-group loss minimization (e.g. Group-DRO) are effective in improving worst-group accuracy but require expensive group annotations for all the training samples. In this paper, we focus on the more challenging and realistic setting where group annotations are only available on a small validation set or are not available at all. We propose BAM, a novel two-stage training algorithm: in the first stage, the model is trained using a bias amplification scheme via introducing a learnable auxiliary variable for each training sample; in the second stage, we upweight the samples that the bias-amplified model misclassifies, and then continue training the same model on the reweighted dataset. Empirically, BAM achieves competitive performance compared with existing methods evaluated on spurious correlation benchmarks in computer vision and natural language processing. Moreover, we find a simple stopping criterion based on minimum class accuracy difference that can remove the need for group annotations, with little or no loss in worst-group accuracy. We perform extensive analyses and ablations to verify the effectiveness and robustness of our algorithm in varying class and group imbalance ratios.

4/10/2024

Multigroup Robustness

Lunjia Hu, Charlotte Peale, Judy Hanwen Shen

To address the shortcomings of real-world datasets, robust learning algorithms have been designed to overcome arbitrary and indiscriminate data corruption. However, practical processes of gathering data may lead to patterns of data corruption that are localized to specific partitions of the training dataset. Motivated by critical applications where the learned model is deployed to make predictions about people from a rich collection of overlapping subpopulations, we initiate the study of multigroup robust algorithms whose robustness guarantees for each subpopulation only degrade with the amount of data corruption inside that subpopulation. When the data corruption is not distributed uniformly over subpopulations, our algorithms provide more meaningful robustness guarantees than standard guarantees that are oblivious to how the data corruption and the affected subpopulations are related. Our techniques establish a new connection between multigroup fairness and robustness.

5/2/2024

Out of spuriousity: Improving robustness to spurious correlations without group annotations

Phuong Quynh Le, Jorg Schlotterer, Christin Seifert

Machine learning models are known to learn spurious correlations, i.e., features having strong relations with class labels but no causal relation. Relying on those correlations leads to poor performance in the data groups without these correlations and poor generalization ability. To improve the robustness of machine learning models to spurious correlations, we propose an approach to extract a subnetwork from a fully trained network that does not rely on spurious correlations. The subnetwork is found by the assumption that data points with the same spurious attribute will be close to each other in the representation space when training with ERM, then we employ supervised contrastive loss in a novel way to force models to unlearn the spurious connections. The increase in the worst-group performance of our approach contributes to strengthening the hypothesis that there exists a subnetwork in a fully trained dense network that is responsible for using only invariant features in classification tasks, therefore erasing the influence of spurious features even in the setup of multi spurious attributes and no prior knowledge of attributes labels.

7/23/2024