Improving Pseudo-labelling and Enhancing Robustness for Semi-Supervised Domain Generalization

Read original: arXiv:2401.13965 - Published 9/26/2024 by Adnan Khan, Mai A. Shaaban, Muhammad Haris Khan

Improving Pseudo-labelling and Enhancing Robustness for Semi-Supervised Domain Generalization

Overview

Proposes a new framework for semi-supervised domain generalization that improves pseudo-labeling and enhances robustness
Aims to address challenges in generalizing models to unseen domains with limited labeled data
Introduces techniques to boost the performance of pseudo-labeling and make models more robust to domain shifts

Plain English Explanation

The paper presents a new approach to semi-supervised domain generalization, which is the task of training models that can perform well on new, unseen domains using limited labeled data.

The key idea is to improve the pseudo-labeling process and make the models more robust to domain shifts. Pseudo-labeling involves using the model's own predictions on unlabeled data to provide additional training signals. The authors propose techniques to make this process more effective.

They also introduce ways to enhance the model's robustness so it can better handle differences between the training and test domains. This includes methods to create and leverage intermediate domains during training.

The overall goal is to develop semi-supervised learning approaches that can generalize well to new, unseen domains even when limited labeled data is available for training.

Technical Explanation

The proposed framework consists of two key components:

Improved Pseudo-labeling: The authors introduce a new pseudo-labeling strategy that leverages both the model's predictions and the known labeled data. This helps generate higher-quality pseudo-labels and improve the model's performance on unlabeled data.
Enhanced Robustness: To make the model more robust to domain shifts, the authors propose techniques to create and leverage intermediate domains during training. This encourages the model to learn representations that are transferable across a wider range of domains.

The paper evaluates the framework on several semi-supervised domain generalization benchmarks and demonstrates significant performance improvements compared to existing methods.

Critical Analysis

The paper provides a comprehensive solution to the challenge of semi-supervised domain generalization, which is an important problem in machine learning with real-world applications. The authors' techniques for improving pseudo-labeling and enhancing model robustness are well-designed and empirically validated.

However, the paper does not address the potential limitations of the proposed framework. For example, it's unclear how the performance of the approach would scale with the complexity of the task or the number of target domains. Additionally, the computational cost and training time of the techniques are not discussed, which could be important factors in practical deployments.

Further research could explore ways to make the framework more efficient and accessible, as well as investigate its generalization to other problem domains beyond the specific benchmarks considered in this paper.

Conclusion

This paper presents a novel framework for semi-supervised domain generalization that advances the state-of-the-art in this important area of machine learning. By improving pseudo-labeling and enhancing model robustness, the proposed techniques can help train models that perform well on new, unseen domains with limited labeled data.

The findings of this research have the potential to enable more flexible and adaptable machine learning systems that can be deployed in diverse real-world settings. As the field of domain generalization continues to evolve, this work contributes valuable insights and techniques that can inform future developments.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

New!Improving Pseudo-labelling and Enhancing Robustness for Semi-Supervised Domain Generalization

Adnan Khan, Mai A. Shaaban, Muhammad Haris Khan

Beyond attaining domain generalization (DG), visual recognition models should also be data-efficient during learning by leveraging limited labels. We study the problem of Semi-Supervised Domain Generalization (SSDG) which is crucial for real-world applications like automated healthcare. SSDG requires learning a cross-domain generalizable model when the given training data is only partially labelled. Empirical investigations reveal that the DG methods tend to underperform in SSDG settings, likely because they are unable to exploit the unlabelled data. Semi-supervised learning (SSL) shows improved but still inferior results compared to fully-supervised learning. A key challenge, faced by the best-performing SSL-based SSDG methods, is selecting accurate pseudo-labels under multiple domain shifts and reducing overfitting to source domains under limited labels. In this work, we propose new SSDG approach, which utilizes a novel uncertainty-guided pseudo-labelling with model averaging (UPLM). Our uncertainty-guided pseudo-labelling (UPL) uses model uncertainty to improve pseudo-labelling selection, addressing poor model calibration under multi-source unlabelled data. The UPL technique, enhanced by our novel model averaging (MA) strategy, mitigates overfitting to source domains with limited labels. Extensive experiments on key representative DG datasets suggest that our method demonstrates effectiveness against existing methods. Our code and chosen labelled data seeds are available on GitHub: https://github.com/Adnan-Khan7/UPLM

9/26/2024

Towards Generalizing to Unseen Domains with Few Labels

Chamuditha Jayanga Galappaththige, Sanoojan Baliah, Malitha Gunawardhana, Muhammad Haris Khan

We approach the challenge of addressing semi-supervised domain generalization (SSDG). Specifically, our aim is to obtain a model that learns domain-generalizable features by leveraging a limited subset of labelled data alongside a substantially larger pool of unlabeled data. Existing domain generalization (DG) methods which are unable to exploit unlabeled data perform poorly compared to semi-supervised learning (SSL) methods under SSDG setting. Nevertheless, SSL methods have considerable room for performance improvement when compared to fully-supervised DG training. To tackle this underexplored, yet highly practical problem of SSDG, we make the following core contributions. First, we propose a feature-based conformity technique that matches the posterior distributions from the feature space with the pseudo-label from the model's output space. Second, we develop a semantics alignment loss to learn semantically-compatible representations by regularizing the semantic structure in the feature space. Our method is plug-and-play and can be readily integrated with different SSL-based SSDG baselines without introducing any additional parameters. Extensive experimental results across five challenging DG benchmarks with four strong SSL baselines suggest that our method provides consistent and notable gains in two different SSDG settings.

5/8/2024

Domain-Guided Weight Modulation for Semi-Supervised Domain Generalization

Chamuditha Jayanaga Galappaththige, Zachary Izzo, Xilin He, Honglu Zhou, Muhammad Haris Khan

Unarguably, deep learning models capable of generalizing to unseen domain data while leveraging a few labels are of great practical significance due to low developmental costs. In search of this endeavor, we study the challenging problem of semi-supervised domain generalization (SSDG), where the goal is to learn a domain-generalizable model while using only a small fraction of labeled data and a relatively large fraction of unlabeled data. Domain generalization (DG) methods show subpar performance under the SSDG setting, whereas semi-supervised learning (SSL) methods demonstrate relatively better performance, however, they are considerably poor compared to the fully-supervised DG methods. Towards handling this new, but challenging problem of SSDG, we propose a novel method that can facilitate the generation of accurate pseudo-labels under various domain shifts. This is accomplished by retaining the domain-level specialism in the classifier during training corresponding to each source domain. Specifically, we first create domain-level information vectors on the fly which are then utilized to learn a domain-aware mask for modulating the classifier's weights. We provide a mathematical interpretation for the effect of this modulation procedure on both pseudo-labeling and model training. Our method is plug-and-play and can be readily applied to different SSL baselines for SSDG. Extensive experiments on six challenging datasets in two different SSDG settings show that our method provides visible gains over the various strong SSL-based SSDG baselines.

9/6/2024

🖼️

MultiMatch: Multi-task Learning for Semi-supervised Domain Generalization

Lei Qi, Hongpeng Yang, Yinghuan Shi, Xin Geng

Domain generalization (DG) aims at learning a model on source domains to well generalize on the unseen target domain. Although it has achieved great success, most of existing methods require the label information for all training samples in source domains, which is time-consuming and expensive in the real-world application. In this paper, we resort to solving the semi-supervised domain generalization (SSDG) task, where there are a few label information in each source domain. To address the task, we first analyze the theory of the multi-domain learning, which highlights that 1) mitigating the impact of domain gap and 2) exploiting all samples to train the model can effectively reduce the generalization error in each source domain so as to improve the quality of pseudo-labels. According to the analysis, we propose MultiMatch, i.e., extending FixMatch to the multi-task learning framework, producing the high-quality pseudo-label for SSDG. To be specific, we consider each training domain as a single task (i.e., local task) and combine all training domains together (i.e., global task) to train an extra task for the unseen test domain. In the multi-task framework, we utilize the independent BN and classifier for each task, which can effectively alleviate the interference from different domains during pseudo-labeling. Also, most of parameters in the framework are shared, which can be trained by all training samples sufficiently. Moreover, to further boost the pseudo-label accuracy and the model's generalization, we fuse the predictions from the global task and local task during training and testing, respectively. A series of experiments validate the effectiveness of the proposed method, and it outperforms the existing semi-supervised methods and the SSDG method on several benchmark DG datasets.

4/30/2024