Overcoming Negative Transfer by Online Selection: Distant Domain Adaptation for Fault Diagnosis

Read original: arXiv:2405.17493 - Published 5/29/2024 by Ziyan Wang, Mohamed Ragab, Wenmian Yang, Min Wu, Sinno Jialin Pan, Jie Zhang, Zhenghua Chen

Overcoming Negative Transfer by Online Selection: Distant Domain Adaptation for Fault Diagnosis

Overview

This paper explores a method for overcoming negative transfer in fault diagnosis of rotating machines through distant domain adaptation.
The proposed approach, called Online Selection (OS), aims to select the most relevant target domain data for adaptation, avoiding negative transfer from irrelevant or distant domains.
The authors demonstrate the effectiveness of OS on several benchmark datasets, showing improved fault diagnosis performance compared to other domain adaptation techniques.

Plain English Explanation

Fault diagnosis is the process of identifying issues or problems in machines, such as those found in rotating equipment like motors or turbines. This is an important task for maintaining the reliability and efficiency of industrial systems.

However, training fault diagnosis models can be challenging, as the data available for a specific machine or system may be limited. Domain adaptation techniques can help overcome this by leveraging data from related but different machines or systems (known as the "target domain") to improve the performance of the diagnosis model.

The key challenge with domain adaptation is avoiding "negative transfer," where the differences between the source and target domains actually reduce the model's performance, rather than improving it. This paper introduces a method called "Online Selection" (OS) that aims to address this problem.

The OS approach selects the most relevant target domain data during the training process, focusing on the aspects that are most similar to the source domain. This helps the model avoid learning features that are specific to irrelevant or distant target domains, which could otherwise lead to negative transfer.

The researchers demonstrate the effectiveness of OS on several benchmark datasets for fault diagnosis, showing that it outperforms other domain adaptation techniques in terms of improving the model's performance on the target domain.

Technical Explanation

The key innovation in this paper is the Online Selection (OS) approach, which aims to overcome the problem of negative transfer in fault diagnosis.

The authors first define a framework for distant domain adaptation, where the source and target domains are significantly different, such as different types of rotating machines. They then propose the OS method, which selects the most relevant target domain data during the training process.

The OS method works as follows:

The model is first trained on the source domain data using a standard deep learning approach.
During training on the target domain data, OS dynamically selects a subset of the target domain samples that are most similar to the source domain. This is done by computing a "relevance score" for each target domain sample, based on its similarity to the source domain.
The model is then trained on this selected subset of the target domain data, allowing it to adapt to the relevant aspects of the target domain while avoiding negative transfer from irrelevant or distant parts of the target domain.

The authors evaluate the OS approach on several benchmark datasets for fault diagnosis, including rotating machinery data from different domains (e.g., fans, pumps, and gearboxes). They compare the performance of OS to other domain adaptation techniques, such as adversarial training and multi-target adaptation.

The results show that the OS method consistently outperforms these other approaches, demonstrating its effectiveness in overcoming negative transfer and improving fault diagnosis performance on the target domain.

Critical Analysis

The OS approach presented in this paper is a promising solution for addressing the problem of negative transfer in fault diagnosis. By dynamically selecting the most relevant target domain data during training, the method is able to avoid learning features that are specific to irrelevant or distant target domains, which can otherwise degrade model performance.

One potential limitation of the OS approach is that it relies on the assumption that there is a meaningful notion of "relevance" between the source and target domains. In practice, this may not always be the case, especially for very distant domains. The authors acknowledge this and suggest that further research is needed to better understand the relationship between domain relevance and the effectiveness of OS.

Additionally, the OS method increases the computational complexity of the training process, as it requires an additional step of computing relevance scores for each target domain sample. This may limit the scalability of the approach, particularly for large-scale or real-time fault diagnosis applications.

Despite these potential limitations, the strong empirical results presented in the paper suggest that the OS method is a valuable contribution to the field of domain adaptation for fault diagnosis. Further research could explore ways to improve the efficiency of the method, as well as investigate its applicability to other types of domain adaptation problems beyond fault diagnosis.

Conclusion

This paper presents a novel approach called Online Selection (OS) for overcoming negative transfer in fault diagnosis of rotating machines through distant domain adaptation. The OS method dynamically selects the most relevant target domain data during training, allowing the model to adapt to the target domain while avoiding the negative effects of irrelevant or distant target domain data.

The authors demonstrate the effectiveness of OS on several benchmark datasets, showing that it outperforms other domain adaptation techniques in improving fault diagnosis performance on the target domain. While the method has some potential limitations, it represents a significant advancement in addressing the challenge of negative transfer in this important application area.

The insights and techniques developed in this work could have broader implications for domain adaptation in other fields, such as computer vision and natural language processing. As researchers continue to explore ways to overcome the limitations of traditional machine learning approaches, the OS method provides a valuable contribution to the growing body of work on domain adaptation and transfer learning.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Overcoming Negative Transfer by Online Selection: Distant Domain Adaptation for Fault Diagnosis

Ziyan Wang, Mohamed Ragab, Wenmian Yang, Min Wu, Sinno Jialin Pan, Jie Zhang, Zhenghua Chen

Unsupervised domain adaptation (UDA) has achieved remarkable success in fault diagnosis, bringing significant benefits to diverse industrial applications. While most UDA methods focus on cross-working condition scenarios where the source and target domains are notably similar, real-world applications often grapple with severe domain shifts. We coin the term `distant domain adaptation problem' to describe the challenge of adapting from a labeled source domain to a significantly disparate unlabeled target domain. This problem exhibits the risk of negative transfer, where extraneous knowledge from the source domain adversely affects the target domain performance. Unfortunately, conventional UDA methods often falter in mitigating this negative transfer, leading to suboptimal performance. In response to this challenge, we propose a novel Online Selective Adversarial Alignment (OSAA) approach. Central to OSAA is its ability to dynamically identify and exclude distant source samples via an online gradient masking approach, focusing primarily on source samples that closely resemble the target samples. Furthermore, recognizing the inherent complexities in bridging the source and target domains, we construct an intermediate domain to act as a transitional domain and ease the adaptation process. Lastly, we develop a class-conditional adversarial adaptation to address the label distribution disparities while learning domain invariant representation to account for potential label distribution disparities between the domains. Through detailed experiments and ablation studies on two real-world datasets, we validate the superior performance of the OSAA method over state-of-the-art methods, underscoring its significant utility in practical scenarios with severe domain shifts.

5/29/2024

Style Adaptation for Domain-adaptive Semantic Segmentation

Ting Li, Jianshu Chao, Deyu An

Unsupervised Domain Adaptation (UDA) refers to the method that utilizes annotated source domain data and unlabeled target domain data to train a model capable of generalizing to the target domain data. Domain discrepancy leads to a significant decrease in the performance of general network models trained on the source domain data when applied to the target domain. We introduce a straightforward approach to mitigate the domain discrepancy, which necessitates no additional parameter calculations and seamlessly integrates with self-training-based UDA methods. Through the transfer of the target domain style to the source domain in the latent feature space, the model is trained to prioritize the target domain style during the decision-making process. We tackle the problem at both the image-level and shallow feature map level by transferring the style information from the target domain to the source domain data. As a result, we obtain a model that exhibits superior performance on the target domain. Our method yields remarkable enhancements in the state-of-the-art performance for synthetic-to-real UDA tasks. For example, our proposed method attains a noteworthy UDA performance of 76.93 mIoU on the GTA->Cityscapes dataset, representing a notable improvement of +1.03 percentage points over the previous state-of-the-art results.

4/26/2024

🤷

Unsupervised Domain Adaptation via Style-Aware Self-intermediate Domain

Lianyu Wang, Meng Wang, Daoqiang Zhang, Huazhu Fu

Unsupervised domain adaptation (UDA) has attracted considerable attention, which transfers knowledge from a label-rich source domain to a related but unlabeled target domain. Reducing inter-domain differences has always been a crucial factor to improve performance in UDA, especially for tasks where there is a large gap between source and target domains. To this end, we propose a novel style-aware feature fusion method (SAFF) to bridge the large domain gap and transfer knowledge while alleviating the loss of class-discriminative information. Inspired by the human transitive inference and learning ability, a novel style-aware self-intermediate domain (SSID) is investigated to link two seemingly unrelated concepts through a series of intermediate auxiliary synthesized concepts. Specifically, we propose a novel learning strategy of SSID, which selects samples from both source and target domains as anchors, and then randomly fuses the object and style features of these anchors to generate labeled and style-rich intermediate auxiliary features for knowledge transfer. Moreover, we design an external memory bank to store and update specified labeled features to obtain stable class features and class-wise style features. Based on the proposed memory bank, the intra- and inter-domain loss functions are designed to improve the class recognition ability and feature compatibility, respectively. Meanwhile, we simulate the rich latent feature space of SSID by infinite sampling and the convergence of the loss function by mathematical theory. Finally, we conduct comprehensive experiments on commonly used domain adaptive benchmarks to evaluate the proposed SAFF, and the experimental results show that the proposed SAFF can be easily combined with different backbone networks and obtain better performance as a plug-in-plug-out module.

8/28/2024

🛸

Open-Set Domain Adaptation for Semantic Segmentation

Seun-An Choe, Ah-Hyung Shin, Keon-Hee Park, Jinwoo Choi, Gyeong-Moon Park

Unsupervised domain adaptation (UDA) for semantic segmentation aims to transfer the pixel-wise knowledge from the labeled source domain to the unlabeled target domain. However, current UDA methods typically assume a shared label space between source and target, limiting their applicability in real-world scenarios where novel categories may emerge in the target domain. In this paper, we introduce Open-Set Domain Adaptation for Semantic Segmentation (OSDA-SS) for the first time, where the target domain includes unknown classes. We identify two major problems in the OSDA-SS scenario as follows: 1) the existing UDA methods struggle to predict the exact boundary of the unknown classes, and 2) they fail to accurately predict the shape of the unknown classes. To address these issues, we propose Boundary and Unknown Shape-Aware open-set domain adaptation, coined BUS. Our BUS can accurately discern the boundaries between known and unknown classes in a contrastive manner using a novel dilation-erosion-based contrastive loss. In addition, we propose OpenReMix, a new domain mixing augmentation method that guides our model to effectively learn domain and size-invariant features for improving the shape detection of the known and unknown classes. Through extensive experiments, we demonstrate that our proposed BUS effectively detects unknown classes in the challenging OSDA-SS scenario compared to the previous methods by a large margin. The code is available at https://github.com/KHU-AGI/BUS.

5/31/2024