Self-degraded contrastive domain adaptation for industrial fault diagnosis with bi-imbalanced data

arXiv:2405.20700

Published 6/3/2024 by Gecheng Chen, Zeyu Yang, Chengwen Luo, Jianqiang Li

Abstract

Modern industrial fault diagnosis tasks often face the combined challenge of distribution discrepancy and bi-imbalance. Existing domain adaptation approaches pay little attention to the prevailing bi-imbalance, leading to poor domain adaptation performance or even negative transfer. In this work, we propose a self-degraded contrastive domain adaptation (Sd-CDA) diagnosis framework to handle the domain discrepancy under the bi-imbalanced data. It first pre-trains the feature extractor via imbalance-aware contrastive learning based on model pruning to learn the feature representation efficiently in a self-supervised manner. Then it forces the samples away from the domain boundary based on supervised contrastive domain adversarial learning (SupCon-DA) and ensures the features generated by the feature extractor are discriminative enough. Furthermore, we propose the pruned contrastive domain adversarial learning (PSupCon-DA) to pay automatically re-weighted attention to the minorities to enhance the performance towards bi-imbalanced data. We show the superiority of the proposed method via two experiments.

Overview

This paper proposes a self-degraded contrastive domain adaptation approach for industrial fault diagnosis with bi-imbalanced data. The paper abbreviates it Sd-CDA; this summary writes SDCDA.
The approach combines model pruning, contrastive learning, and adversarial learning to handle domain shift and data imbalance together.
The SDCDA framework aims to learn robust and transferable feature representations for improved fault diagnosis performance across different industrial domains.

Plain English Explanation

In industrial settings, fault diagnosis is crucial for maintaining equipment and preventing costly downtime. However, collecting comprehensive sensor data across different machines and factories can be challenging. This often leads to domain shift, where the characteristics of the data used to train a model differ from the real-world data it needs to work with. Additionally, the available data may be imbalanced, meaning some fault types are much more common than others, making it difficult for the model to learn to recognize all faults accurately.

The researchers in this paper propose a novel approach called self-degraded contrastive domain adaptation (SDCDA) to address these challenges. The key ideas behind SDCDA are:

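Model Pruning: The feature extractor is pruned ("self-degraded") during contrastive training. According to the abstract, pruning makes the self-supervised pre-training imbalance-aware and, in the later adversarial stage, automatically re-weights attention toward the minority classes.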
Contrastive Learning: The model is trained to learn feature representations that maximize the difference between samples from different fault types, while minimizing the difference between samples of the same fault type. This helps the model better distinguish between different faults.
Adversarial Learning: A domain discriminator is trained alongside the main fault diagnosis model to tell source-domain features from target-domain features, while the feature extractor is trained to fool it. This encourages the model to learn features that are more domain-agnostic and can perform well across different industrial settings.
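
As a rough illustration of the pruning idea, the sketch below zeroes the smallest-magnitude weights of a single layer. The summary does not say which pruning criterion the authors use, so magnitude pruning is an assumption here, and `magnitude_prune`, `W`, and the 50% sparsity level are hypothetical names and values chosen only for the example.

```python
import numpy as np

def magnitude_prune(weights: np.ndarray, sparsity: float) -> np.ndarray:
    """Return a copy of `weights` with the smallest-magnitude fraction set to zero.

    Assumption: magnitude pruning stands in for whatever criterion the paper uses.
    """
    if not 0.0 <= sparsity < 1.0:
        raise ValueError("sparsity must be in [0, 1)")
    k = int(round(sparsity * weights.size))  # number of entries to drop
    if k == 0:
        return weights.copy()
    # The k-th smallest absolute value acts as the cut-off.
    threshold = np.partition(np.abs(weights).ravel(), k - 1)[k - 1]
    mask = np.abs(weights) > threshold
    return np.where(mask, weights, 0.0)

rng = np.random.default_rng(0)
W = rng.normal(size=(4, 8))          # stand-in for one layer of a feature extractor
W_pruned = magnitude_prune(W, sparsity=0.5)
print(np.count_nonzero(W_pruned), "of", W.size, "weights kept")
```

The pruned layer keeps the same shape, so it can be dropped into the same forward pass as the dense one and the two outputs compared sample by sample.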

By combining these techniques, the SDCDA framework aims to overcome the challenges of domain shift and data imbalance to improve the performance of industrial fault diagnosis systems. This could lead to more reliable and cost-effective maintenance strategies for industrial equipment.

Technical Explanation

The SDCDA framework consists of three key components:

Self-Degraded Feature Extractor: This is the core of the model, responsible for learning robust and transferable feature representations. It is gradually pruned during training to force the model to learn more essential features.
Contrastive Classifier: This module is trained to maximize the difference between features of different fault types and minimize the difference between features of the same fault type. This helps the model better discriminate between faults.
Domain Discriminator: This adversarial network is trained to distinguish source-domain features from target-domain features. The feature extractor is trained against it, so that the discriminator can no longer tell the two domains apart, which encourages domain-agnostic representations.
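
The contrastive component can be sketched with the standard supervised contrastive (SupCon) loss, which the abstract names as the basis of SupCon-DA. This is a minimal NumPy version of the generic loss, not the authors' implementation: the function name `supcon_loss`, the temperature of 0.1, and the toy features are assumptions, and the paper's pruning-based re-weighting is not included.

```python
import numpy as np

def supcon_loss(features, labels, temperature=0.1):
    """Generic supervised contrastive loss over one batch (needs at least 2 samples)."""
    features = np.asarray(features, dtype=float)
    labels = np.asarray(labels)
    z = features / np.linalg.norm(features, axis=1, keepdims=True)  # unit-length features
    sim = z @ z.T / temperature
    n = len(labels)
    not_self = ~np.eye(n, dtype=bool)
    # Log-softmax of each anchor's similarities over all *other* samples.
    sim_masked = np.where(not_self, sim, -np.inf)
    row_max = sim_masked.max(axis=1, keepdims=True)
    log_denom = row_max + np.log(np.exp(sim_masked - row_max).sum(axis=1, keepdims=True))
    log_prob = sim - log_denom
    # Positives: other samples that share the anchor's label.
    positives = (labels[:, None] == labels[None, :]) & not_self
    n_pos = positives.sum(axis=1)
    has_pos = n_pos > 0  # anchors without any positive are skipped
    mean_log_prob_pos = np.where(positives, log_prob, 0.0).sum(axis=1)[has_pos] / n_pos[has_pos]
    return float(-mean_log_prob_pos.mean())

labels = np.array([0, 0, 1, 1])
# Same-class features point the same way: the loss should be small.
tight = np.array([[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]])
# Same-class features point in different directions: the loss should be large.
mixed = np.array([[1.0, 0.0], [0.0, 1.0], [0.9, 0.1], [0.1, 0.9]])
loss_tight = supcon_loss(tight, labels)
loss_mixed = supcon_loss(mixed, labels)
print(loss_tight, loss_mixed)
```

Minimizing this loss pulls features of the same fault type together and pushes different fault types apart, which is the behaviour the contrastive component is described as targeting.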

The training process involves alternating between updating the feature extractor and the domain discriminator. The feature extractor is also gradually pruned to further improve its transferability.
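
The alternation can be illustrated with a toy model: a linear feature extractor `W` and a logistic domain discriminator `v`. Everything in this sketch is an assumption made for illustration (the linear models, the learning rate, the synthetic two-domain data); it shows only the generic adversarial alternation and leaves out the contrastive and pruning terms.

```python
import numpy as np

def sigmoid(a):
    return 1.0 / (1.0 + np.exp(-a))

def domain_loss(W, v, x, d):
    """Binary cross-entropy of the discriminator on features x @ W (d: 0=source, 1=target)."""
    p = sigmoid((x @ W) @ v)
    eps = 1e-12
    return float(-np.mean(d * np.log(p + eps) + (1 - d) * np.log(1 - p + eps)))

def domain_grads(W, v, x, d):
    """Analytic gradients of the domain loss with respect to W and v."""
    h = x @ W
    err = (sigmoid(h @ v) - d) / len(d)  # dLoss/dlogit for each sample
    grad_v = h.T @ err
    grad_W = x.T @ np.outer(err, v)
    return grad_W, grad_v

def adversarial_round(W, v, x, d, lr=0.1):
    """One alternation: discriminator step, then feature-extractor step."""
    _, grad_v = domain_grads(W, v, x, d)
    v = v - lr * grad_v                  # discriminator descends the domain loss
    loss_after_disc = domain_loss(W, v, x, d)
    grad_W, _ = domain_grads(W, v, x, d)
    W = W + lr * grad_W                  # extractor ascends it (reversed gradient)
    loss_after_feat = domain_loss(W, v, x, d)
    return W, v, loss_after_disc, loss_after_feat

rng = np.random.default_rng(0)
source = rng.normal(size=(50, 2))
target = rng.normal(size=(50, 2)) + [2.0, 0.0]   # shifted copy plays the target domain
x = np.vstack([source, target])
d = np.concatenate([np.zeros(50), np.ones(50)])
W, v = np.eye(2), np.array([0.1, -0.1])

loss_before = domain_loss(W, v, x, d)
W, v, loss_after_disc, loss_after_feat = adversarial_round(W, v, x, d)
print(loss_before, loss_after_disc, loss_after_feat)
```

The discriminator step lowers the domain loss and the extractor step raises it again. In a full model the extractor update would also carry the classification and contrastive terms, so the features stay discriminative while becoming harder to attribute to a domain.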

The researchers evaluate the SDCDA framework in two experiments and report better performance than other domain adaptation and imbalanced learning techniques. They present these results as evidence that the approach handles domain shift and data imbalance together in industrial settings.

Critical Analysis

The SDCDA framework presents a promising approach to address the critical issues of domain shift and data imbalance in industrial fault diagnosis. The key strengths of the research include:

The integration of model pruning, contrastive learning, and adversarial learning to jointly tackle the challenges of domain adaptation and imbalanced data.
The empirical validation of the SDCDA framework on real-world industrial datasets, demonstrating its practical applicability.
The potential for the SDCDA approach to be extended to other industrial applications beyond fault diagnosis, such as predictive maintenance or object detection.

However, the paper also highlights some limitations and areas for further research:

The SDCDA framework is evaluated on relatively small-scale datasets, and its performance on larger, more complex industrial datasets is yet to be explored.
The pruning strategy used in the feature extractor could be further optimized to strike a better balance between model complexity and transferability.
Extending the SDCDA approach to continual unsupervised domain adaptation, or to bridging the gap between synthetic and real-world data, could improve its applicability and robustness in dynamic industrial environments where the domain keeps shifting.

Overall, the SDCDA approach represents a valuable contribution to the field of industrial fault diagnosis, providing a comprehensive solution to the challenges of domain shift and data imbalance. Further research and real-world deployments could help refine and expand the capabilities of this promising technique.

Conclusion

This paper introduces the self-degraded contrastive domain adaptation (SDCDA) framework as a novel solution for industrial fault diagnosis in the face of domain shift and data imbalance challenges. By integrating model pruning, contrastive learning, and adversarial learning, the SDCDA approach aims to learn robust and transferable feature representations that can perform well across different industrial domains and fault types.

The reported evaluation indicates that the SDCDA framework outperforms existing domain adaptation and imbalanced learning techniques. The approach could improve the reliability and cost-effectiveness of industrial maintenance strategies, and may apply to industrial tasks beyond fault diagnosis.

While the SDCDA framework shows promising results, further research is needed to address its scalability, optimization, and continuous adaptation capabilities to fully realize its impact on the industrial sector. Nonetheless, this work represents an important step forward in addressing the critical challenges faced by industrial fault diagnosis systems.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!
