ODCR: Orthogonal Decoupling Contrastive Regularization for Unpaired Image Dehazing

Read original: arXiv:2404.17825 - Published 4/30/2024 by Zhongze Wang, Haitao Zhao, Jingchao Peng, Lujian Yao, Kaijie Zhao

ODCR: Orthogonal Decoupling Contrastive Regularization for Unpaired Image Dehazing

Overview

The paper proposes a novel technique called Orthogonal Decoupling Contrastive Regularization (ODCR) for unpaired image dehazing.
ODCR aims to decouple the content and atmospheric information in images, allowing for more effective dehazing without paired training data.
The method leverages contrastive learning to preserve perceptual features while removing unwanted atmospheric effects.

Plain English Explanation

In this paper, the researchers introduce a new approach called ODCR to tackle the problem of image dehazing. When taking pictures in hazy or foggy conditions, the final image can appear washed out and unclear due to the presence of atmospheric particles. ODCR seeks to address this by separating the actual content of the image from the unwanted atmospheric information.

The key insight is that by training the model to focus on the essential visual features while ignoring the hazy effects, it can recover a clear, dehazed version of the image without needing matched before-and-after training pairs. This is accomplished through a technique called contrastive learning, which encourages the model to learn representations that emphasize the important visual details while minimizing the impact of the haze.

The result is a dehazing system that can work effectively even when only unpaired hazy and clear images are available during training, rather than requiring the painstaking process of capturing matched image pairs under different weather conditions. This makes the technique more practical and widely applicable for real-world image enhancement tasks.

Technical Explanation

The key components of the ODCR approach are:

Content-Atmosphere Decoupling: The model is designed to separate the content information and atmospheric effects within the input image. This is achieved by leveraging an orthogonal decomposition that disentangles these two factors.
Contrastive Regularization: A contrastive learning objective is applied to encourage the model to learn content representations that are robust to atmospheric perturbations. This helps preserve important visual features while removing the hazy elements.
Unpaired Training: ODCR can be trained using only unpaired hazy and clear images, without requiring the tedious process of capturing matched image pairs under different weather conditions.

The proposed architecture consists of an encoder-decoder network with a unique orthogonal decomposition module. This module splits the input into content and atmospheric components, which are then processed separately and recombined to produce the final dehazed output.

The researchers evaluate ODCR on several standard image dehazing benchmarks and demonstrate significant performance improvements over existing unpaired dehazing methods. The model is able to effectively recover clear, visually pleasing images from hazy inputs without access to paired training data.

Critical Analysis

The paper provides a robust technical evaluation of the ODCR method, highlighting its advantages over prior unpaired dehazing approaches. However, the authors acknowledge some limitations:

The method assumes the content and atmospheric factors are linearly separable, which may not always hold true in complex real-world scenes.
The reliance on contrastive learning means the model performance can be sensitive to the chosen hyperparameters and training data distribution.
While ODCR outperforms other unpaired methods, there may still be a performance gap compared to supervised dehazing techniques that utilize paired training data.

Additionally, the paper does not explore the generalization of ODCR to other image enhancement tasks beyond dehazing. Extending the technique to handle a broader range of atmospheric effects or applying it to other domains could be an avenue for future research.

Conclusion

The ODCR method presented in this paper offers a novel and effective solution for unpaired image dehazing. By decoupling the content and atmospheric information in a principled manner and leveraging contrastive learning, the approach can recover clear, visually pleasing images from hazy inputs without requiring the tedious collection of matched training pairs.

This work represents an important step forward in making image enhancement techniques more practical and widely applicable, as the need for paired data is a significant bottleneck in many real-world scenarios. The insights and techniques introduced in this paper could also inspire further research into unsupervised and self-supervised methods for other image-to-image translation tasks.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

ODCR: Orthogonal Decoupling Contrastive Regularization for Unpaired Image Dehazing

Zhongze Wang, Haitao Zhao, Jingchao Peng, Lujian Yao, Kaijie Zhao

Unpaired image dehazing (UID) holds significant research importance due to the challenges in acquiring haze/clear image pairs with identical backgrounds. This paper proposes a novel method for UID named Orthogonal Decoupling Contrastive Regularization (ODCR). Our method is grounded in the assumption that an image consists of both haze-related features, which influence the degree of haze, and haze-unrelated features, such as texture and semantic information. ODCR aims to ensure that the haze-related features of the dehazing result closely resemble those of the clear image, while the haze-unrelated features align with the input hazy image. To accomplish the motivation, Orthogonal MLPs optimized geometrically on the Stiefel manifold are proposed, which can project image features into an orthogonal space, thereby reducing the relevance between different features. Furthermore, a task-driven Depth-wise Feature Classifier (DWFC) is proposed, which assigns weights to the orthogonal features based on the contribution of each channel's feature in predicting whether the feature source is hazy or clear in a self-supervised fashion. Finally, a Weighted PatchNCE (WPNCE) loss is introduced to achieve the pulling of haze-related features in the output image toward those of clear images, while bringing haze-unrelated features close to those of the hazy input. Extensive experiments demonstrate the superior performance of our ODCR method on UID.

4/30/2024

🖼️

Real-world Image Dehazing with Coherence-based Label Generator and Cooperative Unfolding Network

Chengyu Fang, Chunming He, Fengyang Xiao, Yulun Zhang, Longxiang Tang, Yuelin Zhang, Kai Li, Xiu Li

Real-world Image Dehazing (RID) aims to alleviate haze-induced degradation in real-world settings. This task remains challenging due to the complexities in accurately modeling real haze distributions and the scarcity of paired real-world data. To address these challenges, we first introduce a cooperative unfolding network that jointly models atmospheric scattering and image scenes, effectively integrating physical knowledge into deep networks to restore haze-contaminated details. Additionally, we propose the first RID-oriented iterative mean-teacher framework, termed the Coherence-based Label Generator, to generate high-quality pseudo labels for network training. Specifically, we provide an optimal label pool to store the best pseudo-labels during network training, leveraging both global and local coherence to select high-quality candidates and assign weights to prioritize haze-free regions. We verify the effectiveness of our method, with experiments demonstrating that it achieves state-of-the-art performance on RID tasks. Code will be available at url{https://github.com/cnyvfang/CORUN-Colabator}.

8/23/2024

Addressing Domain Discrepancy: A Dual-branch Collaborative Model to Unsupervised Dehazing

Shuaibin Fan, Minglong Xue, Aoxiang Ning, Senming Zhong

Although synthetic data can alleviate acquisition challenges in image dehazing tasks, it also introduces the problem of domain bias when dealing with small-scale data. This paper proposes a novel dual-branch collaborative unpaired dehazing model (DCM-dehaze) to address this issue. The proposed method consists of two collaborative branches: dehazing and contour constraints. Specifically, we design a dual depthwise separable convolutional module (DDSCM) to enhance the information expressiveness of deeper features and the correlation to shallow features. In addition, we construct a bidirectional contour function to optimize the edge features of the image to enhance the clarity and fidelity of the image details. Furthermore, we present feature enhancers via a residual dense architecture to eliminate redundant features of the dehazing process and further alleviate the domain deviation problem. Extensive experiments on benchmark datasets show that our method reaches the state-of-the-art. This project code will be available at url{https://github.com/Fan-pixel/DCM-dehaze.

7/16/2024

Driving-Video Dehazing with Non-Aligned Regularization for Safety Assistance

Junkai Fan, Jiangwei Weng, Kun Wang, Yijun Yang, Jianjun Qian, Jun Li, Jian Yang

Real driving-video dehazing poses a significant challenge due to the inherent difficulty in acquiring precisely aligned hazy/clear video pairs for effective model training, especially in dynamic driving scenarios with unpredictable weather conditions. In this paper, we propose a pioneering approach that addresses this challenge through a nonaligned regularization strategy. Our core concept involves identifying clear frames that closely match hazy frames, serving as references to supervise a video dehazing network. Our approach comprises two key components: reference matching and video dehazing. Firstly, we introduce a non-aligned reference frame matching module, leveraging an adaptive sliding window to match high-quality reference frames from clear videos. Video dehazing incorporates flow-guided cosine attention sampler and deformable cosine attention fusion modules to enhance spatial multiframe alignment and fuse their improved information. To validate our approach, we collect a GoProHazy dataset captured effortlessly with GoPro cameras in diverse rural and urban road environments. Extensive experiments demonstrate the superiority of the proposed method over current state-of-the-art methods in the challenging task of real driving-video dehazing. Project page.

5/17/2024