Latent Disentanglement for Low Light Image Enhancement

Read original: arXiv:2408.06245 - Published 8/13/2024 by Zhihao Zheng, Mooi Choo Chuah

Overview

The paper presents LDE-Net (Latent Disentangle-based Enhancement Network), an approach to low-light image enhancement built on latent disentanglement.
The method separates an image's content and illumination in latent space so that the lighting can be enhanced without corrupting the content.
Key contributions include a latent disentanglement module, a conditional enhancement module, and a dual-path training strategy.

Plain English Explanation

The researchers have developed a new way to improve the quality of images taken in low light. The core idea is to separate two aspects of an image, its lighting (illumination) and its content, and then use that separation to enhance the image.

Breaking the image down in this way makes the enhancement more targeted. For example, the model can brighten the lighting without distorting the content of the image.

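The researchers evaluated the approach on several low-light image enhancement (LLIE) benchmarks and report that it significantly outperforms other state-of-the-art methods in both quantitative and qualitative comparisons. They also applied a lightweight version of the framework to downstream tasks such as nighttime UAV tracking and low-light object detection.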

Technical Explanation

The paper introduces a latent disentanglement module that separates the latent representation of an image into distinct components. The abstract names two, Content and Illumination, and states that the aim is for no corruption to remain in either one. This allows the model to manipulate the illumination independently of the content during enhancement.
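
As a rough illustration of what "separating a latent representation into components" means, the sketch below splits a flat latent vector into named slices and merges them back. This is only a toy: in the paper the separation is learned by a network, and the function names, component names, and sizes used here are assumptions made for the example.

```python
from typing import Dict, List


def split_latent(z: List[float], sizes: Dict[str, int]) -> Dict[str, List[float]]:
    """Split a flat latent vector into named, non-overlapping components."""
    if sum(sizes.values()) != len(z):
        raise ValueError("component sizes must add up to the latent length")
    parts: Dict[str, List[float]] = {}
    start = 0
    for name, size in sizes.items():
        parts[name] = z[start:start + size]
        start += size
    return parts


def merge_latent(parts: Dict[str, List[float]]) -> List[float]:
    """Concatenate the components back into one vector (inverse of split_latent)."""
    return [value for component in parts.values() for value in component]


# Hypothetical 6-dimensional latent code; names and sizes are placeholders.
z = [0.2, -1.0, 0.5, 0.9, 0.1, 0.05]
parts = split_latent(z, {"content": 4, "lighting": 2})
```

Because each component lives in its own slice, one of them can be changed without touching the others, which is the property the disentanglement module is meant to provide.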

The conditional enhancement module then takes the disentangled latent representation and generates the enhanced output image. The abstract describes a Content-Aware Embedding (CAE) module that uses the Content features to direct the enhancement of the Illumination component, so the lighting is corrected while the content is preserved.
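
The idea of enhancing only the lighting part while leaving the content untouched can be sketched as follows. The constant `gain` is a stand-in assumed for this example; the paper's enhancement module is a learned network, and the component names are placeholders.

```python
from typing import Dict, List


def enhance_lighting(parts: Dict[str, List[float]], gain: float) -> Dict[str, List[float]]:
    """Return a copy of the components in which only the lighting part is scaled."""
    enhanced = {name: list(values) for name, values in parts.items()}
    enhanced["lighting"] = [gain * value for value in parts["lighting"]]
    return enhanced


# Hypothetical disentangled components of a dark image.
parts = {"content": [0.2, -1.0, 0.5], "lighting": [0.1, 0.05]}
brighter = enhance_lighting(parts, gain=4.0)
```

The content component of `brighter` is identical to the input, so any change in the decoded image would come from the lighting component alone.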

To train the model, the researchers use a dual-path training strategy. One path optimizes for perceptual similarity to the ground truth, while the other path encourages the disentanglement of the latent representation. This helps the model learn a more effective latent space for low-light image enhancement.
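
A two-term objective of this kind can be sketched as a weighted sum. Everything specific below is an assumption made for illustration: an L1 error stands in for the perceptual similarity term, a squared cosine similarity between two equal-length components stands in for the disentanglement term, and `weight` is a hypothetical balancing factor. The summary does not give the paper's actual losses.

```python
import math
from typing import List


def l1_loss(pred: List[float], target: List[float]) -> float:
    """Mean absolute error; a stand-in here for a perceptual similarity loss."""
    return sum(abs(p - t) for p, t in zip(pred, target)) / len(pred)


def overlap_penalty(a: List[float], b: List[float]) -> float:
    """Squared cosine similarity: 0 when the two components are orthogonal."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return (dot / norm) ** 2 if norm > 0 else 0.0


def total_loss(pred: List[float], target: List[float],
               comp_a: List[float], comp_b: List[float],
               weight: float = 0.1) -> float:
    """Reconstruction term plus a weighted disentanglement term."""
    return l1_loss(pred, target) + weight * overlap_penalty(comp_a, comp_b)


# Orthogonal components add no penalty; identical ones add the full weight.
separated = total_loss([0.5, 0.5], [1.0, 0.0], [1.0, 0.0], [0.0, 1.0])
entangled = total_loss([0.5, 0.5], [1.0, 0.0], [1.0, 0.0], [1.0, 0.0])
```

With the same reconstruction error, the entangled case scores a higher loss than the separated one, so minimizing the sum pushes the components apart while still fitting the target.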

The experiments show that this approach outperforms other state-of-the-art low-light enhancement methods on several LLIE benchmarks, in both quantitative metrics and qualitative visual comparisons.
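
The summary does not name the objective metrics used. PSNR is a common full-reference metric in this area (it is the one quoted in the frequency-disentanglement abstract listed under Related Papers), so as an illustration, here is a minimal PSNR sketch that assumes pixel values normalized to the range 0 to 1 and images given as flat lists.

```python
import math
from typing import List


def psnr(reference: List[float], estimate: List[float], max_value: float = 1.0) -> float:
    """Peak signal-to-noise ratio in dB between two equally sized images."""
    if not reference or len(reference) != len(estimate):
        raise ValueError("images must be non-empty and have the same size")
    mse = sum((r - e) ** 2 for r, e in zip(reference, estimate)) / len(reference)
    if mse == 0:
        return math.inf  # identical images
    return 10.0 * math.log10(max_value ** 2 / mse)


# Hypothetical 4-pixel ground truth and enhanced output.
ground_truth = [0.0, 1.0, 0.5, 0.5]
enhanced = [0.1, 0.9, 0.5, 0.5]
score = psnr(ground_truth, enhanced)  # MSE = 0.005, so about 23.01 dB
```

Higher PSNR means the enhanced image is closer to the ground truth, and a gain of a few dB between methods is usually considered substantial.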

Critical Analysis

The paper presents a well-designed and thoroughly evaluated solution for low-light image enhancement. The latent disentanglement approach is a novel and promising idea that could have broader applications beyond just low-light enhancement.

However, the paper does not discuss potential limitations or edge cases. For example, it's unclear how the model would perform on images with more complex lighting conditions, such as mixed lighting sources or uneven illumination. Additionally, the computational efficiency of the proposed method is not evaluated, which could be an important consideration for real-world applications.

Further research could explore the transferability of the latent disentanglement approach to other image-to-image translation tasks, as well as investigate potential ways to improve the model's robustness and efficiency.

Conclusion

The paper presents a low-light image enhancement method based on latent disentanglement. By separating content and illumination in the latent space, the model can enhance the lighting while preserving the original content of the image. The experiments report significant improvements over state-of-the-art methods on LLIE benchmarks, along with strong results on downstream tasks.

This research contributes to the ongoing efforts in the field of low-light image enhancement and could inspire further advancements in leveraging latent representations for image-to-image translation tasks.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Latent Disentanglement for Low Light Image Enhancement

Zhihao Zheng, Mooi Choo Chuah

Many learning-based low-light image enhancement (LLIE) algorithms are based on the Retinex theory. However, the Retinex-based decomposition techniques in such models introduce corruptions which limit their enhancement performance. In this paper, we propose a Latent Disentangle-based Enhancement Network (LDE-Net) for low light vision tasks. The latent disentanglement module disentangles the input image in latent space such that no corruption remains in the disentangled Content and Illumination components. For LLIE task, we design a Content-Aware Embedding (CAE) module that utilizes Content features to direct the enhancement of the Illumination component. For downstream tasks (e.g. nighttime UAV tracking and low-light object detection), we develop an effective light-weight enhancer based on the latent disentanglement framework. Comprehensive quantitative and qualitative experiments demonstrate that our LDE-Net significantly outperforms state-of-the-art methods on various LLIE benchmarks. In addition, the great results obtained by applying our framework on the downstream tasks also demonstrate the usefulness of our latent disentanglement design.

8/13/2024

Unveiling Advanced Frequency Disentanglement Paradigm for Low-Light Image Enhancement

Kun Zhou, Xinyu Lin, Wenbo Li, Xiaogang Xu, Yuanhao Cai, Zhonghang Liu, Xiaoguang Han, Jiangbo Lu

Previous low-light image enhancement (LLIE) approaches, while employing frequency decomposition techniques to address the intertwined challenges of low frequency (e.g., illumination recovery) and high frequency (e.g., noise reduction), primarily focused on the development of dedicated and complex networks to achieve improved performance. In contrast, we reveal that an advanced disentanglement paradigm is sufficient to consistently enhance state-of-the-art methods with minimal computational overhead. Leveraging the image Laplace decomposition scheme, we propose a novel low-frequency consistency method, facilitating improved frequency disentanglement optimization. Our method, seamlessly integrating with various models such as CNNs, Transformers, and flow-based and diffusion models, demonstrates remarkable adaptability. Noteworthy improvements are showcased across five popular benchmarks, with up to 7.68dB gains on PSNR achieved for six state-of-the-art models. Impressively, our approach maintains efficiency with only 88K extra parameters, setting a new standard in the challenging realm of low-light image enhancement.

9/4/2024

LightenDiffusion: Unsupervised Low-Light Image Enhancement with Latent-Retinex Diffusion Models

Hai Jiang, Ao Luo, Xiaohong Liu, Songchen Han, Shuaicheng Liu

In this paper, we propose a diffusion-based unsupervised framework that incorporates physically explainable Retinex theory with diffusion models for low-light image enhancement, named LightenDiffusion. Specifically, we present a content-transfer decomposition network that performs Retinex decomposition within the latent space instead of image space as in previous approaches, enabling the encoded features of unpaired low-light and normal-light images to be decomposed into content-rich reflectance maps and content-free illumination maps. Subsequently, the reflectance map of the low-light image and the illumination map of the normal-light image are taken as input to the diffusion model for unsupervised restoration with the guidance of the low-light feature, where a self-constrained consistency loss is further proposed to eliminate the interference of normal-light content on the restored results to improve overall visual quality. Extensive experiments on publicly available real-world benchmarks show that the proposed LightenDiffusion outperforms state-of-the-art unsupervised competitors and is comparable to supervised methods while being more generalizable to various scenes. Our code is available at https://github.com/JianghaiSCU/LightenDiffusion.

7/15/2024

Low-Light Video Enhancement via Spatial-Temporal Consistent Illumination and Reflection Decomposition

Xiaogang Xu, Kun Zhou, Tao Hu, Ruixing Wang, Hujun Bao

Low-Light Video Enhancement (LLVE) seeks to restore dynamic and static scenes plagued by severe invisibility and noise. One critical aspect is formulating a consistency constraint specifically for temporal-spatial illumination and appearance enhanced versions, a dimension overlooked in existing methods. In this paper, we present an innovative video Retinex-based decomposition strategy that operates without the need for explicit supervision to delineate illumination and reflectance components. We leverage dynamic cross-frame correspondences for intrinsic appearance and enforce a scene-level continuity constraint on the illumination field to yield satisfactory consistent decomposition results. To further ensure consistent decomposition, we introduce a dual-structure enhancement network featuring a novel cross-frame interaction mechanism. This mechanism can seamlessly integrate with encoder-decoder single-frame networks, incurring minimal additional parameter costs. By supervising different frames simultaneously, this network encourages them to exhibit matching decomposition features, thus achieving the desired temporal propagation. Extensive experiments are conducted on widely recognized LLVE benchmarks, covering diverse scenarios. Our framework consistently outperforms existing methods, establishing a new state-of-the-art (SOTA) performance.

5/27/2024