LightenDiffusion: Unsupervised Low-Light Image Enhancement with Latent-Retinex Diffusion Models

Read original: arXiv:2407.08939 - Published 7/15/2024 by Hai Jiang, Ao Luo, Xiaohong Liu, Songchen Han, Shuaicheng Liu

LightenDiffusion: Unsupervised Low-Light Image Enhancement with Latent-Retinex Diffusion Models

Overview

Proposes a new unsupervised low-light image enhancement method called LightenDiffusion that utilizes diffusion models and Retinex theory
Develops a latent-Retinex diffusion model that estimates latent reflectance and illumination maps to enhance low-light images
Demonstrates state-of-the-art performance on various low-light image enhancement benchmarks without requiring paired training data

Plain English Explanation

LightenDiffusion is a new technique for improving the quality of dim or low-light images. It works by using a special type of machine learning model called a diffusion model, combined with a theory called Retinex that helps understand how light and color work in images.

The key idea is to train the diffusion model to estimate two important components of the image: the reflectance (how much light is being reflected off objects) and the illumination (how much light is shining on the scene). By separately modeling these two factors, the system can then adjust the image to increase the overall brightness and clarity, without losing important details or introducing unwanted artifacts.

Importantly, this process is done in an unsupervised way, meaning the system doesn't require a large dataset of low-light images paired with their enhanced versions. Instead, it learns directly from the low-light images themselves, which makes it more practical to apply in real-world scenarios.

The results show that LightenDiffusion achieves state-of-the-art performance on standard low-light image enhancement benchmarks, producing high-quality outputs that are brighter, clearer, and more visually appealing than what previous methods could achieve. This could have important applications in photography, video, and various computer vision tasks where dealing with low-light conditions is a common challenge.

Technical Explanation

The key innovation of LightenDiffusion is the use of a latent-Retinex diffusion model, which builds on Zero-LED: Zero-Reference Lighting Estimation with Diffusion Models and Di-Retinex: Digital Imaging Retinex Theory for Low-Light Enhancement. The model learns to decompose the input low-light image into reflectance and illumination components in the latent space, and then uses a diffusion process to enhance the illumination while preserving the reflectance.

This approach is inspired by the Retinex theory of human color and lightness perception, which posits that the visual system decomposes the image into reflectance and illumination layers. By mimicking this process in the diffusion model, LightenDiffusion is able to effectively enhance low-light images without requiring paired training data.

The authors also draw inspiration from other recent works on low-light enhancement using diffusion models, such as LightDiff: Surgical Endoscopic Image Low-Light Enhancement with Diffusion Models and Light at Night: A Multi-Condition Diffusion Framework for Unpaired Low-Light Image Enhancement.

The proposed DiLightNet architecture consists of an encoder, a latent-Retinex module, and a diffusion-based decoder. The encoder maps the input image to a latent space, the latent-Retinex module decomposes the latent representation into reflectance and illumination components, and the diffusion-based decoder enhances the illumination while preserving the reflectance.

Through extensive experiments on various low-light image enhancement benchmarks, the authors demonstrate that LightenDiffusion outperforms previous state-of-the-art methods in terms of both objective metrics and subjective visual quality.

Critical Analysis

The paper presents a well-designed and carefully executed study, with a strong theoretical foundation and innovative technical approach. The use of Retinex theory and diffusion models is a novel and promising direction for low-light image enhancement, and the authors have convincingly demonstrated the effectiveness of their method.

One potential limitation of the work is that it focuses on enhancing single low-light images, without considering temporal or video-based approaches. Additionally, the paper does not address potential issues with color accuracy or artifacts introduced by the enhancement process. Further research could explore these areas and investigate the robustness of the method in more diverse real-world scenarios.

It would also be valuable to see an analysis of the computational complexity and runtime performance of LightenDiffusion, as this is an important practical consideration for real-world deployment. Finally, a more detailed exploration of the interpretability and explainability of the latent-Retinex decomposition could shed light on the inner workings of the model and potentially lead to further improvements.

Overall, this paper represents a significant advance in the field of low-light image enhancement and lays the groundwork for future research in this important area.

Conclusion

LightenDiffusion is a novel unsupervised low-light image enhancement method that leverages diffusion models and Retinex theory to effectively improve the brightness and clarity of dim images. By decomposing the input into reflectance and illumination components in the latent space and then selectively enhancing the illumination, the system is able to produce high-quality results without requiring paired training data.

The strong performance demonstrated on various benchmarks suggests that this approach could have a significant impact on a wide range of applications, from photography and videography to computer vision tasks in low-light conditions. As the authors note, further research into the robustness, efficiency, and interpretability of the method could lead to even more advanced and practical solutions for addressing the longstanding challenge of low-light image enhancement.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

LightenDiffusion: Unsupervised Low-Light Image Enhancement with Latent-Retinex Diffusion Models

Hai Jiang, Ao Luo, Xiaohong Liu, Songchen Han, Shuaicheng Liu

In this paper, we propose a diffusion-based unsupervised framework that incorporates physically explainable Retinex theory with diffusion models for low-light image enhancement, named LightenDiffusion. Specifically, we present a content-transfer decomposition network that performs Retinex decomposition within the latent space instead of image space as in previous approaches, enabling the encoded features of unpaired low-light and normal-light images to be decomposed into content-rich reflectance maps and content-free illumination maps. Subsequently, the reflectance map of the low-light image and the illumination map of the normal-light image are taken as input to the diffusion model for unsupervised restoration with the guidance of the low-light feature, where a self-constrained consistency loss is further proposed to eliminate the interference of normal-light content on the restored results to improve overall visual quality. Extensive experiments on publicly available real-world benchmarks show that the proposed LightenDiffusion outperforms state-of-the-art unsupervised competitors and is comparable to supervised methods while being more generalizable to various scenes. Our code is available at https://github.com/JianghaiSCU/LightenDiffusion.

7/15/2024

Retinex-Diffusion: On Controlling Illumination Conditions in Diffusion Models via Retinex Theory

Xiaoyan Xing, Vincent Tao Hu, Jan Hendrik Metzen, Konrad Groh, Sezer Karaoglu, Theo Gevers

This paper introduces a novel approach to illumination manipulation in diffusion models, addressing the gap in conditional image generation with a focus on lighting conditions. We conceptualize the diffusion model as a black-box image render and strategically decompose its energy function in alignment with the image formation model. Our method effectively separates and controls illumination-related properties during the generative process. It generates images with realistic illumination effects, including cast shadow, soft shadow, and inter-reflections. Remarkably, it achieves this without the necessity for learning intrinsic decomposition, finding directions in latent space, or undergoing additional training with new datasets.

7/31/2024

Latent Disentanglement for Low Light Image Enhancement

Zhihao Zheng, Mooi Choo Chuah

Many learning-based low-light image enhancement (LLIE) algorithms are based on the Retinex theory. However, the Retinex-based decomposition techniques in such models introduce corruptions which limit their enhancement performance. In this paper, we propose a Latent Disentangle-based Enhancement Network (LDE-Net) for low light vision tasks. The latent disentanglement module disentangles the input image in latent space such that no corruption remains in the disentangled Content and Illumination components. For LLIE task, we design a Content-Aware Embedding (CAE) module that utilizes Content features to direct the enhancement of the Illumination component. For downstream tasks (e.g. nighttime UAV tracking and low-light object detection), we develop an effective light-weight enhancer based on the latent disentanglement framework. Comprehensive quantitative and qualitative experiments demonstrate that our LDE-Net significantly outperforms state-of-the-art methods on various LLIE benchmarks. In addition, the great results obtained by applying our framework on the downstream tasks also demonstrate the usefulness of our latent disentanglement design.

8/13/2024

Zero-LED: Zero-Reference Lighting Estimation Diffusion Model for Low-Light Image Enhancement

Jinhong He, Minglong Xue, Aoxiang Ning, Chengyun Song

Diffusion model-based low-light image enhancement methods rely heavily on paired training data, leading to limited extensive application. Meanwhile, existing unsupervised methods lack effective bridging capabilities for unknown degradation. To address these limitations, we propose a novel zero-reference lighting estimation diffusion model for low-light image enhancement called Zero-LED. It utilizes the stable convergence ability of diffusion models to bridge the gap between low-light domains and real normal-light domains and successfully alleviates the dependence on pairwise training data via zero-reference learning. Specifically, we first design the initial optimization network to preprocess the input image and implement bidirectional constraints between the diffusion model and the initial optimization network through multiple objective functions. Subsequently, the degradation factors of the real-world scene are optimized iteratively to achieve effective light enhancement. In addition, we explore a frequency-domain based and semantically guided appearance reconstruction module that encourages feature alignment of the recovered image at a fine-grained level and satisfies subjective expectations. Finally, extensive experiments demonstrate the superiority of our approach to other state-of-the-art methods and more significant generalization capabilities. We will open the source code upon acceptance of the paper.

7/10/2024