Exposure Diffusion: HDR Image Generation by Consistent LDR denoising

Read original: arXiv:2405.14304 - Published 5/24/2024 by Mojtaba Bemana, Thomas Leimkuhler, Karol Myszkowski, Hans-Peter Seidel, Tobias Ritschel

Exposure Diffusion: HDR Image Generation by Consistent LDR denoising

Overview

This paper introduces a novel method called "Exposure Diffusion" for generating high-dynamic range (HDR) images from a single low-dynamic range (LDR) input image.
The key idea is to use a diffusion model to denoise the LDR input in a consistent way across multiple exposures, which allows reconstructing an HDR image.
The method outperforms existing single-image HDR generation approaches and achieves state-of-the-art results on several benchmark datasets.

Plain English Explanation

The paper presents a new way to create high-quality HDR images from a single regular (LDR) photo. Regular cameras can only capture a limited range of brightness, but HDR images can represent a much wider range, closer to what the human eye can see.

The researchers developed a technique called "Exposure Diffusion" that works by first taking the regular photo and then using a machine learning model to "denoise" it. Denoising removes small imperfections and smooths out the image. Crucially, the denoising is done in a consistent way across different brightness levels, which allows reconstructing the full HDR image.

This approach outperforms other single-image HDR generation methods and achieves state-of-the-art results on standard HDR benchmarks. In other words, the HDR images it produces are of higher quality than what previous techniques could create from a single regular photo.

Technical Explanation

The paper introduces a novel method called "Exposure Diffusion" for generating high-dynamic range (HDR) images from a single low-dynamic range (LDR) input. The key idea is to leverage a diffusion model to denoise the LDR input in a consistent way across multiple exposures, which enables the reconstruction of an HDR image.

Specifically, the method first encodes the LDR input into a latent representation. It then applies a denoising diffusion probabilistic model (DDPM) to this latent representation, conditioning the diffusion process on the input exposure. This results in a set of denoised latent representations corresponding to different exposures. Finally, the method decodes these latent representations back into the HDR image.

The consistent denoising across exposures is the crucial innovation that allows the method to outperform existing single-image HDR generation approaches. The authors show state-of-the-art results on several HDR benchmark datasets, demonstrating the effectiveness of their Exposure Diffusion technique.

Critical Analysis

The paper presents a compelling approach for HDR image generation from a single LDR input. The key strength is the use of a diffusion model to denoise the input in a consistent way across exposures, which enables high-quality HDR reconstruction.

However, the paper does not address certain limitations of the method. For example, it is not clear how the approach would handle challenging real-world scenarios, such as scenes with significant motion blur or occlusions. Additionally, the computational complexity of the diffusion-based denoising may limit the practical applicability of the method.

Furthermore, the paper could benefit from a more in-depth analysis of failure cases and potential biases in the model. It would also be valuable to see comparisons to other recent advances in single-image HDR generation, such as those presented in papers like Perceptual Assessment Optimization for High Dynamic Range Image and HDR Imaging of Dynamic Scenes and Events.

Overall, the Exposure Diffusion method represents an interesting and promising direction for HDR image generation, but further research is needed to fully understand its limitations and potential real-world applications.

Conclusion

The paper introduces a novel approach called "Exposure Diffusion" for generating high-quality HDR images from a single LDR input. The key innovation is the use of a diffusion model to denoise the LDR input in a consistent way across multiple exposures, which enables the reconstruction of an HDR image.

The method outperforms existing single-image HDR generation techniques and achieves state-of-the-art results on several benchmark datasets. This suggests that the Exposure Diffusion approach could be a valuable tool for a wide range of applications, from computational photography to image editing and visualization.

While the paper presents a compelling solution, further research is needed to address potential limitations and explore the method's real-world performance. Continued advancements in this area could lead to significant improvements in our ability to capture and manipulate high-quality HDR imagery, with far-reaching implications for various industries and fields of study.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Exposure Diffusion: HDR Image Generation by Consistent LDR denoising

Mojtaba Bemana, Thomas Leimkuhler, Karol Myszkowski, Hans-Peter Seidel, Tobias Ritschel

We demonstrate generating high-dynamic range (HDR) images using the concerted action of multiple black-box, pre-trained low-dynamic range (LDR) image diffusion models. Common diffusion models are not HDR as, first, there is no sufficiently large HDR image dataset available to re-train them, and second, even if it was, re-training such models is impossible for most compute budgets. Instead, we seek inspiration from the HDR image capture literature that traditionally fuses sets of LDR images, called brackets, to produce a single HDR image. We operate multiple denoising processes to generate multiple LDR brackets that together form a valid HDR result. To this end, we introduce an exposure consistency term into the diffusion process to couple the brackets such that they agree across the exposure range they share. We demonstrate HDR versions of state-of-the-art unconditional and conditional as well as restoration-type (LDR2HDR) generative modeling.

5/24/2024

Diffusion-Promoted HDR Video Reconstruction

Yuanshen Guan, Ruikang Xu, Mingde Yao, Ruisheng Gao, Lizhi Wang, Zhiwei Xiong

High dynamic range (HDR) video reconstruction aims to generate HDR videos from low dynamic range (LDR) frames captured with alternating exposures. Most existing works solely rely on the regression-based paradigm, leading to adverse effects such as ghosting artifacts and missing details in saturated regions. In this paper, we propose a diffusion-promoted method for HDR video reconstruction, termed HDR-V-Diff, which incorporates a diffusion model to capture the HDR distribution. As such, HDR-V-Diff can reconstruct HDR videos with realistic details while alleviating ghosting artifacts. However, the direct introduction of video diffusion models would impose massive computational burden. Instead, to alleviate this burden, we first propose an HDR Latent Diffusion Model (HDR-LDM) to learn the distribution prior of single HDR frames. Specifically, HDR-LDM incorporates a tonemapping strategy to compress HDR frames into the latent space and a novel exposure embedding to aggregate the exposure information into the diffusion process. We then propose a Temporal-Consistent Alignment Module (TCAM) to learn the temporal information as a complement for HDR-LDM, which conducts coarse-to-fine feature alignment at different scales among video frames. Finally, we design a Zero-Init Cross-Attention (ZiCA) mechanism to effectively integrate the learned distribution prior and temporal information for generating HDR frames. Extensive experiments validate that HDR-V-Diff achieves state-of-the-art results on several representative datasets.

6/13/2024

Sagiri: Low Dynamic Range Image Enhancement with Generative Diffusion Prior

Baiang Li, Sizhuo Ma, Yanhong Zeng, Xiaogang Xu, Youqing Fang, Zhao Zhang, Jian Wang, Kai Chen

Capturing High Dynamic Range (HDR) scenery using 8-bit cameras often suffers from over-/underexposure, loss of fine details due to low bit-depth compression, skewed color distributions, and strong noise in dark areas. Traditional LDR image enhancement methods primarily focus on color mapping, which enhances the visual representation by expanding the image's color range and adjusting the brightness. However, these approaches fail to effectively restore content in dynamic range extremes, which are regions with pixel values close to 0 or 255. To address the full scope of challenges in HDR imaging and surpass the limitations of current models, we propose a novel two-stage approach. The first stage maps the color and brightness to an appropriate range while keeping the existing details, and the second stage utilizes a diffusion prior to generate content in dynamic range extremes lost during capture. This generative refinement module can also be used as a plug-and-play module to enhance and complement existing LDR enhancement models. The proposed method markedly improves the quality and details of LDR images, demonstrating superior performance through rigorous experimental validation. The project page is at https://sagiri0208.github.io

6/14/2024

Exposure Completing for Temporally Consistent Neural High Dynamic Range Video Rendering

Jiahao Cui, Wei Jiang, Zhan Peng, Zhiyu Pan, Zhiguo Cao

High dynamic range (HDR) video rendering from low dynamic range (LDR) videos where frames are of alternate exposure encounters significant challenges, due to the exposure change and absence at each time stamp. The exposure change and absence make existing methods generate flickering HDR results. In this paper, we propose a novel paradigm to render HDR frames via completing the absent exposure information, hence the exposure information is complete and consistent. Our approach involves interpolating neighbor LDR frames in the time dimension to reconstruct LDR frames for the absent exposures. Combining the interpolated and given LDR frames, the complete set of exposure information is available at each time stamp. This benefits the fusing process for HDR results, reducing noise and ghosting artifacts therefore improving temporal consistency. Extensive experimental evaluations on standard benchmarks demonstrate that our method achieves state-of-the-art performance, highlighting the importance of absent exposure completing in HDR video rendering. The code is available at https://github.com/cuijiahao666/NECHDR.

7/19/2024