HDRSplat: Gaussian Splatting for High Dynamic Range 3D Scene Reconstruction from Raw Images

Read original: arXiv:2407.16503 - Published 7/24/2024 by Shreyas Singh, Aryan Garg, Kaushik Mitra

HDRSplat: Gaussian Splatting for High Dynamic Range 3D Scene Reconstruction from Raw Images

Overview

HDRSplat is a novel method for high dynamic range (HDR) 3D scene reconstruction from raw images.
The technique uses Gaussian splatting, which efficiently represents 3D scenes in a compact way.
The resulting reconstructions capture both the geometry and appearance of the scene, including HDR color information.

Plain English Explanation

HDRSplat: Efficient High Dynamic Range 3D Scene Reconstruction from Raw Images is a new approach for creating detailed 3D models of real-world scenes. The key innovation is the use of "Gaussian splatting," which represents the 3D geometry and color information in a compact yet expressive way.

Typically, 3D reconstruction from photos struggles to capture the full range of brightness in a scene - from the darkest shadows to the brightest highlights. HDRSplat solves this by directly incorporating high dynamic range (HDR) color data into the 3D model. This allows the reconstructed scenes to faithfully represent the actual appearance, including subtle details that would be lost in a traditional low dynamic range representation.

The Gaussian splatting technique efficiently encodes the 3D geometry using a sparse set of overlapping "splats" or discs, each with a smoothly varying Gaussian profile. This compact representation captures the underlying surface while requiring far fewer data points than a dense point cloud. The HDR color information is also stored efficiently within this Gaussian splat structure.

Overall, HDRSplat enables high-fidelity 3D reconstructions that preserve the full visual richness of real-world scenes. This has applications in areas like virtual/augmented reality, visual effects, and digital preservation of cultural heritage.

Technical Explanation

HDRSplat: Gaussian Splatting for High Dynamic Range 3D Scene Reconstruction from Raw Images introduces a novel method for 3D scene reconstruction that captures both the geometry and high dynamic range (HDR) appearance of the environment.

The core innovation is the use of Gaussian splatting to compactly represent the 3D scene. Rather than a dense point cloud, the scene is encoded as a sparse set of overlapping Gaussian discs, each with a position, normal, and HDR color. This Gaussian splat representation efficiently captures the underlying surface while requiring far fewer data points.

The reconstruction pipeline first uses multi-view stereo to estimate the scene's 3D geometry from a set of calibrated input images. It then optimizes the Gaussian splat parameters to best fit this geometry while also aligning with the HDR color information extracted from the raw image data.

The resulting HDRSplat representation preserves both the detailed 3D structure and the full range of brightness and color present in the original scene. Experiments demonstrate that HDRSplat achieves state-of-the-art performance on standard benchmarks for HDR reconstruction, outperforming prior methods that rely on traditional low dynamic range data representations.

Critical Analysis

The key strength of HDRSplat is its ability to faithfully capture the full visual richness of real-world scenes, including the high dynamic range of brightness and color. This is a significant advancement over prior 3D reconstruction techniques that struggle to represent the extremes of illumination.

However, the paper acknowledges some limitations of the current approach. For example, the Gaussian splat representation may struggle to accurately model sharp geometric features or thin structures. There is also a tradeoff between the compactness of the representation and the fidelity of the reconstruction.

Additionally, the HDRSplat pipeline relies on accurate camera calibration and multi-view stereo estimation, which can be sensitive to noise or errors in the input data. Further research may be needed to improve the robustness of the reconstruction process.

Overall, HDRSplat represents an important step forward in high-quality 3D scene modeling. By directly incorporating HDR color information, it opens up new possibilities for applications that require photo-realistic virtual environments. Continued refinement of the technique could lead to even more impressive results in the future.

Conclusion

HDRSplat introduces a novel approach for 3D scene reconstruction that captures both the detailed geometry and the full range of brightness and color present in real-world environments. By using an efficient Gaussian splatting representation, the method can compactly encode HDR appearance information alongside the 3D structure.

This advance in 3D modeling has significant implications for applications such as virtual/augmented reality, visual effects, and digital preservation. By preserving the visual richness of the original scenes, HDRSplat enables the creation of highly realistic virtual environments that can enhance immersive experiences and facilitate applications like cultural heritage documentation.

While the current technique has some limitations, the core idea of directly incorporating HDR data into a compact 3D representation is a promising direction for future research. Continued improvements in areas like robustness and fine geometric detail could further expand the capabilities of HDRSplat and drive new breakthroughs in high-fidelity 3D scene reconstruction.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

HDRSplat: Gaussian Splatting for High Dynamic Range 3D Scene Reconstruction from Raw Images

Shreyas Singh, Aryan Garg, Kaushik Mitra

The recent advent of 3D Gaussian Splatting (3DGS) has revolutionized the 3D scene reconstruction space enabling high-fidelity novel view synthesis in real-time. However, with the exception of RawNeRF, all prior 3DGS and NeRF-based methods rely on 8-bit tone-mapped Low Dynamic Range (LDR) images for scene reconstruction. Such methods struggle to achieve accurate reconstructions in scenes that require a higher dynamic range. Examples include scenes captured in nighttime or poorly lit indoor spaces having a low signal-to-noise ratio, as well as daylight scenes with shadow regions exhibiting extreme contrast. Our proposed method HDRSplat tailors 3DGS to train directly on 14-bit linear raw images in near darkness which preserves the scenes' full dynamic range and content. Our key contributions are two-fold: Firstly, we propose a linear HDR space-suited loss that effectively extracts scene information from noisy dark regions and nearly saturated bright regions simultaneously, while also handling view-dependent colors without increasing the degree of spherical harmonics. Secondly, through careful rasterization tuning, we implicitly overcome the heavy reliance and sensitivity of 3DGS on point cloud initialization. This is critical for accurate reconstruction in regions of low texture, high depth of field, and low illumination. HDRSplat is the fastest method to date that does 14-bit (HDR) 3D scene reconstruction in $le$15 minutes/scene ($sim$30x faster than prior state-of-the-art RawNeRF). It also boasts the fastest inference speed at $ge$120fps. We further demonstrate the applicability of our HDR scene reconstruction by showcasing various applications like synthetic defocus, dense depth map extraction, and post-capture control of exposure, tone-mapping and view-point.

7/24/2024

HDRGS: High Dynamic Range Gaussian Splatting

Jiahao Wu, Lu Xiao, Chao Wang, Rui Peng, Kaiqiang Xiong, Ronggang Wang

Recent years have witnessed substantial advancements in the field of 3D reconstruction from 2D images, particularly following the introduction of the neural radiance field (NeRF) technique. However, reconstructing a 3D high dynamic range (HDR) radiance field, which aligns more closely with real-world conditions, from 2D multi-exposure low dynamic range (LDR) images continues to pose significant challenges. Approaches to this issue fall into two categories: grid-based and implicit-based. Implicit methods, using multi-layer perceptrons (MLP), face inefficiencies, limited solvability, and overfitting risks. Conversely, grid-based methods require significant memory and struggle with image quality and long training times. In this paper, we introduce Gaussian Splatting-a recent, high-quality, real-time 3D reconstruction technique-into this domain. We further develop the High Dynamic Range Gaussian Splatting (HDR-GS) method, designed to address the aforementioned challenges. This method enhances color dimensionality by including luminance and uses an asymmetric grid for tone-mapping, swiftly and precisely converting pixel irradiance to color. Our approach improves HDR scene recovery accuracy and integrates a novel coarse-to-fine strategy to speed up model convergence, enhancing robustness against sparse viewpoints and exposure extremes, and preventing local optima. Extensive testing confirms that our method surpasses current state-of-the-art techniques in both synthetic and real-world scenarios. Code will be released at url{https://github.com/WuJH2001/HDRGS}

8/14/2024

HDR-GS: Efficient High Dynamic Range Novel View Synthesis at 1000x Speed via Gaussian Splatting

Yuanhao Cai, Zihao Xiao, Yixun Liang, Minghan Qin, Yulun Zhang, Xiaokang Yang, Yaoyao Liu, Alan Yuille

High dynamic range (HDR) novel view synthesis (NVS) aims to create photorealistic images from novel viewpoints using HDR imaging techniques. The rendered HDR images capture a wider range of brightness levels containing more details of the scene than normal low dynamic range (LDR) images. Existing HDR NVS methods are mainly based on NeRF. They suffer from long training time and slow inference speed. In this paper, we propose a new framework, High Dynamic Range Gaussian Splatting (HDR-GS), which can efficiently render novel HDR views and reconstruct LDR images with a user input exposure time. Specifically, we design a Dual Dynamic Range (DDR) Gaussian point cloud model that uses spherical harmonics to fit HDR color and employs an MLP-based tone-mapper to render LDR color. The HDR and LDR colors are then fed into two Parallel Differentiable Rasterization (PDR) processes to reconstruct HDR and LDR views. To establish the data foundation for the research of 3D Gaussian splatting-based methods in HDR NVS, we recalibrate the camera parameters and compute the initial positions for Gaussian point clouds. Experiments demonstrate that our HDR-GS surpasses the state-of-the-art NeRF-based method by 3.84 and 1.91 dB on LDR and HDR NVS while enjoying 1000x inference speed and only requiring 6.3% training time. Code, models, and recalibrated data will be publicly available at https://github.com/caiyuanhao1998/HDR-GS

5/28/2024

From Chaos to Clarity: 3DGS in the Dark

Zhihao Li, Yufei Wang, Alex Kot, Bihan Wen

Novel view synthesis from raw images provides superior high dynamic range (HDR) information compared to reconstructions from low dynamic range RGB images. However, the inherent noise in unprocessed raw images compromises the accuracy of 3D scene representation. Our study reveals that 3D Gaussian Splatting (3DGS) is particularly susceptible to this noise, leading to numerous elongated Gaussian shapes that overfit the noise, thereby significantly degrading reconstruction quality and reducing inference speed, especially in scenarios with limited views. To address these issues, we introduce a novel self-supervised learning framework designed to reconstruct HDR 3DGS from a limited number of noisy raw images. This framework enhances 3DGS by integrating a noise extractor and employing a noise-robust reconstruction loss that leverages a noise distribution prior. Experimental results show that our method outperforms LDR/HDR 3DGS and previous state-of-the-art (SOTA) self-supervised and supervised pre-trained models in both reconstruction quality and inference speed on the RawNeRF dataset across a broad range of training views. Code can be found in url{https://lizhihao6.github.io/Raw3DGS}.

6/13/2024