Deep Polarization Cues for Single-shot Shape and Subsurface Scattering Estimation

Read original: arXiv:2407.08149 - Published 7/12/2024 by Chenhao Li, Trung Thanh Ngo, Hajime Nagahara

Deep Polarization Cues for Single-shot Shape and Subsurface Scattering Estimation

Overview

This paper presents a novel deep learning approach for estimating the shape and subsurface scattering properties of objects from a single polarization image.
The method leverages the rich information contained in polarization cues to recover both the surface normal and subsurface scattering parameters in a single, end-to-end framework.
The proposed model outperforms state-of-the-art techniques for shape and subsurface scattering estimation on benchmark datasets.

Plain English Explanation

When we look at objects in the real world, the way light interacts with their surfaces can provide valuable information about their three-dimensional shape and the materials they are made of. One way to capture this information is through polarization imaging, which measures how the orientation of light waves changes as they reflect off an object.

The researchers in this paper have developed a new deep learning model that can use the polarization cues in a single image to estimate both the shape (or surface normals) of an object and its subsurface scattering properties. Subsurface scattering refers to the way light penetrates and scatters within semi-transparent materials, which gives them a distinctive appearance.

By combining these two valuable pieces of information - shape and material properties - in a single, end-to-end framework, the model can provide a more complete understanding of the object being observed. This could be useful in a variety of applications, such as 3D imaging of complex specular surfaces, robust depth enhancement via polarization, or high-resolution surface reconstruction of cultural heritage objects.

Technical Explanation

The key innovation in this paper is the development of a deep neural network architecture that can jointly estimate the surface normals and subsurface scattering parameters of an object from a single polarization image. This is in contrast to previous approaches that have typically tackled these two problems separately.

The proposed model, called the Deep Polarization Network (DPNet), takes a polarization image as input and outputs the corresponding surface normals and subsurface scattering coefficients. The network is composed of an encoder-decoder backbone with additional branches to predict the final outputs.

The encoder part of the network learns to extract rich features from the polarization image, while the decoders specialize in predicting the surface normals and scattering parameters, respectively. The authors show that by sharing the encoding layers, the model can effectively leverage the complementary information between these two tasks, leading to improved performance compared to standalone models.

The researchers evaluate their approach on benchmark datasets for both shape estimation and subsurface scattering estimation, demonstrating state-of-the-art results. For example, they achieve significant improvements over previous methods on the Surface Normal Reconstruction using Polarization UNet dataset.

Critical Analysis

One of the key strengths of this work is its ability to jointly estimate both shape and material properties from a single polarization image. This is a challenging task that can provide valuable information for a wide range of applications, from computer vision to computer graphics.

However, the paper does not discuss the limitations of the proposed approach in depth. For example, it is unclear how the model would perform on highly complex or irregular shapes, or on materials with more complex scattering properties. Additionally, the training and evaluation of the model are conducted on relatively controlled, synthetic datasets, and it would be important to assess its performance on real-world, noisy data as well.

Further research could also explore the potential of leveraging polarization cues for other inverse rendering tasks, such as learning large-scale scene reconstruction or robust depth enhancement. By continuing to push the boundaries of what can be learned from polarization information, the field could make significant advances in our understanding and modeling of the physical world.

Conclusion

This paper presents a novel deep learning approach for estimating both the shape and subsurface scattering properties of objects from a single polarization image. By jointly tackling these two inverse rendering tasks in a single, end-to-end framework, the proposed model can provide a more complete understanding of the observed object.

The key technical contribution is the development of the Deep Polarization Network (DPNet), which leverages the rich information contained in polarization cues to recover both surface normals and scattering parameters. The model outperforms state-of-the-art methods on benchmark datasets, highlighting the potential of this approach for a variety of applications in computer vision and computer graphics.

While the paper demonstrates the effectiveness of the proposed method, further research is needed to explore its limitations and potential extensions, such as handling more complex shapes and materials or scaling to larger scenes. By continuing to push the boundaries of what can be learned from polarization information, the field can make significant advances in our understanding and modeling of the physical world.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Deep Polarization Cues for Single-shot Shape and Subsurface Scattering Estimation

Chenhao Li, Trung Thanh Ngo, Hajime Nagahara

In this work, we propose a novel learning-based method to jointly estimate the shape and subsurface scattering (SSS) parameters of translucent objects by utilizing polarization cues. Although polarization cues have been used in various applications, such as shape from polarization (SfP), BRDF estimation, and reflection removal, their application in SSS estimation has not yet been explored. Our observations indicate that the SSS affects not only the light intensity but also the polarization signal. Hence, the polarization signal can provide additional cues for SSS estimation. We also introduce the first large-scale synthetic dataset of polarized translucent objects for training our model. Our method outperforms several baselines from the SfP and inverse rendering realms on both synthetic and real data, as demonstrated by qualitative and quantitative results.

7/12/2024

SS-SfP:Neural Inverse Rendering for Self Supervised Shape from (Mixed) Polarization

Ashish Tiwari, Shanmuganathan Raman

We present a novel inverse rendering-based framework to estimate the 3D shape (per-pixel surface normals and depth) of objects and scenes from single-view polarization images, the problem popularly known as Shape from Polarization (SfP). The existing physics-based and learning-based methods for SfP perform under certain restrictions, i.e., (a) purely diffuse or purely specular reflections, which are seldom in the real surfaces, (b) availability of the ground truth surface normals for direct supervision that are hard to acquire and are limited by the scanner's resolution, and (c) known refractive index. To overcome these restrictions, we start by learning to separate the partially-polarized diffuse and specular reflection components, which we call reflectance cues, based on a modified polarization reflection model and then estimate shape under mixed polarization through an inverse-rendering based self-supervised deep learning framework called SS-SfP, guided by the polarization data and estimated reflectance cues. Furthermore, we also obtain the refractive index as a non-linear least squares solution. Through extensive quantitative and qualitative evaluation, we establish the efficacy of the proposed framework over simple single-object scenes from DeepSfP dataset and complex in-the-wild scenes from SPW dataset in an entirely self-supervised setting. To the best of our knowledge, this is the first learning-based approach to address SfP under mixed polarization in a completely self-supervised framework.

7/15/2024

🖼️

Surface Normal Reconstruction Using Polarization-Unet

F. S. Mortazavi, S. Dajkhosh, M. Saadatseresht

Today, three-dimensional reconstruction of objects has many applications in various fields, and therefore, choosing a suitable method for high resolution three-dimensional reconstruction is an important issue and displaying high-level details in three-dimensional models is a serious challenge in this field. Until now, active methods have been used for high-resolution three-dimensional reconstruction. But the problem of active three-dimensional reconstruction methods is that they require a light source close to the object. Shape from polarization (SfP) is one of the best solutions for high-resolution three-dimensional reconstruction of objects, which is a passive method and does not have the drawbacks of active methods. The changes in polarization of the reflected light from an object can be analyzed by using a polarization camera or locating polarizing filter in front of the digital camera and rotating the filter. Using this information, the surface normal can be reconstructed with high accuracy, which will lead to local reconstruction of the surface details. In this paper, an end-to-end deep learning approach has been presented to produce the surface normal of objects. In this method a benchmark dataset has been used to train the neural network and evaluate the results. The results have been evaluated quantitatively and qualitatively by other methods and under different lighting conditions. The MAE value (Mean-Angular-Error) has been used for results evaluation. The evaluations showed that the proposed method could accurately reconstruct the surface normal of objects with the lowest MAE value which is equal to 18.06 degree on the whole dataset, in comparison to previous physics-based methods which are between 41.44 and 49.03 degree.

6/24/2024

✨

Subsurface Scattering for 3D Gaussian Splatting

Jan-Niklas Dihlmann, Arjun Majumdar, Andreas Engelhardt, Raphael Braun, Hendrik P. A. Lensch

3D reconstruction and relighting of objects made from scattering materials present a significant challenge due to the complex light transport beneath the surface. 3D Gaussian Splatting introduced high-quality novel view synthesis at real-time speeds. While 3D Gaussians efficiently approximate an object's surface, they fail to capture the volumetric properties of subsurface scattering. We propose a framework for optimizing an object's shape together with the radiance transfer field given multi-view OLAT (one light at a time) data. Our method decomposes the scene into an explicit surface represented as 3D Gaussians, with a spatially varying BRDF, and an implicit volumetric representation of the scattering component. A learned incident light field accounts for shadowing. We optimize all parameters jointly via ray-traced differentiable rendering. Our approach enables material editing, relighting and novel view synthesis at interactive rates. We show successful application on synthetic data and introduce a newly acquired multi-view multi-light dataset of objects in a light-stage setup. Compared to previous work we achieve comparable or better results at a fraction of optimization and rendering time while enabling detailed control over material attributes. Project page https://sss.jdihlmann.com/

8/23/2024