2S-UDF: A Novel Two-stage UDF Learning Method for Robust Non-watertight Model Reconstruction from Multi-view Images

Read original: arXiv:2303.15368 - Published 4/17/2024 by Junkai Deng, Fei Hou, Xuhui Chen, Wencheng Wang, Ying He

📈

Overview

This paper addresses a central challenge in learning unsigned distance fields (UDF) to reconstruct 3D non-watertight models from multi-view images.
The key problem is formulating a proper way to convert unsigned distance values into volume density, ensuring that the resulting weight function remains unbiased and sensitive to occlusions.
The authors present a novel two-stage algorithm, 2S-UDF, to learn a high-quality UDF from multi-view images.

Plain English Explanation

The paper focuses on a technique called unsigned distance fields (UDF) to reconstruct 3D objects from multiple camera views. UDFs are a way to represent the 3D shape of an object without the need for a watertight (fully enclosed) surface.

One challenge with UDFs is converting the distance values into a density function that can be used for volume rendering. This density function needs to be unbiased, meaning it accurately represents the object's shape, and sensitive to occlusions, so parts of the object that are hidden from view are properly accounted for.

The authors propose a two-stage approach to address this challenge. In the first stage, they use a simple density function that is slightly biased and transparent, but helps with a coarse reconstruction of the object. In the second stage, they refine the geometry and appearance of the object by directly adjusting the weight function used in volume rendering to ensure it is unbiased and occlusion-aware.

By separating the density and weight functions into two stages, the authors' method, called 2S-UDF, is able to achieve stable and robust training, which distinguishes it from existing UDF learning approaches.

The authors evaluate their method on several datasets, including DeepFashion3D, DTU, and BlendedMVS. The results show that 2S-UDF outperforms other UDF learning techniques in both quantitative metrics and visual quality when reconstructing 3D non-watertight models from multi-view images.

Technical Explanation

The paper presents a novel two-stage algorithm, 2S-UDF, for learning a high-quality unsigned distance field (UDF) from multi-view images. The key challenge addressed is formulating a proper way to convert unsigned distance values into volume density, ensuring that the resulting weight function remains unbiased and sensitive to occlusions.

In the first stage, the method applies an easily trainable density function that, while slightly biased and transparent, aids in coarse reconstruction. The subsequent stage then refines the geometry and appearance of the object to achieve a high-quality reconstruction by directly adjusting the weight function used in volume rendering to ensure that it is unbiased and occlusion-aware.

Decoupling density and weight in two stages makes the training of 2S-UDF stable and robust, distinguishing it from existing UDF learning approaches. The authors evaluate their method on the DeepFashion3D, DTU, and BlendedMVS datasets, and the results show superior performance over other UDF learning techniques in reconstructing 3D non-watertight models from multi-view images, both in quantitative metrics and visual quality.

Critical Analysis

The paper presents a well-designed and thoroughly evaluated approach to addressing the challenge of learning high-quality unsigned distance fields (UDFs) for 3D reconstruction from multi-view images. The authors' two-stage strategy of first using a simple density function and then refining the weight function is a clever solution to the problem of formulating an unbiased and occlusion-aware density representation.

One potential limitation of the method is that it may be computationally more expensive than single-stage approaches, as it requires training two separate models. Additionally, the authors do not provide much insight into the specific trade-offs between the first and second stages of their algorithm, or how the hyperparameters governing the transition between the two stages should be tuned.

Further research could explore ways to streamline the two-stage process, perhaps by developing a more integrated approach that can learn the density and weight functions simultaneously. It would also be interesting to see how 2S-UDF performs on a wider range of 3D reconstruction tasks and datasets, as the current evaluation is limited to a few specific benchmarks.

Overall, the 2S-UDF method represents a significant contribution to the field of 3D reconstruction from multi-view images, and the authors' insights into the importance of formulating an appropriate density-to-volume mapping are valuable for researchers working in this area.

Conclusion

This paper presents a novel two-stage algorithm, 2S-UDF, for learning high-quality unsigned distance fields (UDFs) from multi-view images. The key innovation is the decoupling of density and weight functions, which allows the method to achieve stable and robust training, resulting in superior performance compared to other UDF learning techniques.

The authors' insights into the importance of formulating an unbiased and occlusion-aware density representation are a valuable contribution to the field of 3D reconstruction from multi-view images. While the two-stage approach may introduce some computational overhead, the significant improvements in both quantitative and visual quality metrics suggest that 2S-UDF is a promising direction for further research and development in this area.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

📈

2S-UDF: A Novel Two-stage UDF Learning Method for Robust Non-watertight Model Reconstruction from Multi-view Images

Junkai Deng, Fei Hou, Xuhui Chen, Wencheng Wang, Ying He

Recently, building on the foundation of neural radiance field, various techniques have emerged to learn unsigned distance fields (UDF) to reconstruct 3D non-watertight models from multi-view images. Yet, a central challenge in UDF-based volume rendering is formulating a proper way to convert unsigned distance values into volume density, ensuring that the resulting weight function remains unbiased and sensitive to occlusions. Falling short on these requirements often results in incorrect topology or large reconstruction errors in resulting models. This paper addresses this challenge by presenting a novel two-stage algorithm, 2S-UDF, for learning a high-quality UDF from multi-view images. Initially, the method applies an easily trainable density function that, while slightly biased and transparent, aids in coarse reconstruction. The subsequent stage then refines the geometry and appearance of the object to achieve a high-quality reconstruction by directly adjusting the weight function used in volume rendering to ensure that it is unbiased and occlusion-aware. Decoupling density and weight in two stages makes our training stable and robust, distinguishing our technique from existing UDF learning approaches. Evaluations on the DeepFashion3D, DTU, and BlendedMVS datasets validate the robustness and effectiveness of our proposed approach. In both quantitative metrics and visual quality, the results indicate our superior performance over other UDF learning techniques in reconstructing 3D non-watertight models from multi-view images. Our code is available at https://bitbucket.org/jkdeng/2sudf/.

4/17/2024

Details Enhancement in Unsigned Distance Field Learning for High-fidelity 3D Surface Reconstruction

Cheng Xu, Fei Hou, Wencheng Wang, Hong Qin, Zhebin Zhang, Ying He

While Signed Distance Fields (SDF) are well-established for modeling watertight surfaces, Unsigned Distance Fields (UDF) broaden the scope to include open surfaces and models with complex inner structures. Despite their flexibility, UDFs encounter significant challenges in high-fidelity 3D reconstruction, such as non-differentiability at the zero level set, difficulty in achieving the exact zero value, numerous local minima, vanishing gradients, and oscillating gradient directions near the zero level set. To address these challenges, we propose Details Enhanced UDF (DEUDF) learning that integrates normal alignment and the SIREN network for capturing fine geometric details, adaptively weighted Eikonal constraints to address vanishing gradients near the target surface, unconditioned MLP-based UDF representation to relax non-negativity constraints, and a UDF-tailored method for extracting iso-surface with non-constant iso-values. These strategies collectively stabilize the learning process from unoriented point clouds and enhance the accuracy of UDFs. Our computational results demonstrate that DEUDF outperforms existing UDF learning methods in both accuracy and the quality of reconstructed surfaces. We will make the source code publicly available.

6/4/2024

Learning Unsigned Distance Fields from Local Shape Functions for 3D Surface Reconstruction

Jiangbei Hu, Yanggeng Li, Fei Hou, Junhui Hou, Zhebin Zhang, Shengfa Wang, Na Lei, Ying He

Unsigned distance fields (UDFs) provide a versatile framework for representing a diverse array of 3D shapes, encompassing both watertight and non-watertight geometries. Traditional UDF learning methods typically require extensive training on large datasets of 3D shapes, which is costly and often necessitates hyperparameter adjustments for new datasets. This paper presents a novel neural framework, LoSF-UDF, for reconstructing surfaces from 3D point clouds by leveraging local shape functions to learn UDFs. We observe that 3D shapes manifest simple patterns within localized areas, prompting us to create a training dataset of point cloud patches characterized by mathematical functions that represent a continuum from smooth surfaces to sharp edges and corners. Our approach learns features within a specific radius around each query point and utilizes an attention mechanism to focus on the crucial features for UDF estimation. This method enables efficient and robust surface reconstruction from point clouds without the need for shape-specific training. Additionally, our method exhibits enhanced resilience to noise and outliers in point clouds compared to existing methods. We present comprehensive experiments and comparisons across various datasets, including synthetic and real-scanned point clouds, to validate our method's efficacy.

7/2/2024

Learning Unsigned Distance Functions from Multi-view Images with Volume Rendering Priors

Wenyuan Zhang, Kanle Shi, Yu-Shen Liu, Zhizhong Han

Unsigned distance functions (UDFs) have been a vital representation for open surfaces. With different differentiable renderers, current methods are able to train neural networks to infer a UDF by minimizing the rendering errors on the UDF to the multi-view ground truth. However, these differentiable renderers are mainly handcrafted, which makes them either biased on ray-surface intersections, or sensitive to unsigned distance outliers, or not scalable to large scale scenes. To resolve these issues, we present a novel differentiable renderer to infer UDFs more accurately. Instead of using handcrafted equations, our differentiable renderer is a neural network which is pre-trained in a data-driven manner. It learns how to render unsigned distances into depth images, leading to a prior knowledge, dubbed volume rendering priors. To infer a UDF for an unseen scene from multiple RGB images, we generalize the learned volume rendering priors to map inferred unsigned distances in alpha blending for RGB image rendering. Our results show that the learned volume rendering priors are unbiased, robust, scalable, 3D aware, and more importantly, easy to learn. We evaluate our method on both widely used benchmarks and real scenes, and report superior performance over the state-of-the-art methods.

7/24/2024