Personalized Video Relighting With an At-Home Light Stage

Read original: arXiv:2311.08843 - Published 9/30/2024 by Jun Myeong Choi, Max Christman, Roni Sengupta

Overview

Researchers develop a personalized video relighting algorithm that produces high-quality, temporally consistent relit videos in real time.
Existing relighting algorithms rely on either publicly available synthetic data, which yields poor results, or light stage data, which is difficult to acquire.
The proposed approach uses recordings of a user watching YouTube videos on a monitor to train a personalized relighting algorithm.
The key contribution is a novel image-based neural relighting architecture that separates intrinsic appearance features from source lighting, then combines them with target lighting to generate relit images.
Smoothing the intrinsic appearance features over time yields temporally stable video relighting that improves on state-of-the-art approaches in both image quality and temporal consistency.

Plain English Explanation

The researchers have created a new way to change the lighting in videos of a person's face without special equipment. Relighting models are normally trained on either synthetic data, which gives poor results, or data from expensive light stage setups, which is hard to collect.

Instead, the researchers found they could train their relighting algorithm just by having the person watch some YouTube videos on a regular computer monitor. The algorithm learns about the person's face and how it looks under different lighting conditions.
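One way to picture the training data: while the user watches a video, the monitor acts as the light source, so each monitor frame can be reduced to a coarse grid of light colors and paired with the webcam frame captured at the same moment. The sketch below shows only that reduction step, using plain average pooling. The function name `monitor_to_lighting`, the grid size, and the pooling choice are illustrative assumptions, not details taken from the paper.

```python
from typing import List, Tuple

Pixel = Tuple[float, float, float]   # RGB values in [0, 1]
Image = List[List[Pixel]]            # rows of pixels


def monitor_to_lighting(frame: Image, rows: int, cols: int) -> Image:
    """Average-pool a monitor frame into a rows x cols grid of RGB light values.

    Hypothetical helper: the grid resolution and the pooling are assumptions
    made for illustration only.
    """
    height, width = len(frame), len(frame[0])
    if rows > height or cols > width:
        raise ValueError("lighting grid must not exceed the frame resolution")
    lighting: Image = []
    for r in range(rows):
        y0, y1 = r * height // rows, (r + 1) * height // rows
        row = []
        for c in range(cols):
            x0, x1 = c * width // cols, (c + 1) * width // cols
            block = [frame[y][x] for y in range(y0, y1) for x in range(x0, x1)]
            n = len(block)
            # Mean of each color channel over the block.
            row.append(tuple(sum(p[ch] for p in block) / n for ch in range(3)))
        lighting.append(row)
    return lighting


# Toy monitor frame: left half red, right half blue.
red, blue = (1.0, 0.0, 0.0), (0.0, 0.0, 1.0)
toy_frame = [[red, red, blue, blue] for _ in range(2)]
toy_lighting = monitor_to_lighting(toy_frame, rows=1, cols=2)
```

For the toy frame, the pooled grid has one red cell and one blue cell, which is the kind of low-resolution lighting description a relighting network could take as input alongside the face image.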

The key to this is a new neural network architecture that can separate the important features of the person's face - things like their facial structure and skin reflectance - from the lighting in the original video. It then combines this information with the new lighting that the user wants, to generate a new video in which the person's face appears lit by that new lighting.

This allows for smooth, consistent relighting of the person's face in the video, which is better than what current methods can do. The researchers show their approach works well on both casually captured videos and more controlled light stage data.

Technical Explanation

The researchers developed a personalized video relighting algorithm that can generate high-quality, temporally consistent relit videos in real time. Existing relighting approaches typically rely on either publicly available synthetic data, which yields poor results, or on actual light stage data, which is difficult to acquire.

The key contribution is a novel image-based neural relighting architecture that effectively separates the intrinsic appearance features - the geometry and reflectance of the face - from the source lighting, and then combines them with the target lighting to generate a relit image. This enables temporally stable video relighting by smoothing the intrinsic appearance features.
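The data flow described above can be sketched as three steps: encode each frame into lighting-independent features, smooth those features over time, then decode them with the target lighting. The code below is a minimal sketch under stated assumptions. `encode` and `decode` are hypothetical stand-ins for the trained networks, and the exponential moving average is an assumed smoothing choice, since the summary does not say which filter the authors use.

```python
from typing import Any, Callable, List, Sequence

Vector = List[float]


def smooth_features(features: Sequence[Vector], alpha: float = 0.8) -> List[Vector]:
    """Exponential moving average over per-frame feature vectors.

    alpha is the weight given to the previous smoothed value; this filter
    is an assumption, not the paper's stated method.
    """
    smoothed: List[Vector] = []
    for feat in features:
        if not smoothed:
            smoothed.append(list(feat))  # first frame passes through unchanged
        else:
            prev = smoothed[-1]
            smoothed.append([alpha * p + (1 - alpha) * f for p, f in zip(prev, feat)])
    return smoothed


def relight_video(
    frames: Sequence[Any],
    encode: Callable[[Any], Vector],        # hypothetical: frame -> intrinsic features
    decode: Callable[[Vector, Any], Any],   # hypothetical: (features, lighting) -> relit frame
    target_lighting: Any,
    alpha: float = 0.8,
) -> List[Any]:
    """Encode every frame, smooth the intrinsic features, decode under the target light."""
    intrinsic = [encode(frame) for frame in frames]
    stable = smooth_features(intrinsic, alpha)
    return [decode(feat, target_lighting) for feat in stable]


# Toy stand-ins: a "frame" is one number and relighting is a multiplication.
toy_relit = relight_video(
    frames=[0.0, 1.0],
    encode=lambda frame: [frame],
    decode=lambda feat, light: feat[0] * light,
    target_lighting=2.0,
    alpha=0.5,
)
```

Smoothing in feature space rather than on the output pixels is the design point: the target lighting can still change sharply from frame to frame while the estimated face geometry and reflectance stay steady.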

Both qualitative and quantitative evaluations show that this architecture improves portrait image relighting quality and temporal consistency over state-of-the-art approaches, on both casually captured "Light Stage at Your Desk" (LSYD) and light-stage-captured "One Light At a Time" (OLAT) datasets.
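For context on the OLAT data: a "One Light At a Time" capture photographs the subject once per light, with only that light switched on. Because light transport is linear, the image under any mix of those lights is a weighted sum of the OLAT images. The sketch below illustrates that standard image-based relighting identity on flattened toy images. It is not the paper's evaluation code, and `relight_from_olat` is a hypothetical name.

```python
from typing import List, Sequence


def relight_from_olat(
    olat_images: Sequence[Sequence[float]],
    light_weights: Sequence[float],
) -> List[float]:
    """Weighted sum of OLAT images.

    olat_images[i] is a flattened image captured with only light i on;
    light_weights[i] is the desired intensity of light i.
    """
    if len(olat_images) != len(light_weights):
        raise ValueError("need exactly one weight per OLAT image")
    num_pixels = len(olat_images[0])
    relit = [0.0] * num_pixels
    for image, weight in zip(olat_images, light_weights):
        if len(image) != num_pixels:
            raise ValueError("all OLAT images must have the same size")
        for i, value in enumerate(image):
            relit[i] += weight * value
    return relit


# Two lights, two pixels: light 0 only reaches pixel 0, light 1 only pixel 1.
toy_olat = [[1.0, 0.0], [0.0, 2.0]]
toy_result = relight_from_olat(toy_olat, light_weights=[0.5, 0.25])
```

This is why OLAT captures are useful as ground truth: a reference image for any target lighting can be composed from them and compared against a network's output.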

Critical Analysis

The paper addresses an important challenge in video editing and visual effects - the ability to realistically change the lighting in a video of a person's face. The researchers' approach of using casually captured data to train a personalized relighting model is novel and promising, as it avoids the limitations of synthetic or specialized light stage data.

However, the paper does not discuss potential limitations or caveats of the proposed method. For example, it's unclear how well the approach would generalize to a wide range of subjects, lighting conditions, or video capture settings beyond the specific datasets used. Additionally, the computational requirements and real-time performance of the relighting algorithm are not thoroughly examined.

Further research could explore the robustness and versatility of the relighting architecture, as well as potential extensions to handle more complex scenes or lighting environments. Comparisons to alternative approaches, such as learning-based or optimization-based relighting methods, could also provide valuable insights.

Conclusion

The researchers have presented a novel personalized video relighting algorithm that can generate high-quality, temporally consistent relit videos in real time using only casually captured data. This approach represents an important step forward in making sophisticated visual effects more accessible and practical for a wide range of applications, from video production to virtual reality and beyond.

While the paper does not address all potential limitations, the core technical contributions and promising results demonstrate the value of this research and its potential to inspire further advancements in the field of computational photography and image-based rendering.

Related Papers

Personalized Video Relighting With an At-Home Light Stage

Jun Myeong Choi, Max Christman, Roni Sengupta

In this paper, we develop a personalized video relighting algorithm that produces high-quality and temporally consistent relit videos under any pose, expression, and lighting condition in real-time. Existing relighting algorithms typically rely either on publicly available synthetic data, which yields poor relighting results, or on actual light stage data which is difficult to acquire. We show that by just capturing recordings of a user watching YouTube videos on a monitor we can train a personalized algorithm capable of performing high-quality relighting under any condition. Our key contribution is a novel image-based neural relighting architecture that effectively separates the intrinsic appearance features - the geometry and reflectance of the face - from the source lighting and then combines them with the target lighting to generate a relit image. This neural architecture enables smoothing of intrinsic appearance features leading to temporally stable video relighting. Both qualitative and quantitative evaluations show that our architecture improves portrait image relighting quality and temporal consistency over state-of-the-art approaches on both casually captured 'Light Stage at Your Desk' (LSYD) and light-stage-captured 'One Light At a Time' (OLAT) datasets.

9/30/2024

Lite2Relight: 3D-aware Single Image Portrait Relighting

Pramod Rao, Gereon Fox, Abhimitra Meka, Mallikarjun B R, Fangneng Zhan, Tim Weyrich, Bernd Bickel, Hanspeter Pfister, Wojciech Matusik, Mohamed Elgharib, Christian Theobalt

Achieving photorealistic 3D view synthesis and relighting of human portraits is pivotal for advancing AR/VR applications. Existing methodologies in portrait relighting demonstrate substantial limitations in terms of generalization and 3D consistency, coupled with inaccuracies in physically realistic lighting and identity preservation. Furthermore, personalization from a single view is difficult to achieve and often requires multiview images during the testing phase or involves slow optimization processes. This paper introduces Lite2Relight, a novel technique that can predict 3D consistent head poses of portraits while performing physically plausible light editing at interactive speed. Our method uniquely extends the generative capabilities and efficient volumetric representation of EG3D, leveraging a lightstage dataset to implicitly disentangle face reflectance and perform relighting under target HDRI environment maps. By utilizing a pre-trained geometry-aware encoder and a feature alignment module, we map input images into a relightable 3D space, enhancing them with a strong face geometry and reflectance prior. Through extensive quantitative and qualitative evaluations, we show that our method outperforms the state-of-the-art methods in terms of efficacy, photorealism, and practical application. This includes producing 3D-consistent results of the full head, including hair, eyes, and expressions. Lite2Relight paves the way for large-scale adoption of photorealistic portrait editing in various domains, offering a robust, interactive solution to a previously constrained problem. Project page: https://vcai.mpi-inf.mpg.de/projects/Lite2Relight/

7/16/2024

Relightable Neural Actor with Intrinsic Decomposition and Pose Control

Diogo Luvizon, Vladislav Golyanik, Adam Kortylewski, Marc Habermann, Christian Theobalt

Creating a controllable and relightable digital avatar from multi-view video with fixed illumination is a very challenging problem since humans are highly articulated, creating pose-dependent appearance effects, and skin as well as clothing require space-varying BRDF modeling. Existing works on creating animatable avatars either do not focus on relighting at all, require controlled illumination setups, or try to recover a relightable avatar from very low cost setups, i.e. a single RGB video, at the cost of severely limited result quality, e.g. shadows not even being modeled. To address this, we propose Relightable Neural Actor, a new video-based method for learning a pose-driven neural human model that can be relighted, allows appearance editing, and models pose-dependent effects such as wrinkles and self-shadows. Importantly, for training, our method solely requires a multi-view recording of the human under a known, but static lighting condition. To tackle this challenging problem, we leverage an implicit geometry representation of the actor with a drivable density field that models pose-dependent deformations and derive a dynamic mapping between 3D and UV spaces, where normal, visibility, and materials are effectively encoded. To evaluate our approach in real-world scenarios, we collect a new dataset with four identities recorded under different light conditions, indoors and outdoors, providing the first benchmark of its kind for human relighting, and demonstrating state-of-the-art relighting results for novel human poses.

7/29/2024

Relighting from a Single Image: Datasets and Deep Intrinsic-based Architecture

Yixiong Yang, Hassan Ahmed Sial, Ramon Baldrich, Maria Vanrell

Single image scene relighting aims to generate a realistic new version of an input image so that it appears to be illuminated by a new target light condition. Although existing works have explored this problem from various perspectives, generating relit images under arbitrary light conditions remains highly challenging, and related datasets are scarce. Our work addresses this problem from both the dataset and methodological perspectives. We propose two new datasets: a synthetic dataset with the ground truth of intrinsic components and a real dataset collected under laboratory conditions. These datasets alleviate the scarcity of existing datasets. To incorporate physical consistency in the relighting pipeline, we establish a two-stage network based on intrinsic decomposition, giving outputs at intermediate steps, thereby introducing physical constraints. When the training set lacks ground truth for intrinsic decomposition, we introduce an unsupervised module to ensure that the intrinsic outputs are satisfactory. Our method outperforms the state-of-the-art methods in performance, as tested on both existing datasets and our newly developed datasets. Furthermore, pretraining our method or other prior methods using our synthetic dataset can enhance their performance on other datasets. Since our method can accommodate any light conditions, it is capable of producing animated results. The dataset, method, and videos are publicly available.

9/30/2024