Zero Grads: Learning Local Surrogate Losses for Non-Differentiable Graphics

Read original: arXiv:2308.05739 - Published 5/8/2024 by Michael Fischer, Tobias Ritschel

Overview

Gradient-based optimization is widely used in graphics, but it cannot be applied to problems with undefined or zero gradients.
The paper proposes a framework called ZeroGrads that automates the process of learning a neural approximation (surrogate) of the objective function, allowing differentiation through arbitrary black-box graphics pipelines.
The surrogate is trained on an actively smoothed version of the objective, focusing its capacity on what matters at the current training episode.
The fitting is performed online, alongside the parameter optimization, and is self-supervised, without pre-computed data or pre-trained models.
The approach scales well to higher dimensions, demonstrated on problems with up to 35,000 interlinked variables.

Plain English Explanation

Gradient-based optimization is a powerful technique used in many graphics applications, but it has a limitation: it cannot be used when the objective function has undefined or zero gradients. To work around this issue, the researchers developed a framework called ZeroGrads that automatically learns a neural network to approximate the objective function. This neural network, or "surrogate," can then be used to calculate gradients, even for complex, black-box graphics pipelines.
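
To make the zero-gradient problem concrete, here is a minimal sketch. The `step_loss` function and its threshold are hypothetical toy choices, not taken from the paper. It shows that a finite-difference gradient on a flat plateau carries no information about where the minimum lies.

```python
def step_loss(theta: float) -> float:
    """Toy objective: 1 until theta reaches the threshold 2.0, then 0."""
    return 0.0 if theta >= 2.0 else 1.0


def finite_difference(f, theta: float, h: float = 1e-4) -> float:
    """Central finite-difference estimate of df/dtheta."""
    return (f(theta + h) - f(theta - h)) / (2.0 * h)


# At theta = 0.5, far from the threshold, both probes land on the plateau,
# so the estimated gradient is exactly zero and gradient descent cannot move.
grad_at_start = finite_difference(step_loss, 0.5)
print(grad_at_start)  # 0.0
```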

The key idea is to train the surrogate on a smoothed version of the original objective function, focusing its capacity on the most relevant parts of the function for the current optimization task. This training happens online, alongside the main parameter optimization, without requiring any pre-computed data or pre-trained models. Since evaluating the original objective function can be computationally expensive (e.g., running a full simulation or rendering), the researchers also devised an efficient sampling scheme to make the optimization process tractable.
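
The effect of smoothing can be sketched as follows. This example assumes a Gaussian smoothing kernel and a Monte Carlo estimate, and all names and constants (`step_loss`, `smoothed_loss`, `sigma`, the threshold) are illustrative rather than taken from the paper. The smoothed objective is the average of the original objective over random perturbations of the parameter, so it starts to decrease before the parameter reaches the step.

```python
import random


def step_loss(theta: float) -> float:
    """Toy objective: 1 until theta reaches the threshold 2.0, then 0."""
    return 0.0 if theta >= 2.0 else 1.0


def smoothed_loss(f, theta: float, sigma: float = 1.0,
                  n_samples: int = 2000, seed: int = 0) -> float:
    """Monte Carlo estimate of E[f(theta + eps)] with eps ~ N(0, sigma^2)."""
    rng = random.Random(seed)  # fixed seed: same perturbations for every theta
    total = sum(f(theta + rng.gauss(0.0, sigma)) for _ in range(n_samples))
    return total / n_samples


# The raw loss is 1.0 at both points, but the smoothed loss is lower at the
# point closer to the threshold, which gives an optimizer a direction to follow.
far = smoothed_loss(step_loss, 0.5)
near = smoothed_loss(step_loss, 1.5)
print(far, near)
```

Each call to `smoothed_loss` here costs `n_samples` evaluations of the objective, which is why the number of samples matters when one evaluation is a full render or simulation.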

The ZeroGrads framework allows researchers and developers to optimize diverse, non-convex, and non-differentiable problems in graphics, such as visibility in rendering, discrete parameter spaces in procedural modeling, or optimal control in physics-driven animation. Compared to other derivative-free optimization algorithms, this approach scales well to higher-dimensional problems, handling up to 35,000 interlinked variables.

Technical Explanation

The ZeroGrads paper proposes a framework for optimizing non-convex, non-differentiable black-box problems in graphics by learning a neural surrogate of the objective function. This allows the use of gradient-based optimization techniques, even when the original objective has undefined or zero gradients.
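
In symbols, the idea can be written as the update below. The notation is an assumption made for this summary and may differ from the paper's: $f$ is the black-box objective, $h_\phi$ the neural surrogate with weights $\phi$, $\theta_t$ the parameters at step $t$, and $\eta$ the step size.

```latex
\theta_{t+1} = \theta_t - \eta \, \nabla_\theta h_\phi(\theta_t),
\qquad h_\phi(\theta) \approx f(\theta) \ \text{for } \theta \text{ near } \theta_t
```

The gradient is taken through the surrogate $h_\phi$, which is differentiable by construction, rather than through $f$ itself.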

The surrogate is trained on an actively smoothed version of the objective, encouraging locality and focusing the surrogate's capacity on what matters at the current training episode. The fitting is performed online, alongside the parameter optimization, and is self-supervised, without pre-computed data or pre-trained models. Related lines of work include adaptive gradient-enhanced Gaussian process surrogates and universal online learning of gradient variations in multi-layer models.
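
The loop described above can be sketched in a few lines. This is a simplified stand-in, not the authors' implementation: it replaces the neural surrogate with a linear model that is refit by least squares at every iteration, and the objective, sample count, step size, and Gaussian sampling are all assumptions chosen for illustration.

```python
import random


def black_box_loss(theta: float) -> float:
    """Toy non-differentiable objective: 0 inside [1.5, 2.5], 1 elsewhere."""
    return 0.0 if abs(theta - 2.0) <= 0.5 else 1.0


def fit_local_linear(xs, ys) -> float:
    """Least-squares fit of y ~ a + b * x; returns the slope b."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    var_x = sum((x - mean_x) ** 2 for x in xs)
    cov_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return cov_xy / var_x if var_x > 0.0 else 0.0


def optimize(theta=0.0, sigma=1.0, n_samples=64, lr=0.5, steps=200, seed=0):
    """Alternates surrogate fitting and parameter updates."""
    rng = random.Random(seed)
    for _ in range(steps):
        # 1. Sample the black box around the current parameter (locality).
        xs = [theta + rng.gauss(0.0, sigma) for _ in range(n_samples)]
        ys = [black_box_loss(x) for x in xs]
        # 2. Fit the surrogate to these samples (self-supervised, no stored data).
        slope = fit_local_linear(xs, ys)
        # 3. Step along the surrogate's gradient instead of the true gradient.
        theta -= lr * slope
    return theta


theta_final = optimize()
print(theta_final, black_box_loss(theta_final))
```

With Gaussian samples, the slope of the fitted line is an estimate of the gradient of the Gaussian-smoothed objective, so the parameter moves off the plateau even though the raw objective is flat at the starting point.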

To make the optimization process tractable, the researchers devise an efficient sampling scheme that allows for competitive performance at little overhead. This is important because evaluating the original objective function can be computationally expensive, requiring a full rendering or simulator run.
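
The trade-off behind any such scheme can be illustrated with a toy experiment. This sketch does not reproduce the paper's sampling scheme, which this summary does not detail. It only shows, under assumed settings (a hypothetical 1-D objective, Gaussian sampling, a least-squares slope as the gradient estimate), that fewer objective evaluations per step give a noisier gradient estimate.

```python
import random
import statistics


def black_box_loss(theta: float) -> float:
    """Hypothetical stand-in for a costly render or simulation."""
    return 0.0 if abs(theta - 2.0) <= 0.5 else 1.0


def slope_estimate(theta, n_samples, sigma, rng) -> float:
    """Slope of a least-squares line through n_samples local evaluations."""
    xs = [theta + rng.gauss(0.0, sigma) for _ in range(n_samples)]
    ys = [black_box_loss(x) for x in xs]  # n_samples black-box calls
    mean_x = sum(xs) / n_samples
    mean_y = sum(ys) / n_samples
    var_x = sum((x - mean_x) ** 2 for x in xs)
    cov_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return cov_xy / var_x if var_x > 0.0 else 0.0


def estimate_spread(n_samples, trials=100, theta=1.0, sigma=1.0, seed=0):
    """Standard deviation of the slope estimate across repeated trials."""
    rng = random.Random(seed)
    estimates = [slope_estimate(theta, n_samples, sigma, rng)
                 for _ in range(trials)]
    return statistics.stdev(estimates)


spread_few = estimate_spread(n_samples=4)     # cheap but noisy
spread_many = estimate_spread(n_samples=256)  # 64x the cost, much less noise
print(spread_few, spread_many)
```

A practical scheme has to sit between these two extremes: enough samples for a usable gradient signal, few enough that the per-step cost stays tractable.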

The ZeroGrads framework is demonstrated on diverse non-convex, non-differentiable black-box problems in graphics, such as visibility in rendering, discrete parameter spaces in procedural modeling, or optimal control in physics-driven animation. In contrast to other derivative-free algorithms, the approach scales well to higher dimensions, handling up to 35,000 interlinked variables.

Critical Analysis

The ZeroGrads paper presents a novel and promising approach to optimizing complex, non-differentiable problems in graphics. The use of a learned surrogate function to enable gradient-based optimization is a clever solution to a longstanding challenge in the field.

One potential limitation of the approach is the reliance on the surrogate's accuracy during the optimization process. If the surrogate fails to capture the true objective function well, the optimization may converge to a suboptimal solution. The paper acknowledges this and suggests further research into techniques like mean curvature flow arising in adversarial training to improve the surrogate's robustness.

Additionally, the paper does not explore the theoretical underpinnings of surrogate gradient learning in spiking neural networks, which could provide valuable insights into the stability and convergence properties of the ZeroGrads framework.

Overall, the ZeroGrads approach represents a significant advancement in gradient-based optimization for graphics and opens up new possibilities for tackling complex, non-differentiable problems. Continued research to address the identified limitations and further expand the capabilities of the framework will be valuable contributions to the field.

Conclusion

The ZeroGrads paper presents a novel framework for optimizing non-convex, non-differentiable black-box problems in graphics by learning a neural surrogate of the objective function. This allows the use of gradient-based optimization techniques, even when the original objective has undefined or zero gradients.

The key innovation is the online, self-supervised training of the surrogate, which focuses its capacity on the most relevant parts of the objective function for the current optimization task. The efficient sampling scheme and scalability to high-dimensional problems make the ZeroGrads framework a powerful tool for tackling diverse challenges in graphics, from rendering visibility to procedural modeling and physics-driven animation.

As the field continues to push the boundaries of graphics optimization, the ZeroGrads approach provides a valuable contribution, opening up new possibilities for researchers and developers to solve complex, non-differentiable problems. Further advancements in the theoretical foundations and robustness of the surrogate model will likely lead to even more impressive applications of this technology.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Zero Grads: Learning Local Surrogate Losses for Non-Differentiable Graphics

Michael Fischer, Tobias Ritschel

Gradient-based optimization is now ubiquitous across graphics, but unfortunately cannot be applied to problems with undefined or zero gradients. To circumvent this issue, the loss function can be manually replaced by a "surrogate" that has similar minima but is differentiable. Our proposed framework, ZeroGrads, automates this process by learning a neural approximation of the objective function, which in turn can be used to differentiate through arbitrary black-box graphics pipelines. We train the surrogate on an actively smoothed version of the objective and encourage locality, focusing the surrogate's capacity on what matters at the current training episode. The fitting is performed online, alongside the parameter optimization, and self-supervised, without pre-computed data or pre-trained models. As sampling the objective is expensive (it requires a full rendering or simulator run), we devise an efficient sampling scheme that allows for tractable run-times and competitive performance at little overhead. We demonstrate optimizing diverse non-convex, non-differentiable black-box problems in graphics, such as visibility in rendering, discrete parameter spaces in procedural modelling or optimal control in physics-driven animation. In contrast to other derivative-free algorithms, our approach scales well to higher dimensions, which we demonstrate on problems with up to 35k interlinked variables.

5/8/2024

Decision-Focused Learning with Directional Gradients

Michael Huang, Vishal Gupta

We propose a novel family of decision-aware surrogate losses, called Perturbation Gradient (PG) losses, for the predict-then-optimize framework. The key idea is to connect the expected downstream decision loss with the directional derivative of a particular plug-in objective, and then approximate this derivative using zeroth order gradient techniques. Unlike the original decision loss which is typically piecewise constant and discontinuous, our new PG losses can be optimized using off-the-shelf gradient-based methods. Most importantly, unlike existing surrogate losses, the approximation error of our PG losses vanishes as the number of samples grows. Hence, optimizing our surrogate loss yields a best-in-class policy asymptotically, even in misspecified settings. This is the first such result in misspecified settings, and we provide numerical evidence confirming our PG losses substantively outperform existing proposals when the underlying model is misspecified.

7/25/2024

Learning Instance-Specific Parameters of Black-Box Models Using Differentiable Surrogates

Arnisha Khondaker, Nilanjan Ray

Tuning parameters of a non-differentiable or black-box compute is challenging. Existing methods rely mostly on random sampling or grid sampling from the parameter space. Further, with all the current methods, it is not possible to supply any input-specific parameters to the black-box. To the best of our knowledge, for the first time, we are able to learn input-specific parameters for a black box in this work. As a test application we choose a popular image denoising method, BM3D, as our black-box compute. Then, we use a differentiable surrogate model (a neural network) to approximate the black-box behaviour. Next, another neural network is used in an end-to-end fashion to learn input instance-specific parameters for the black-box. Drawing inspiration from the work of Tseng et al. [1], we applied our method to the Smartphone Image Denoising Dataset (SIDD) for image denoising. The results are compelling, demonstrating a significant increase in PSNR and a notable improvement in SSIM nearing 0.93. Experimental results underscore the effectiveness of our approach in achieving substantial improvements in both model performance and optimization efficiency. For code and implementation details, please refer to our GitHub repository.

[1] Ethan Tseng, Felix Yu, Yuting Yang, Fahim Mannan, Karl St. Arnaud, Derek Nowrouzezahrai, Jean-Francois Lalonde, and Felix Heide. Hyperparameter optimization in black-box image processing using differentiable proxies. ACM Transactions on Graphics (TOG), 38(4), July 2019.

7/26/2024

Adaptive Gradient Enhanced Gaussian Process Surrogates for Inverse Problems

Phillip Semler, Martin Weiser

Generating simulated training data needed for constructing sufficiently accurate surrogate models to be used for efficient optimization or parameter identification can incur a huge computational effort in the offline phase. We consider a fully adaptive greedy approach to the computational design of experiments problem using gradient-enhanced Gaussian process regression as surrogates. Designs are incrementally defined by solving an optimization problem for accuracy given a certain computational budget. We address not only the choice of evaluation points but also of required simulation accuracy, both of values and gradients of the forward model. Numerical results show a significant reduction of the computational effort compared to just position-adaptive and static designs as well as a clear benefit of including gradient information into the surrogate training.

4/3/2024