Posterior Distillation Sampling

Read original: arXiv:2311.13831 - Published 4/3/2024 by Juil Koo, Chanho Park, Minhyuk Sung


Overview

Posterior Distillation Sampling (PDS) is a new optimization method for editing parametric images using diffusion models.
Existing optimization-based methods have focused on image generation, but editing requires balancing conformity to target attributes and preservation of source content identity.
PDS reformulates a recent 2D image editing method as an optimization that matches the stochastic latents of the source and the target. This enables sampling targets in diverse parameter spaces that align with a desired attribute while maintaining the source's identity.

Plain English Explanation

Diffusion models are a powerful type of machine learning model that can generate all kinds of images. Researchers have used these models for things like image editing, where you take an existing image and modify it in some way.

The challenge with editing is finding a balance: you want the edited image to have the attributes you're aiming for, but you also want it to still look like the original image. Recent methods strike this balance by using the random noise (the "stochastic latent") encoded in the diffusion model's generative process.

The new Posterior Distillation Sampling (PDS) method takes this 2D image editing approach and extends it to work with parametric images. These are images defined not by pixel values, but by a set of numerical parameters - for example, a 3D scene could be defined by the positions, shapes, and materials of the objects in it.
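To make "parametric image" concrete, here is a minimal sketch in which three numbers (a center and a radius) fully determine a rendered picture. The function `render_circle` and its `sharpness` parameter are hypothetical illustrations and do not come from the paper; a NeRF or an SVG plays the same role with far more parameters.

```python
import numpy as np

def render_circle(params, size=32, sharpness=4.0):
    """Render a soft-edged disc from three parameters (cx, cy, r).

    Hypothetical toy renderer: the image is a smooth function of the
    parameters, so an edit can be expressed as a change to `params`
    instead of a change to individual pixels.
    """
    cx, cy, r = params
    ys, xs = np.mgrid[0:size, 0:size]
    dist = np.sqrt((xs - cx) ** 2 + (ys - cy) ** 2)
    # Sigmoid falloff: close to 1 inside the disc, close to 0 outside.
    return 1.0 / (1.0 + np.exp(sharpness * (dist - r)))

theta = np.array([16.0, 16.0, 8.0])  # the "parametric image"
image = render_circle(theta)         # its rendered pixels
```

Editing in parameter space means updating `theta`, with the diffusion model only ever seeing the rendered `image`.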

PDS matches the stochastic latents of the source and the edited target, so the result takes on the desired attributes while still resembling the original. This lets you edit things like 3D scenes or vector graphics in a way that preserves their core identity.

Technical Explanation

The key innovation in PDS is reformulating a recent 2D image editing method as an optimization problem over the parameters of a parametric image. Existing optimization-based methods that use diffusion models as 2D priors have mainly focused on generation, whereas editing requires balancing conformity to the target attribute against preservation of the source content's identity.

Recent 2D image editing approaches have achieved this balance by leveraging the stochastic latent encoded in the generative process of diffusion models. PDS extends this insight to the parametric domain, matching the stochastic latents of the source and target parametric images.
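One way to write the stochastic latent down is through standard DDPM ancestral sampling. The notation below (the posterior mean $\mu_\phi$, the step noise scale $\sigma_t$, the cumulative schedule $\bar\alpha_t$, and prompts $y$) is the usual DDPM notation and is assumed here, since this summary does not reproduce the paper's formulas. The paper's exact weighting and gradient may differ from this sketch.

```latex
% Forward-process samples that share the same clean image x_0
x_t     = \sqrt{\bar\alpha_t}\, x_0 + \sqrt{1-\bar\alpha_t}\, \epsilon_t, \qquad
x_{t-1} = \sqrt{\bar\alpha_{t-1}}\, x_0 + \sqrt{1-\bar\alpha_{t-1}}\, \epsilon_{t-1}

% One DDPM reverse step, solved for the injected noise z_t
x_{t-1} = \mu_\phi(x_t, y) + \sigma_t z_t
\quad\Longrightarrow\quad
z_t(x_0, y) = \frac{x_{t-1} - \mu_\phi(x_t, y)}{\sigma_t}

% Matching objective over the target parameters \theta,
% with the same noises \epsilon_t, \epsilon_{t-1} used for source and target
\mathcal{L}(\theta) = \mathbb{E}_{t,\,\epsilon_t,\,\epsilon_{t-1}}
\left\| z_t\big(x_0^{\mathrm{tgt}}(\theta),\, y^{\mathrm{tgt}}\big)
      - z_t\big(x_0^{\mathrm{src}},\, y^{\mathrm{src}}\big) \right\|_2^2
```

Here $x_0^{\mathrm{tgt}}(\theta)$ is the rendering of the parametric image being edited and $x_0^{\mathrm{src}}$ is the rendering of the unedited source.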

This optimization resembles running the generative diffusion process toward the target attribute while keeping that process aligned with the trajectory of the source's generative process. Extensive experiments on editing Neural Radiance Fields (NeRF) and Scalable Vector Graphics (SVG) demonstrate that PDS can sample diverse parametric targets that achieve this balance.
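The idea of matching stochastic latents can be illustrated with a small toy sketch. Everything here is an assumption made for illustration and is not the authors' implementation. The "image" is a 3-vector optimized directly, and `toy_eps_model` is a hypothetical stand-in for a pretrained noise predictor (exact when all data sits at the conditioning point). The schedule is a plain linear DDPM schedule, and the update is a per-timestep normalized step chosen for stability. The toy predictor is differentiated analytically, which a real pipeline would typically avoid.

```python
import numpy as np

def make_schedule(T=1000, beta_start=1e-4, beta_end=2e-2):
    """Linear DDPM beta schedule; returns (betas, alphas, alpha_bars)."""
    betas = np.linspace(beta_start, beta_end, T)
    alphas = 1.0 - betas
    return betas, alphas, np.cumprod(alphas)

def toy_eps_model(x_t, cond, t, alpha_bars):
    """Hypothetical noise predictor: exact if all data equals `cond`."""
    a_bar = alpha_bars[t]
    return (x_t - np.sqrt(a_bar) * cond) / np.sqrt(1.0 - a_bar)

def stochastic_latent(x0, cond, t, eps_t, eps_prev, sched):
    """Return z_t = (x_{t-1} - mu(x_t, cond)) / sigma_t and dz_t/dx0."""
    betas, alphas, alpha_bars = sched
    a_t, a_prev = alpha_bars[t], alpha_bars[t - 1]
    # Forward-process samples built from the same clean x0.
    x_t = np.sqrt(a_t) * x0 + np.sqrt(1.0 - a_t) * eps_t
    x_prev = np.sqrt(a_prev) * x0 + np.sqrt(1.0 - a_prev) * eps_prev
    # DDPM posterior mean from the predicted clean image.
    eps_hat = toy_eps_model(x_t, cond, t, alpha_bars)
    x0_hat = (x_t - np.sqrt(1.0 - a_t) * eps_hat) / np.sqrt(a_t)
    c0 = np.sqrt(a_prev) * betas[t] / (1.0 - a_t)
    c1 = np.sqrt(alphas[t]) * (1.0 - a_prev) / (1.0 - a_t)
    sigma = np.sqrt(betas[t] * (1.0 - a_prev) / (1.0 - a_t))
    z = (x_prev - (c0 * x0_hat + c1 * x_t)) / sigma
    # Slope of z in x0 for this toy predictor, where x0_hat equals cond.
    slope = (np.sqrt(a_prev) - c1 * np.sqrt(a_t)) / sigma
    return z, slope

def edit(x0_src, c_src, c_tgt, steps=60, lr=0.5, seed=0):
    """Move a copy of the source until its latents match the source's."""
    rng = np.random.default_rng(seed)
    sched = make_schedule()
    T = len(sched[0])
    x0_tgt = x0_src.copy()
    for _ in range(steps):
        t = int(rng.integers(1, T))  # t - 1 must be a valid index
        # The same noises are shared by the source and the target.
        eps_t = rng.standard_normal(x0_src.shape)
        eps_prev = rng.standard_normal(x0_src.shape)
        z_src, _ = stochastic_latent(x0_src, c_src, t, eps_t, eps_prev, sched)
        z_tgt, slope = stochastic_latent(x0_tgt, c_tgt, t, eps_t, eps_prev, sched)
        # Normalized step: the latent mismatch rescaled into image units.
        x0_tgt = x0_tgt - lr * (z_tgt - z_src) / slope
    return x0_tgt

x0_src = np.array([1.0, -2.0, 0.5])  # source "image"
c_src = np.array([1.2, -1.8, 0.4])   # source condition (toy data point)
c_tgt = np.array([3.2, -1.8, 1.4])   # target condition
x0_edit = edit(x0_src, c_src, c_tgt)
print(x0_edit)
```

In this toy setting the result should land close to `x0_src + (c_tgt - c_src)`: the edit follows the change in the condition, while the source's offset from its own condition is carried over unchanged. That is a small-scale analogue of adopting the target attribute while preserving source identity.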

Critical Analysis

The paper provides a thorough technical explanation of the PDS method and demonstrates its effectiveness on various parametric image editing tasks. However, a few potential limitations and areas for further research are worth noting:

The paper focuses on the optimization formulation and does not delve deeply into the specifics of the diffusion model architectures or training procedures used. More details on these aspects could help researchers build upon this work.
While the results show PDS can edit parametric images while preserving identity, the paper does not quantify or analyze this balancing act in depth. Developing more rigorous evaluation metrics for this tradeoff could strengthen the claims.
The experiments are limited to certain parametric representations like NeRF and SVG. Exploring the generalization of PDS to other parametric image formats could broaden its applicability.

Overall, the PDS method represents an interesting advance in parametric image editing using diffusion models. Further research to address these potential limitations could help solidify the contributions and expand the practical impact of this work.

Conclusion

Posterior Distillation Sampling (PDS) is a novel optimization-based method that extends the editing capabilities of diffusion models from the pixel domain to the parametric domain. By matching the stochastic latent representations of source and target parametric images, PDS can generate diverse edited results that balance conformity to desired attributes and preservation of source identity.

These results are relevant to a range of parametric editing applications, from 3D scene manipulation to vector graphic design. As diffusion models continue to advance, optimization-based techniques like PDS offer one way to carry their editing capabilities over to parametric content.


