Learning Diffusion Priors from Observations by Expectation Maximization

2405.13712

Published 5/24/2024 by François Rozet, Gérôme Andry, François Lanusse, Gilles Louppe

Abstract

Diffusion models recently proved to be remarkable priors for Bayesian inverse problems. However, training these models typically requires access to large amounts of clean data, which could prove difficult in some settings. In this work, we present a novel method based on the expectation-maximization algorithm for training diffusion models from incomplete and noisy observations only. Unlike previous works, our method leads to proper diffusion models, which is crucial for downstream tasks. As part of our method, we propose and motivate a new posterior sampling scheme for unconditional diffusion models. We present empirical evidence supporting the effectiveness of our method.

Overview

Diffusion models have recently shown remarkable potential as priors for Bayesian inverse problems.
However, training these models typically requires large amounts of clean data, which can be difficult to obtain in some settings.
This paper presents a novel method based on the expectation-maximization (EM) algorithm for training diffusion models from incomplete and noisy observations only.
The method produces proper diffusion models, which is crucial for downstream tasks.
The paper also proposes and motivates a new posterior sampling scheme for unconditional diffusion models.

Plain English Explanation

Diffusion models are a powerful class of machine learning models that have recently proved very useful for inverse problems, where the goal is to recover an unknown signal from indirect or corrupted measurements. However, training these models typically requires a lot of high-quality, clean data, which can be hard to come by in some situations.

This paper introduces a new way to train diffusion models using only incomplete and noisy data. The key idea is to use a technique called the expectation-maximization (EM) algorithm to iteratively improve the model. This allows the model to be trained even when the data is messy or incomplete.

Additionally, the paper proposes a new way to draw samples from the diffusion model that agree with a given observation, known as posterior sampling. According to the abstract, this sampling step is part of the training method itself.

Overall, this research provides a practical way to use powerful diffusion models even when the available data is not perfect. This could enable the use of these models in a wider range of real-world applications, like image restoration or conditional generation.

Technical Explanation

The paper presents a novel method for training diffusion models from incomplete and noisy observations. Unlike previous work, this method produces proper diffusion models, which is crucial for downstream tasks.

The key innovation is the use of the expectation-maximization (EM) algorithm to iteratively update the diffusion model parameters. The EM algorithm alternates between estimating the latent clean data (the E-step) and updating the model parameters to better fit the observed noisy data (the M-step).
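The alternation between the E-step and the M-step can be illustrated with a much simpler prior than a diffusion model. The sketch below is not the paper's method: it assumes a Gaussian prior, coordinate-wise masking as the source of incompleteness, and Gaussian noise of known scale `sigma`, so that both steps have closed forms. The function names `simulate_observations` and `em_gaussian_prior` are hypothetical and exist only for this illustration.

```python
import numpy as np


def simulate_observations(n=1000, sigma=0.1, keep_prob=0.7, seed=0):
    """Draw latent x ~ N(true_mu, true_cov), then hide entries and add noise."""
    rng = np.random.default_rng(seed)
    true_mu = np.array([1.0, -2.0, 0.5])
    true_cov = np.array([[2.0, 0.6, 0.0],
                         [0.6, 1.0, -0.3],
                         [0.0, -0.3, 0.5]])
    xs = rng.multivariate_normal(true_mu, true_cov, size=n)
    masks = rng.random(xs.shape) < keep_prob      # True where a coordinate is observed
    noisy = xs + sigma * rng.standard_normal(xs.shape)
    ys = np.where(masks, noisy, 0.0)              # hidden entries are never read below
    return ys, masks, true_mu, true_cov


def em_gaussian_prior(ys, masks, sigma, n_iters=30):
    """Fit a Gaussian prior N(mu, cov) from masked, noisy observations by EM."""
    n, d = ys.shape
    mu, cov = np.zeros(d), np.eye(d)              # deliberately uninformed start
    for _ in range(n_iters):
        post_means = np.empty((n, d))
        post_cov_sum = np.zeros((d, d))
        # E-step: Gaussian posterior of each latent x_i given y_i and the current prior.
        for i in range(n):
            o = np.flatnonzero(masks[i])          # indices of observed coordinates
            if o.size == 0:                       # nothing observed: posterior equals prior
                post_means[i] = mu
                post_cov_sum += cov
                continue
            s = cov[np.ix_(o, o)] + sigma**2 * np.eye(o.size)
            gain = np.linalg.solve(s, cov[o, :]).T    # cov[:, o] @ inv(s)
            post_means[i] = mu + gain @ (ys[i, o] - mu[o])
            post_cov_sum += cov - gain @ cov[o, :]
        # M-step: refit the prior to the expected sufficient statistics.
        mu = post_means.mean(axis=0)
        centered = post_means - mu
        cov = (post_cov_sum + centered.T @ centered) / n
        cov = 0.5 * (cov + cov.T)                 # remove round-off asymmetry
    return mu, cov


if __name__ == "__main__":
    ys, masks, true_mu, true_cov = simulate_observations()
    mu, cov = em_gaussian_prior(ys, masks, sigma=0.1)
    print("max |mu error| :", np.abs(mu - true_mu).max())
    print("max |cov error|:", np.abs(cov - true_cov).max())
```

In this toy setting the E-step is exact. With a diffusion prior the posterior has no closed form, which is presumably why the paper pairs EM with a posterior sampling scheme.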

The paper also proposes and motivates a new posterior sampling scheme for unconditional diffusion models, which the abstract describes as part of the training method. Posterior sampling here means drawing samples that are consistent with a given observation from a model that was trained without any conditioning.
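As general background, and not a statement of the paper's specific scheme, posterior sampling with an unconditional diffusion model is commonly framed through Bayes' rule applied to the score. The notation is assumed for illustration: $x_t$ is the noised latent at diffusion time $t$, $x$ is the clean latent, and $y$ is the observation.

```latex
\nabla_{x_t} \log p(x_t \mid y)
  = \nabla_{x_t} \log p(x_t) + \nabla_{x_t} \log p(y \mid x_t),
\qquad
p(y \mid x_t) = \int p(y \mid x) \, p(x \mid x_t) \, \mathrm{d}x
```

The first term is what an unconditional diffusion model already estimates. The second term involves an integral over the denoising posterior $p(x \mid x_t)$, which is generally intractable, so posterior sampling schemes differ mainly in how they approximate it.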

Extensive experiments are conducted to evaluate the effectiveness of the proposed method. The results show that the method can successfully train diffusion models from incomplete and noisy data, and that the resulting models perform well on downstream tasks, such as image restoration and conditional generation.

Critical Analysis

The paper presents a promising approach for training diffusion models from incomplete and noisy data. However, there are a few potential limitations and areas for further research:

Although the abstract states that training uses incomplete and noisy observations only, EM procedures generally depend on their initialization. How sensitive the method is to the initial prior, and how it behaves when observations are very sparse or very noisy, deserves further study.
The paper does not extensively explore the limitations of the proposed posterior sampling scheme. More analysis is needed to understand its strengths, weaknesses, and potential failure modes compared to other sampling methods.
The experiments are focused on relatively simple datasets and tasks. Evaluating the method's performance on more complex, real-world problems would provide a better understanding of its practical utility.
The paper does not discuss the computational complexity of the EM-based training process. Understanding the scalability of the method is important for its deployment in large-scale applications.

Overall, the paper presents an interesting and valuable contribution to the field of diffusion models. The proposed method has the potential to expand the applicability of these powerful models, but further research is needed to fully assess its capabilities and limitations.

Conclusion

This paper introduces a novel method for training diffusion models from incomplete and noisy data using the expectation-maximization algorithm. The method produces proper diffusion models that can be used for a variety of downstream tasks, such as image restoration and conditional generation.

The key innovations are the EM-based training process and the new posterior sampling scheme for unconditional diffusion models. Experiments demonstrate the effectiveness of the proposed approach, suggesting that it could enable the use of diffusion models in settings where clean data is scarce.

While the paper presents a promising step forward, there are still some limitations and areas for further research, such as characterizing the failure modes of the posterior sampling scheme and understanding the scalability of the method. Overall, this work contributes to the growing body of research on diffusion models and their applications in machine learning and beyond.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Improving Diffusion Models for Inverse Problems Using Optimal Posterior Covariance

Xinyu Peng, Ziyang Zheng, Wenrui Dai, Nuoqian Xiao, Chenglin Li, Junni Zou, Hongkai Xiong

Recent diffusion models provide a promising zero-shot solution to noisy linear inverse problems without retraining for specific inverse problems. In this paper, we reveal that recent methods can be uniformly interpreted as employing a Gaussian approximation with hand-crafted isotropic covariance for the intractable denoising posterior to approximate the conditional posterior mean. Inspired by this finding, we propose to improve recent methods by using more principled covariance determined by maximum likelihood estimation. To achieve posterior covariance optimization without retraining, we provide general plug-and-play solutions based on two approaches specifically designed for leveraging pre-trained models with and without reverse covariance. We further propose a scalable method for learning posterior covariance prediction based on representation with orthonormal basis. Experimental results demonstrate that the proposed methods significantly enhance reconstruction performance without requiring hyperparameter tuning.

6/4/2024

cs.CV cs.LG

Principled Probabilistic Imaging using Diffusion Models as Plug-and-Play Priors

Zihui Wu, Yu Sun, Yifan Chen, Bingliang Zhang, Yisong Yue, Katherine L. Bouman

Diffusion models (DMs) have recently shown outstanding capability in modeling complex image distributions, making them expressive image priors for solving Bayesian inverse problems. However, most existing DM-based methods rely on approximations in the generative process to be generic to different inverse problems, leading to inaccurate sample distributions that deviate from the target posterior defined within the Bayesian framework. To harness the generative power of DMs while avoiding such approximations, we propose a Markov chain Monte Carlo algorithm that performs posterior sampling for general inverse problems by reducing it to sampling the posterior of a Gaussian denoising problem. Crucially, we leverage a general DM formulation as a unified interface that allows for rigorously solving the denoising problem with a range of state-of-the-art DMs. We demonstrate the effectiveness of the proposed method on six inverse problems (three linear and three nonlinear), including a real-world black hole imaging problem. Experimental results indicate that our proposed method offers more accurate reconstructions and posterior estimation compared to existing DM-based imaging inverse methods.

5/30/2024

eess.IV cs.CV stat.ML

Diffusion Posterior Sampling for General Noisy Inverse Problems

Hyungjin Chung, Jeongsol Kim, Michael T. McCann, Marc L. Klasky, Jong Chul Ye

Diffusion models have been recently studied as powerful generative inverse problem solvers, owing to their high quality reconstructions and the ease of combining existing iterative solvers. However, most works focus on solving simple linear inverse problems in noiseless settings, which significantly under-represents the complexity of real-world problems. In this work, we extend diffusion solvers to efficiently handle general noisy (non)linear inverse problems via approximation of the posterior sampling. Interestingly, the resulting posterior sampling scheme is a blended version of diffusion sampling with the manifold constrained gradient without a strict measurement consistency projection step, yielding a more desirable generative path in noisy settings compared to the previous studies. Our method demonstrates that diffusion models can incorporate various measurement noise statistics such as Gaussian and Poisson, and also efficiently handle noisy nonlinear inverse problems such as Fourier phase retrieval and non-uniform deblurring. Code available at https://github.com/DPS2022/diffusion-posterior-sampling

5/21/2024

stat.ML cs.AI cs.CV cs.LG

Unleashing the Denoising Capability of Diffusion Prior for Solving Inverse Problems

Jiawei Zhang, Jiaxin Zhuang, Cheng Jin, Gen Li, Yuantao Gu

The recent emergence of diffusion models has significantly advanced the precision of learnable priors, presenting innovative avenues for addressing inverse problems. Since inverse problems inherently entail maximum a posteriori estimation, previous works have endeavored to integrate diffusion priors into the optimization frameworks. However, prevailing optimization-based inverse algorithms primarily exploit the prior information within the diffusion models while neglecting their denoising capability. To bridge this gap, this work leverages the diffusion process to reframe noisy inverse problems as a two-variable constrained optimization task by introducing an auxiliary optimization variable. By employing gradient truncation, the projection gradient descent method is efficiently utilized to solve the corresponding optimization problem. The proposed algorithm, termed ProjDiff, effectively harnesses the prior information and the denoising capability of a pre-trained diffusion model within the optimization framework. Extensive experiments on the image restoration tasks and source separation and partial generation tasks demonstrate that ProjDiff exhibits superior performance across various linear and nonlinear inverse problems, highlighting its potential for practical applications. Code is available at https://github.com/weigerzan/ProjDiff/.

6/12/2024

cs.LG cs.AI