Learning Image Priors through Patch-based Diffusion Models for Solving Inverse Problems

2406.02462

Published 6/5/2024 by Jason Hu, Bowen Song, Xiaojian Xu, Liyue Shen, Jeffrey A. Fessler

Learning Image Priors through Patch-based Diffusion Models for Solving Inverse Problems

Abstract

Diffusion models can learn strong image priors from underlying data distribution and use them to solve inverse problems, but the training process is computationally expensive and requires lots of data. Such bottlenecks prevent most existing works from being feasible for high-dimensional and high-resolution data such as 3D images. This paper proposes a method to learn an efficient data prior for the entire image by training diffusion models only on patches of images. Specifically, we propose a patch-based position-aware diffusion inverse solver, called PaDIS, where we obtain the score function of the whole image through scores of patches and their positional encoding and utilize this as the prior for solving inverse problems. First of all, we show that this diffusion model achieves an improved memory efficiency and data efficiency while still maintaining the capability to generate entire images via positional encoding. Additionally, the proposed PaDIS model is highly flexible and can be plugged in with different diffusion inverse solvers (DIS). We demonstrate that the proposed PaDIS approach enables solving various inverse problems in both natural and medical image domains, including CT reconstruction, deblurring, and superresolution, given only patch-based priors. Notably, PaDIS outperforms previous DIS methods trained on entire image priors in the case of limited training data, demonstrating the data efficiency of our proposed approach by learning patch-based prior.

Create account to get full access

Overview

This paper presents a novel approach to solving inverse problems in imaging using patch-based diffusion models.
The key idea is to learn image priors through these diffusion models, which can then be used to effectively solve inverse problems like image denoising, super-resolution, and inpainting.
The method builds on recent advancements in diffusion models, such as Principled Probabilistic Imaging Using Diffusion Models, Solving Inverse Problems with Latent Diffusion Models, and ODE-DPS: ODE-based Diffusion Posterior Sampling.

Plain English Explanation

The paper proposes a new way to solve complex image problems, like making blurry images clearer or filling in missing parts of an image. The key idea is to use a special kind of machine learning model called a "diffusion model" to learn the patterns and structure of natural images.

Diffusion models work by gradually adding noise to an image, then learning how to reverse that process and generate new images that look natural. The paper shows how these diffusion models can be used as a powerful "prior" or starting point for solving inverse problems, where the goal is to reconstruct an original image from incomplete or corrupted data.

By using the diffusion model's understanding of natural image structure, the method can effectively denoise, super-resolve, and inpaint images, outperforming previous approaches. This is an exciting development, as solving inverse problems is crucial for many real-world applications like medical imaging, computational photography, and image restoration.

Technical Explanation

The paper introduces a novel framework for solving inverse problems in imaging by leveraging patch-based diffusion models to learn powerful image priors. This builds on recent advancements in diffusion models, such as Principled Probabilistic Imaging Using Diffusion Models, Solving Inverse Problems with Latent Diffusion Models, and ODE-DPS: ODE-based Diffusion Posterior Sampling.

The key components of the proposed framework are:

Patch-based Diffusion Model: The authors train a diffusion model to learn the distribution of natural image patches, capturing their structure and patterns.
Inverse Problem Solver: This module takes the corrupted or incomplete input image and uses the learned image prior from the diffusion model to reconstruct the original image, effectively solving inverse problems like denoising, super-resolution, and inpainting.
Optimization Strategy: The authors develop a custom optimization strategy that leverages the diffusion model's understanding of natural images to efficiently solve the inverse problem.

Experiments on a variety of inverse problems demonstrate the effectiveness of this approach, outperforming previous state-of-the-art methods. The authors also show how the framework can be extended to handle more complex inverse problems, such as those encountered in IntrinsicAnything: Learning Diffusion Priors for Inverse Rendering and PatchScaler: Efficient Patch-Independent Diffusion Model for Super-Resolution.

Critical Analysis

The paper presents a compelling approach to solving inverse problems in imaging by leveraging the power of patch-based diffusion models. The authors carefully design their framework and demonstrate its effectiveness on a range of inverse problems, which is a significant contribution to the field.

However, the paper does not address some potential limitations or areas for further research. For instance, the authors do not discuss the computational complexity of their method or how it might scale to higher-resolution images. Additionally, the paper does not explore the robustness of the approach to different types of image corruption or the impact of the diffusion model's architecture on the final performance.

Furthermore, while the authors show promising results, it would be valuable to see a more in-depth analysis of the failure cases and the limitations of their approach. Addressing these aspects could help researchers better understand the strengths and weaknesses of the proposed framework and guide future improvements.

Conclusion

This paper presents a novel framework for solving inverse problems in imaging by leveraging patch-based diffusion models to learn powerful image priors. The authors demonstrate the effectiveness of their approach on a variety of inverse problems, outperforming previous state-of-the-art methods.

The key contribution of this work is the integration of recent advancements in diffusion models into a comprehensive framework for solving inverse problems. This represents an exciting development in the field, as it opens up new possibilities for leveraging the rich representational capabilities of diffusion models to tackle complex imaging challenges.

While the paper leaves some avenues for further research, it provides a solid foundation for continued exploration and advancement in this area. Ultimately, the proposed framework has the potential to significantly impact various applications, from medical imaging and computational photography to image restoration and beyond.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Unleashing the Denoising Capability of Diffusion Prior for Solving Inverse Problems

Jiawei Zhang, Jiaxin Zhuang, Cheng Jin, Gen Li, Yuantao Gu

The recent emergence of diffusion models has significantly advanced the precision of learnable priors, presenting innovative avenues for addressing inverse problems. Since inverse problems inherently entail maximum a posteriori estimation, previous works have endeavored to integrate diffusion priors into the optimization frameworks. However, prevailing optimization-based inverse algorithms primarily exploit the prior information within the diffusion models while neglecting their denoising capability. To bridge this gap, this work leverages the diffusion process to reframe noisy inverse problems as a two-variable constrained optimization task by introducing an auxiliary optimization variable. By employing gradient truncation, the projection gradient descent method is efficiently utilized to solve the corresponding optimization problem. The proposed algorithm, termed ProjDiff, effectively harnesses the prior information and the denoising capability of a pre-trained diffusion model within the optimization framework. Extensive experiments on the image restoration tasks and source separation and partial generation tasks demonstrate that ProjDiff exhibits superior performance across various linear and nonlinear inverse problems, highlighting its potential for practical applications. Code is available at https://github.com/weigerzan/ProjDiff/.

6/12/2024

cs.LG cs.AI

DiffusionBlend: Learning 3D Image Prior through Position-aware Diffusion Score Blending for 3D Computed Tomography Reconstruction

Bowen Song, Jason Hu, Zhaoxu Luo, Jeffrey A. Fessler, Liyue Shen

Diffusion models face significant challenges when employed for large-scale medical image reconstruction in real practice such as 3D Computed Tomography (CT). Due to the demanding memory, time, and data requirements, it is difficult to train a diffusion model directly on the entire volume of high-dimensional data to obtain an efficient 3D diffusion prior. Existing works utilizing diffusion priors on single 2D image slice with hand-crafted cross-slice regularization would sacrifice the z-axis consistency, which results in severe artifacts along the z-axis. In this work, we propose a novel framework that enables learning the 3D image prior through position-aware 3D-patch diffusion score blending for reconstructing large-scale 3D medical images. To the best of our knowledge, we are the first to utilize a 3D-patch diffusion prior for 3D medical image reconstruction. Extensive experiments on sparse view and limited angle CT reconstruction show that our DiffusionBlend method significantly outperforms previous methods and achieves state-of-the-art performance on real-world CT reconstruction problems with high-dimensional 3D image (i.e., $256 times 256 times 500$). Our algorithm also comes with better or comparable computational efficiency than previous state-of-the-art methods.

6/17/2024

cs.CV

Principled Probabilistic Imaging using Diffusion Models as Plug-and-Play Priors

Zihui Wu, Yu Sun, Yifan Chen, Bingliang Zhang, Yisong Yue, Katherine L. Bouman

Diffusion models (DMs) have recently shown outstanding capability in modeling complex image distributions, making them expressive image priors for solving Bayesian inverse problems. However, most existing DM-based methods rely on approximations in the generative process to be generic to different inverse problems, leading to inaccurate sample distributions that deviate from the target posterior defined within the Bayesian framework. To harness the generative power of DMs while avoiding such approximations, we propose a Markov chain Monte Carlo algorithm that performs posterior sampling for general inverse problems by reducing it to sampling the posterior of a Gaussian denoising problem. Crucially, we leverage a general DM formulation as a unified interface that allows for rigorously solving the denoising problem with a range of state-of-the-art DMs. We demonstrate the effectiveness of the proposed method on six inverse problems (three linear and three nonlinear), including a real-world black hole imaging problem. Experimental results indicate that our proposed method offers more accurate reconstructions and posterior estimation compared to existing DM-based imaging inverse methods.

5/30/2024

eess.IV cs.CV stat.ML

📊

Solving Inverse Problems with Latent Diffusion Models via Hard Data Consistency

Bowen Song, Soo Min Kwon, Zecheng Zhang, Xinyu Hu, Qing Qu, Liyue Shen

Diffusion models have recently emerged as powerful generative priors for solving inverse problems. However, training diffusion models in the pixel space are both data-intensive and computationally demanding, which restricts their applicability as priors for high-dimensional real-world data such as medical images. Latent diffusion models, which operate in a much lower-dimensional space, offer a solution to these challenges. However, incorporating latent diffusion models to solve inverse problems remains a challenging problem due to the nonlinearity of the encoder and decoder. To address these issues, we propose textit{ReSample}, an algorithm that can solve general inverse problems with pre-trained latent diffusion models. Our algorithm incorporates data consistency by solving an optimization problem during the reverse sampling process, a concept that we term as hard data consistency. Upon solving this optimization problem, we propose a novel resampling scheme to map the measurement-consistent sample back onto the noisy data manifold and theoretically demonstrate its benefits. Lastly, we apply our algorithm to solve a wide range of linear and nonlinear inverse problems in both natural and medical images, demonstrating that our approach outperforms existing state-of-the-art approaches, including those based on pixel-space diffusion models.

4/17/2024

cs.CV