Iterative CT Reconstruction via Latent Variable Optimization of Shallow Diffusion Models

Read original: arXiv:2408.03156 - Published 9/14/2024 by Sho Ozaki, Shizuo Kaji, Toshikazu Imae, Kanabu Nawa, Hideomi Yamashita, Keiichi Nakagawa

Overview

Presents an iterative approach to CT reconstruction using shallow diffusion models and latent variable optimization
Aims to improve upon existing CT reconstruction methods by leveraging the capabilities of diffusion models
Demonstrates the method on sparse-projection CT reconstruction from 1/10 of the projection data, with a further test on 1/20

Plain English Explanation

The paper describes a new method for improving the quality of computed tomography (CT) scans. CT scans are medical imaging tests that use X-rays to create detailed images of the body. Reconstructing these images from the X-ray data becomes difficult when only a fraction of the usual projections is acquired, and the resulting images tend to be of lower quality.

The researchers in this paper propose using a type of machine learning model called a "diffusion model" to help improve the CT reconstruction process. Diffusion models are a powerful tool for generating high-quality images, and the researchers show how they can be used in an iterative way to gradually refine and improve the CT reconstruction.

Specifically, the method involves optimizing the latent (hidden) variables in the diffusion model to produce a CT image that best matches the X-ray data. By iterating this process, the method is able to generate CT images with higher quality and fewer artifacts compared to traditional reconstruction techniques.

The researchers demonstrate their approach on sparse-projection CT, where only 1/10 of the projection data is used, and report better structural similarity and peak signal-to-noise ratio than the methods they compare against. If the results hold up more broadly, the approach could help produce usable images from fewer X-ray projections.

Technical Explanation

The paper combines the denoising diffusion probabilistic model (DDPM) with iterative CT reconstruction to improve the quality of reconstructed images. The key components of the method are:

Shallow Diffusion Model: The researchers shorten ("shallow") the diffusion and reverse processes rather than running them at full depth, which suppresses changes to anatomical structures that the diffusion model would otherwise introduce. The noise added during the reverse process is fixed, so inference is deterministic.
Latent Variable Optimization: The method minimizes the fidelity loss of CT reconstruction with respect to the latent variable of the diffusion model, rather than the image or the model parameters, so that the generated CT image matches the observed projection data.
Iterative Refinement: The latent variable is updated at each iteration, so the reconstruction is refined progressively, with fewer artifacts and less noise than the baseline reconstruction techniques.
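
The three components above can be sketched in a few lines of NumPy. This is a toy illustration under stated assumptions, not the authors' implementation: the matrix A stands in for the sparse-projection operator, the linear smoothing matrix S stands in for the trained denoising network, and every name, size, and noise scale here is hypothetical. Because the stand-in denoiser is linear, the gradient with respect to the latent variable can be written by hand; with a real network it would come from automatic differentiation.

```python
import numpy as np

rng = np.random.default_rng(0)
n, m = 16, 8          # toy image size and number of measurements (m < n: "sparse")
NUM_STEPS = 3         # a short ("shallow") reverse process

# Hypothetical stand-ins: A for the projection operator, S for the denoiser.
A = rng.standard_normal((m, n))
S = 0.5 * np.eye(n) + 0.25 * (np.eye(n, k=1) + np.eye(n, k=-1))  # symmetric smoothing
sigmas = [0.05, 0.02, 0.0]                         # assumed noise scale per step
fixed_noise = rng.standard_normal((NUM_STEPS, n))  # drawn once, then reused


def reverse_process(z):
    """Map a latent z to an image; deterministic because the noise is fixed."""
    x = z
    for t in range(NUM_STEPS):
        x = S @ x + sigmas[t] * fixed_noise[t]
    return x


def fidelity_loss(z, y):
    """Data-fidelity term 0.5 * ||A G(z) - y||^2, where G is the reverse process."""
    residual = A @ reverse_process(z) - y
    return 0.5 * float(residual @ residual)


def fidelity_grad(z, y):
    """Gradient of the fidelity loss with respect to the latent z (chain rule)."""
    g = A.T @ (A @ reverse_process(z) - y)
    for _ in range(NUM_STEPS):
        g = S.T @ g   # back-propagate through each linear reverse step
    return g


# Simulated measurements from a known toy image.
x_true = np.sin(np.linspace(0.0, np.pi, n))
y = A @ x_true

# Step size 1/L, with L the largest eigenvalue of the (quadratic) loss Hessian.
J = np.linalg.matrix_power(S, NUM_STEPS)
step = 1.0 / np.linalg.norm(A @ J, 2) ** 2

z = rng.standard_normal(n)   # assumed initial latent, chosen only for the demo
initial_loss = fidelity_loss(z, y)
for _ in range(200):
    z = z - step * fidelity_grad(z, y)   # update the latent, not the image
final_loss = fidelity_loss(z, y)

print(f"fidelity loss: {initial_loss:.4f} -> {final_loss:.4f}")
```

Note that only z is updated: the stand-in denoiser S and the fixed noise never change, which mirrors the idea of optimizing the latent variable instead of the image or the model parameters.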

The researchers evaluate their approach on sparse-projection CT reconstruction using 1/10 of the projection data. They report that it outperforms iterative reconstruction, iterative reconstruction with total variation, and the diffusion model alone on quantitative indices such as peak signal-to-noise ratio (PSNR) and the structural similarity index (SSIM). With the same trained diffusion model and only 1/20 of the projection data, image quality approached that of the 1/10 reconstruction as the number of iterations increased.
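
For readers unfamiliar with the first of these metrics, PSNR is defined as 10 * log10(MAX^2 / MSE), where MSE is the mean squared error between the reference image and the reconstruction and MAX is the largest possible pixel value. The following is a minimal sketch, assuming both images are NumPy arrays on the same intensity scale; the function name `psnr` and the `data_range` default are illustrative choices and are not taken from the paper.

```python
import numpy as np


def psnr(reference, estimate, data_range=1.0):
    """Peak signal-to-noise ratio in dB; higher means closer to the reference."""
    reference = np.asarray(reference, dtype=float)
    estimate = np.asarray(estimate, dtype=float)
    mse = np.mean((reference - estimate) ** 2)
    if mse == 0.0:
        return float("inf")   # identical images
    return 10.0 * np.log10(data_range ** 2 / mse)


# A uniform error of 0.1 on a [0, 1] scale gives MSE = 0.01, i.e. about 20 dB.
reference = np.zeros((4, 4))
estimate = np.full((4, 4), 0.1)
print(round(psnr(reference, estimate), 2))
```

SSIM is more involved because it compares local means, variances, and covariances; scikit-image provides an implementation as `skimage.metrics.structural_similarity`.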

Critical Analysis

The paper presents a novel and promising approach to CT image reconstruction, leveraging the capabilities of diffusion models to improve upon existing methods. However, there are a few potential caveats and limitations to consider:

Computational Complexity: The iterative nature of the proposed method may increase the computational cost compared to some traditional reconstruction algorithms, which could be a barrier to real-time clinical deployment.
Generalization Ability: The paper primarily evaluates the method on a limited set of CT datasets, and further research is needed to assess its generalization to a wider range of imaging modalities and acquisition settings.
Interpretability: As with many deep learning-based methods, the internal workings of the diffusion model and the optimization process may be difficult to interpret, which could be a concern for medical applications where explainability is important.
Artifacts and Biases: While the method shows improvements in reducing certain types of artifacts, it is possible that it may introduce new types of artifacts or biases that are not yet fully characterized.

Overall, the research is a promising development in CT image reconstruction. It merits further investigation and validation, particularly in real-world clinical settings.

Conclusion

This paper introduces an approach to CT image reconstruction that combines shallow diffusion models with iterative latent variable optimization. On sparse-projection data, the method improves image quality over iterative reconstruction with and without total variation, and over the diffusion model alone, which suggests it could matter for medical imaging and diagnosis.

However, several challenges remain open, such as the computational cost of the iterative process and the need to better understand the model's internal workings and potential biases.

Despite these caveats, the research presented in this paper represents an important step forward in the field of CT imaging and highlights the potential of diffusion models to enhance medical imaging technologies. As the field of machine learning continues to advance, it will be exciting to see how this and other innovative approaches can be further developed and applied to improve patient care and outcomes.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Iterative CT Reconstruction via Latent Variable Optimization of Shallow Diffusion Models

Sho Ozaki, Shizuo Kaji, Toshikazu Imae, Kanabu Nawa, Hideomi Yamashita, Keiichi Nakagawa

Image-generative artificial intelligence (AI) has garnered significant attention in recent years. In particular, the diffusion model, a core component of generative AI, produces high-quality images with rich diversity. In this study, we proposed a novel computed tomography (CT) reconstruction method by combining the denoising diffusion probabilistic model with iterative CT reconstruction. In sharp contrast to previous studies, we optimized the fidelity loss of CT reconstruction with respect to the latent variable of the diffusion model, instead of the image and model parameters. To suppress the changes in anatomical structures produced by the diffusion model, we shallowed the diffusion and reverse processes and fixed a set of added noises in the reverse process to make it deterministic during the inference. We demonstrated the effectiveness of the proposed method through the sparse-projection CT reconstruction of 1/10 projection data. Despite the simplicity of the implementation, the proposed method has the potential to reconstruct high-quality images while preserving the patient's anatomical structures and was found to outperform existing methods, including iterative reconstruction, iterative reconstruction with total variation, and the diffusion model alone in terms of quantitative indices such as the structural similarity index and peak signal-to-noise ratio. We also explored further sparse-projection CT reconstruction using 1/20 projection data with the same trained diffusion model. As the number of iterations increased, the image quality improved to a level comparable to that of 1/10 sparse-projection CT reconstruction. In principle, this method can be widely applied not only to CT but also to other imaging modalities.

9/14/2024

CT-SDM: A Sampling Diffusion Model for Sparse-View CT Reconstruction across All Sampling Rates

Liutao Yang, Jiahao Huang, Guang Yang, Daoqiang Zhang

Sparse-view X-ray computed tomography has emerged as a contemporary technique to mitigate radiation dose. Because of the reduced number of projection views, traditional reconstruction methods can lead to severe artifacts. Recently, research studies utilizing deep learning methods have made promising progress in removing artifacts for Sparse-View Computed Tomography (SVCT). However, given the limitations on the generalization capability of deep learning models, current methods usually train models on fixed sampling rates, affecting the usability and flexibility of model deployment in real clinical settings. To address this issue, our study proposes an adaptive reconstruction method to achieve high-performance SVCT reconstruction at any sampling rate. Specifically, we design a novel imaging degradation operator in the proposed sampling diffusion model for SVCT (CT-SDM) to simulate the projection process in the sinogram domain. Thus, the CT-SDM can gradually add projection views to highly undersampled measurements to generalize the full-view sinograms. By choosing an appropriate starting point in diffusion inference, the proposed model can recover the full-view sinograms from any sampling rate with only one trained model. Experiments on several datasets have verified the effectiveness and robustness of our approach, demonstrating its superiority in reconstructing high-quality images from sparse-view CT scans across various sampling rates.

9/4/2024

CT Reconstruction using Diffusion Posterior Sampling conditioned on a Nonlinear Measurement Model

Shudong Li, Xiao Jiang, Matthew Tivnan, Grace J. Gang, Yuan Shen, J. Webster Stayman

Diffusion models have been demonstrated as powerful deep learning tools for image generation in CT reconstruction and restoration. Recently, diffusion posterior sampling, where a score-based diffusion prior is combined with a likelihood model, has been used to produce high-quality CT images given low-quality measurements. This technique is attractive since it permits a one-time, unsupervised training of a CT prior, which can then be incorporated with an arbitrary data model. However, current methods rely on a linear model of X-ray CT physics to reconstruct or restore images. While it is common to linearize the transmission tomography reconstruction problem, this is an approximation to the true and inherently nonlinear forward model. We propose a new method that solves the inverse problem of nonlinear CT image reconstruction via diffusion posterior sampling. We implement a traditional unconditional diffusion model by training a prior score function estimator, and apply Bayes' rule to combine this prior with a measurement likelihood score function derived from the nonlinear physical model to arrive at a posterior score function that can be used to sample the reverse-time diffusion process. This plug-and-play method allows incorporation of a diffusion-based prior with generalized nonlinear CT image reconstruction into multiple CT system designs with different forward models, without the need for any additional training. We develop the algorithm that performs this reconstruction, including an ordered-subsets variant for accelerated processing, and demonstrate the technique in both fully sampled low-dose data and sparse-view geometries using a single unsupervised training of the prior.

6/12/2024

MSDiff: Multi-Scale Diffusion Model for Ultra-Sparse View CT Reconstruction

Pinhuang Tan, Mengxiao Geng, Jingya Lu, Liu Shi, Bin Huang, Qiegen Liu

Computed Tomography (CT) technology reduces radiation hazards to the human body through sparse sampling, but fewer sampling angles pose challenges for image reconstruction. Score-based generative models are widely used in sparse-view CT reconstruction, but performance diminishes significantly with a sharp reduction in projection angles. Therefore, we propose an ultra-sparse view CT reconstruction method utilizing multi-scale diffusion models (MSDiff), designed to concentrate on the global distribution of information and facilitate the reconstruction of sparse views with local image characteristics. Specifically, the proposed model ingeniously integrates information from both comprehensive sampling and selectively sparse sampling techniques. Through precise adjustments in the diffusion model, it is capable of extracting diverse noise distribution, furthering the understanding of the overall structure of images, and aiding the fully sampled model in recovering image information more effectively. By leveraging the inherent correlations within the projection data, we have designed an equidistant mask, enabling the model to focus its attention more effectively. Experimental results demonstrated that the multi-scale model approach significantly improved the quality of image reconstruction under ultra-sparse angles, with good generalization across various datasets.

5/10/2024