DIFR3CT: Latent Diffusion for Probabilistic 3D CT Reconstruction from Few Planar X-Rays

Read original: arXiv:2408.15118 - Published 8/28/2024 by Yiran Sun, Hana Baroudi, Tucker Netherton, Laurence Court, Osama Mawlawi, Ashok Veeraraghavan, Guha Balakrishnan

Overview

This paper introduces a method called DIFR3CT for reconstructing 3D computed tomography (CT) scans from a small number of 2D X-ray images.
The method uses a deep generative model based on diffusion models to generate a probabilistic 3D reconstruction from the sparse X-ray inputs.
Experiments show that DIFR3CT can produce high-quality 3D CT reconstructions from just a few X-ray views, with potential applications in radiation therapy planning and other medical imaging tasks.

Plain English Explanation

Computed tomography (CT) scans provide detailed 3D images of the body, but typically require many X-ray images taken from different angles. DIFR3CT aims to reconstruct 3D CT scans from just a few 2D X-ray images, which could reduce radiation exposure for patients.

The key idea is to use a deep learning model that can "imagine" the full 3D CT scan based on the sparse X-ray inputs. This type of model, called a "diffusion model," is trained on a large dataset of CT scans to learn the patterns and structures in 3D medical images.

Diffusion models are trained by gradually adding noise to real data, such as 3D CT scans, and learning to reverse that corruption step by step. At generation time, the model starts from random noise and denoises it into a realistic 3D scan, guided by the limited X-ray views.
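
A minimal NumPy sketch of the noising step that a diffusion model learns to undo may help here. It is a generic DDPM-style illustration, not the paper's implementation: the linear schedule, the step count, the 8x8x8 array standing in for a CT latent, and the function names are all assumptions. The "noise prediction" passed to the inversion is the true noise, which stands in for a trained network.

```python
import numpy as np

def make_alpha_bar(num_steps=1000, beta_start=1e-4, beta_end=0.02):
    """Cumulative signal-retention factors for a linear noise schedule (assumed)."""
    betas = np.linspace(beta_start, beta_end, num_steps)
    return np.cumprod(1.0 - betas)

def add_noise(x0, t, alpha_bar, rng):
    """Forward process: jump directly from clean data x0 to noise level t."""
    eps = rng.standard_normal(x0.shape)
    xt = np.sqrt(alpha_bar[t]) * x0 + np.sqrt(1.0 - alpha_bar[t]) * eps
    return xt, eps

def estimate_clean(xt, t, alpha_bar, eps_pred):
    """Invert the forward step, given a prediction of the noise that was added."""
    return (xt - np.sqrt(1.0 - alpha_bar[t]) * eps_pred) / np.sqrt(alpha_bar[t])

rng = np.random.default_rng(0)
alpha_bar = make_alpha_bar()
x0 = rng.standard_normal((8, 8, 8))  # hypothetical stand-in for a small 3D latent
xt, eps = add_noise(x0, t=500, alpha_bar=alpha_bar, rng=rng)

# Oracle noise is used here; a real model would predict eps from (xt, t, X-ray features).
x0_hat = estimate_clean(xt, 500, alpha_bar, eps_pred=eps)
```

With a perfect noise prediction the clean volume is recovered exactly. Training a denoiser amounts to making that prediction as accurate as possible at every noise level.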

By reconstructing high-quality 3D CT scans from just a few X-rays, DIFR3CT could benefit medical applications like radiation therapy planning, where detailed 3D information is crucial for accurately targeting tumors while avoiding healthy tissue.

Technical Explanation

DIFR3CT is a deep learning method for 3D CT reconstruction from sparse X-ray inputs. It fuses 2D features from each X-ray view into a joint 3D feature space, then runs a diffusion model in a low-dimensional latent space conditioned on those fused features. Sampling from the model yields a distribution of plausible CT volumes rather than a single estimate.

The key components are:

Encoder: Encodes the input X-ray views into a latent 3D feature representation
Diffusion Model: Learns to reverse the diffusion process to generate realistic 3D CT scans from the latent representation
Decoder: Reconstructs the final 3D CT volume from the generated latent representation
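
The data flow through these three components can be sketched with toy stand-ins. Everything below is hypothetical and only shows the shape of the pipeline: the two orthogonal parallel-beam views, the copy-along-the-ray fusion, the hand-written `toy_denoiser`, and the identity decoder are assumptions, not the paper's learned networks or geometry.

```python
import numpy as np

def fuse_views(frontal, lateral, depth, width):
    """Encoder stand-in: lift each 2D view into 3D by copying it along its ray axis, then average."""
    height = frontal.shape[0]
    shape = (depth, height, width)
    vol_f = np.broadcast_to(frontal[None, :, :], shape)  # frontal rays run along depth
    vol_l = np.broadcast_to(lateral[:, :, None], shape)  # lateral rays run along width
    return 0.5 * (vol_f + vol_l)

def toy_denoiser(z, t, cond):
    """Hypothetical placeholder for a trained network: pulls z halfway toward the condition."""
    return z + 0.5 * (cond - z)

def sample_latent(cond, num_steps, rng, denoiser):
    """Diffusion stand-in: start from noise and repeatedly apply a conditioned denoiser."""
    z = rng.standard_normal(cond.shape)
    for t in reversed(range(num_steps)):
        z = denoiser(z, t, cond)
    return z

def decode(z):
    """Decoder stand-in: identity map (a real decoder would upsample the latent to a CT volume)."""
    return z

rng = np.random.default_rng(0)
D, H, W = 4, 5, 6
volume = rng.random((D, H, W))   # synthetic "ground truth" used only to make projections
frontal = volume.mean(axis=0)    # shape (H, W)
lateral = volume.mean(axis=2)    # shape (D, H)

cond = fuse_views(frontal, lateral, D, W)
recon = decode(sample_latent(cond, num_steps=20, rng=rng, denoiser=toy_denoiser))
```

In this toy version the output simply converges to the fused features. In a trained model the denoiser would instead draw on what it learned from CT data to fill in structure the views cannot determine.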

The model is trained end-to-end on a large dataset of CT scans and corresponding X-ray views. During inference, only a few X-ray images are provided as input, and the model outputs a probabilistic 3D reconstruction that captures the uncertainty in the sparse-view setting.
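
One common way to turn a probabilistic reconstruction into an uncertainty estimate is Monte Carlo sampling: draw several reconstructions for the same input and compute voxel-wise statistics. The sketch below assumes a hypothetical `toy_sampler` in place of the trained model, with noise deliberately made larger in one half of the volume so the effect is visible.

```python
import numpy as np

def summarize_samples(samples):
    """Voxel-wise mean and standard deviation over a list of sampled volumes."""
    stack = np.stack(samples, axis=0)
    return stack.mean(axis=0), stack.std(axis=0)

def toy_sampler(rng, shape=(4, 4, 4)):
    """Hypothetical stand-in for one reconstruction draw from the model."""
    base = np.ones(shape)
    noise_scale = np.full(shape, 0.01)
    noise_scale[2:] = 0.5  # pretend the views constrain this region poorly
    return base + noise_scale * rng.standard_normal(shape)

rng = np.random.default_rng(0)
samples = [toy_sampler(rng) for _ in range(32)]
mean_vol, std_vol = summarize_samples(samples)
```

The mean volume serves as a point estimate, while the standard-deviation map highlights regions where the samples disagree and the reconstruction is less reliable.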

Experiments show that DIFR3CT outperforms recent sparse-view CT reconstruction baselines on PSNR and SSIM, on both the public LIDC dataset and an in-house post-mastectomy CT dataset, using one or a few (fewer than 10) X-ray views. This has promising applications in reducing radiation exposure for patients while maintaining the benefits of 3D imaging for medical tasks like radiotherapy planning.
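
The paper's abstract names PSNR and SSIM as its image-quality metrics. As a rough illustration of the first, PSNR can be computed from the mean squared error between a reference volume and a reconstruction. The function name and the toy arrays below are assumptions for illustration, and the intensity range must match however the volumes are normalized.

```python
import numpy as np

def psnr(reference, estimate, data_range):
    """Peak signal-to-noise ratio in dB: 10 * log10(data_range^2 / MSE)."""
    reference = np.asarray(reference, dtype=np.float64)
    estimate = np.asarray(estimate, dtype=np.float64)
    mse = np.mean((reference - estimate) ** 2)
    if mse == 0:
        return float("inf")  # identical volumes
    return 10.0 * np.log10(data_range ** 2 / mse)

reference = np.full((4, 4, 4), 0.5)  # toy volume with intensities in [0, 1]
estimate = reference + 0.1           # uniform error of 0.1, so MSE is 0.01
score = psnr(reference, estimate, data_range=1.0)
```

Here the score comes out at about 20 dB. SSIM is more involved. One existing implementation is `structural_similarity` in scikit-image's `skimage.metrics` module.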

Critical Analysis

The paper provides a thorough evaluation of DIFR3CT, including comparisons to state-of-the-art sparse-view CT reconstruction methods. The results demonstrate the effectiveness of the diffusion model approach, which can generate higher-quality 3D reconstructions than previous techniques.

However, the paper does not discuss certain limitations or potential issues with the method. For example, the performance of DIFR3CT may degrade for very challenging view configurations or anatomical regions, and the computational cost of the diffusion model could limit its real-time clinical applicability.

Additionally, the paper does not explore the model's robustness to noise, artifacts, or other realistic challenges that may arise in clinical X-ray imaging. Further research would be needed to assess the practical viability of DIFR3CT for real-world radiotherapy planning and other medical applications.

Overall, the paper presents a promising deep learning approach for sparse-view 3D CT reconstruction, but additional work is needed to fully understand the method's strengths, weaknesses, and potential for clinical translation.

Conclusion

DIFR3CT is a novel deep learning framework that can reconstruct high-quality 3D CT scans from just a few 2D X-ray views. By leveraging powerful diffusion models, the method can generate probabilistic 3D reconstructions that capture uncertainty in the sparse-view setting.

The results demonstrate the potential of DIFR3CT to reduce radiation exposure for patients while still providing the benefits of 3D medical imaging, with applications in radiotherapy planning and other areas of diagnostic radiology. Further research is needed to fully evaluate the method's robustness and clinical viability, but this work represents an important step forward in sparse-view CT reconstruction using deep generative models.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

DIFR3CT: Latent Diffusion for Probabilistic 3D CT Reconstruction from Few Planar X-Rays

Yiran Sun, Hana Baroudi, Tucker Netherton, Laurence Court, Osama Mawlawi, Ashok Veeraraghavan, Guha Balakrishnan

Computed Tomography (CT) scans are the standard-of-care for the visualization and diagnosis of many clinical ailments, and are needed for the treatment planning of external beam radiotherapy. Unfortunately, the availability of CT scanners in low- and mid-resource settings is highly variable. Planar x-ray radiography units, in comparison, are far more prevalent, but can only provide limited 2D observations of the 3D anatomy. In this work we propose DIFR3CT, a 3D latent diffusion model, that can generate a distribution of plausible CT volumes from one or few (<10) planar x-ray observations. DIFR3CT works by fusing 2D features from each x-ray into a joint 3D space, and performing diffusion conditioned on these fused features in a low-dimensional latent space. We conduct extensive experiments demonstrating that DIFR3CT is better than recent sparse CT reconstruction baselines in terms of standard pixel-level metrics (PSNR, SSIM) on both the public LIDC and in-house post-mastectomy CT datasets. We also show that DIFR3CT supports uncertainty quantification via Monte Carlo sampling, which provides an opportunity to measure reconstruction reliability. Finally, we perform a preliminary pilot study evaluating DIFR3CT for automated breast radiotherapy contouring and planning, and demonstrate promising feasibility. Our code is available at https://github.com/yransun/DIFR3CT.

8/28/2024

DiffuX2CT: Diffusion Learning to Reconstruct CT Images from Biplanar X-Rays

Xuhui Liu, Zhi Qiao, Runkun Liu, Hong Li, Juan Zhang, Xiantong Zhen, Zhen Qian, Baochang Zhang

Computed tomography (CT) is widely utilized in clinical settings because it delivers detailed 3D images of the human body. However, performing CT scans is not always feasible due to radiation exposure and limitations in certain surgical environments. As an alternative, reconstructing CT images from ultra-sparse X-rays offers a valuable solution and has gained significant interest in scientific research and medical applications. However, it presents great challenges as it is inherently an ill-posed problem, often compromised by artifacts resulting from overlapping structures in X-ray images. In this paper, we propose DiffuX2CT, which models CT reconstruction from orthogonal biplanar X-rays as a conditional diffusion process. DiffuX2CT is established with a 3D global coherence denoising model with a new, implicit conditioning mechanism. We realize the conditioning mechanism by a newly designed tri-plane decoupling generator and an implicit neural decoder. By doing so, DiffuX2CT achieves structure-controllable reconstruction, which enables 3D structural information to be recovered from 2D X-rays, therefore producing faithful textures in CT images. As an extra contribution, we collect a real-world lumbar CT dataset, called LumbarV, as a new benchmark to verify the clinical significance and performance of CT reconstruction from X-rays. Extensive experiments on this dataset and three more publicly available datasets demonstrate the effectiveness of our proposal.

7/19/2024

DX2CT: Diffusion Model for 3D CT Reconstruction from Bi or Mono-planar 2D X-ray(s)

Yun Su Jeong, Hye Bin Yoo, Il Yong Chun

Computational tomography (CT) provides high-resolution medical imaging, but it can expose patients to high radiation. X-ray scanners have low radiation exposure, but their resolutions are low. This paper proposes a new conditional diffusion model, DX2CT, that reconstructs three-dimensional (3D) CT volumes from bi or mono-planar X-ray image(s). Proposed DX2CT consists of two key components: 1) modulating feature maps extracted from two-dimensional (2D) X-ray(s) with 3D positions of CT volume using a new transformer and 2) effectively using the modulated 3D position-aware feature maps as conditions of DX2CT. In particular, the proposed transformer can provide conditions with rich information of a target CT slice to the conditional diffusion model, enabling high-quality CT reconstruction. Our experiments with the bi or mono-planar X-ray(s) benchmark datasets show that proposed DX2CT outperforms several state-of-the-art methods. Our codes and model will be available at: https://www.github.com/intyeger/DX2CT.

9/16/2024

Diff2CT: Diffusion Learning to Reconstruct Spine CT from Biplanar X-Rays

Zhi Qiao, Xuhui Liu, Xiaopeng Wang, Runkun Liu, Xiantong Zhen, Pei Dong, Zhen Qian

Intraoperative CT imaging serves as a crucial resource for surgical guidance; however, it may not always be readily accessible or practical to implement. In scenarios where CT imaging is not an option, reconstructing CT scans from X-rays can offer a viable alternative. In this paper, we introduce an innovative method for 3D CT reconstruction utilizing biplanar X-rays. Distinct from previous research that relies on conventional image generation techniques, our approach leverages a conditional diffusion process to tackle the task of reconstruction. More precisely, we employ a diffusion-based probabilistic model trained to produce 3D CT images based on orthogonal biplanar X-rays. To improve the structural integrity of the reconstructed images, we incorporate a novel projection loss function. Experimental results validate that our proposed method surpasses existing state-of-the-art benchmarks in both visual image quality and multiple evaluative metrics. Specifically, our technique achieves a higher Structural Similarity Index (SSIM) of 0.83, a relative increase of 10%, and a lower Fréchet Inception Distance (FID) of 83.43, which represents a relative decrease of 25%.

8/22/2024