DX2CT: Diffusion Model for 3D CT Reconstruction from Bi or Mono-planar 2D X-ray(s)

Read original: arXiv:2409.08850 - Published 9/16/2024 by Yun Su Jeong, Hye Bin Yoo, Il Yong Chun

DX2CT: Diffusion Model for 3D CT Reconstruction from Bi or Mono-planar 2D X-ray(s)

Overview

Presents a diffusion model for 3D CT reconstruction from 2D X-ray images
Enables 3D CT reconstruction from single or multiple 2D X-ray images
Leverages diffusion models to generate high-quality 3D CT volumes

Plain English Explanation

The paper introduces a new method called DX2CT that uses diffusion models to reconstruct 3D computed tomography (CT) images from 2D X-ray images. Diffusion models are a type of machine learning algorithm that can generate new data by gradually transforming random noise into realistic-looking outputs.

The key idea behind DX2CT is to train a diffusion model to learn the relationship between 2D X-ray images and 3D CT volumes. Once trained, the model can take a new 2D X-ray image as input and generate a corresponding 3D CT reconstruction. This allows 3D CT scans to be obtained from much simpler and more widely available 2D X-ray imaging, which has applications in medical imaging, industrial inspection, and other domains.

The paper demonstrates that DX2CT can produce high-quality 3D reconstructions from either a single 2D X-ray image or multiple 2D images taken from different angles. This flexibility is important, as it means the technique can be applied in situations where only limited X-ray data is available.

Technical Explanation

The DX2CT model is based on a Latent Diffusion architecture, which is a type of diffusion model. Diffusion models learn to generate new data by progressively adding noise to the input and then learning to reverse this noising process.

In the case of DX2CT, the input is a 2D X-ray image (either a single view or multiple views) and the target output is a 3D CT volume. The diffusion model is trained to learn the mapping between the 2D X-ray input and the corresponding 3D CT data, allowing it to generate high-quality 3D reconstructions from new 2D X-ray inputs.

The paper evaluates DX2CT on several medical imaging benchmarks, including CTCOR and spine CT reconstruction tasks. The results demonstrate that DX2CT outperforms previous state-of-the-art methods for 3D CT reconstruction from 2D X-rays, both in terms of reconstruction quality and computational efficiency.

Critical Analysis

The paper provides a thorough evaluation of the DX2CT model and its capabilities, including comparisons to existing techniques. However, the authors acknowledge several limitations and areas for future work:

The model's performance may degrade on more complex or diverse anatomical structures beyond the medical imaging benchmarks evaluated.
The training process requires paired 2D X-ray and 3D CT data, which can be challenging to obtain in practice.
The computational complexity of the diffusion model may limit its deployment in real-time applications.

Additionally, while the paper focuses on the technical aspects of the DX2CT model, it would be valuable to further explore the potential clinical and societal implications of this technology. For example, how might it impact patient care, radiation exposure, and healthcare costs?

Conclusion

The DX2CT paper presents an innovative approach to 3D CT reconstruction from 2D X-ray images using diffusion models. This work has the potential to significantly improve the accessibility and cost-effectiveness of 3D medical imaging, with applications in various fields beyond healthcare. As the authors note, further research is needed to address the current limitations and explore the broader impact of this technology.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

DX2CT: Diffusion Model for 3D CT Reconstruction from Bi or Mono-planar 2D X-ray(s)

Yun Su Jeong, Hye Bin Yoo, Il Yong Chun

Computational tomography (CT) provides high-resolution medical imaging, but it can expose patients to high radiation. X-ray scanners have low radiation exposure, but their resolutions are low. This paper proposes a new conditional diffusion model, DX2CT, that reconstructs three-dimensional (3D) CT volumes from bi or mono-planar X-ray image(s). Proposed DX2CT consists of two key components: 1) modulating feature maps extracted from two-dimensional (2D) X-ray(s) with 3D positions of CT volume using a new transformer and 2) effectively using the modulated 3D position-aware feature maps as conditions of DX2CT. In particular, the proposed transformer can provide conditions with rich information of a target CT slice to the conditional diffusion model, enabling high-quality CT reconstruction. Our experiments with the bi or mono-planar X-ray(s) benchmark datasets show that proposed DX2CT outperforms several state-of-the-art methods. Our codes and model will be available at: https://www.github.com/intyeger/DX2CT.

9/16/2024

DiffuX2CT: Diffusion Learning to Reconstruct CT Images from Biplanar X-Rays

Xuhui Liu, Zhi Qiao, Runkun Liu, Hong Li, Juan Zhang, Xiantong Zhen, Zhen Qian, Baochang Zhang

Computed tomography (CT) is widely utilized in clinical settings because it delivers detailed 3D images of the human body. However, performing CT scans is not always feasible due to radiation exposure and limitations in certain surgical environments. As an alternative, reconstructing CT images from ultra-sparse X-rays offers a valuable solution and has gained significant interest in scientific research and medical applications. However, it presents great challenges as it is inherently an ill-posed problem, often compromised by artifacts resulting from overlapping structures in X-ray images. In this paper, we propose DiffuX2CT, which models CT reconstruction from orthogonal biplanar X-rays as a conditional diffusion process. DiffuX2CT is established with a 3D global coherence denoising model with a new, implicit conditioning mechanism. We realize the conditioning mechanism by a newly designed tri-plane decoupling generator and an implicit neural decoder. By doing so, DiffuX2CT achieves structure-controllable reconstruction, which enables 3D structural information to be recovered from 2D X-rays, therefore producing faithful textures in CT images. As an extra contribution, we collect a real-world lumbar CT dataset, called LumbarV, as a new benchmark to verify the clinical significance and performance of CT reconstruction from X-rays. Extensive experiments on this dataset and three more publicly available datasets demonstrate the effectiveness of our proposal.

7/19/2024

Diff2CT: Diffusion Learning to Reconstruct Spine CT from Biplanar X-Rays

Zhi Qiao, Xuhui Liu, Xiaopeng Wang, Runkun Liu, Xiantong Zhen, Pei Dong, Zhen Qian

Intraoperative CT imaging serves as a crucial resource for surgical guidance; however, it may not always be readily accessible or practical to implement. In scenarios where CT imaging is not an option, reconstructing CT scans from X-rays can offer a viable alternative. In this paper, we introduce an innovative method for 3D CT reconstruction utilizing biplanar X-rays. Distinct from previous research that relies on conventional image generation techniques, our approach leverages a conditional diffusion process to tackle the task of reconstruction. More precisely, we employ a diffusion-based probabilistic model trained to produce 3D CT images based on orthogonal biplanar X-rays. To improve the structural integrity of the reconstructed images, we incorporate a novel projection loss function. Experimental results validate that our proposed method surpasses existing state-of-the-art benchmarks in both visual image quality and multiple evaluative metrics. Specifically, our technique achieves a higher Structural Similarity Index (SSIM) of 0.83, a relative increase of 10%, and a lower Fr'echet Inception Distance (FID) of 83.43, which represents a relative decrease of 25%.

8/22/2024

DIFR3CT: Latent Diffusion for Probabilistic 3D CT Reconstruction from Few Planar X-Rays

Yiran Sun, Hana Baroudi, Tucker Netherton, Laurence Court, Osama Mawlawi, Ashok Veeraraghavan, Guha Balakrishnan

Computed Tomography (CT) scans are the standard-of-care for the visualization and diagnosis of many clinical ailments, and are needed for the treatment planning of external beam radiotherapy. Unfortunately, the availability of CT scanners in low- and mid-resource settings is highly variable. Planar x-ray radiography units, in comparison, are far more prevalent, but can only provide limited 2D observations of the 3D anatomy. In this work we propose DIFR3CT, a 3D latent diffusion model, that can generate a distribution of plausible CT volumes from one or few (<10) planar x-ray observations. DIFR3CT works by fusing 2D features from each x-ray into a joint 3D space, and performing diffusion conditioned on these fused features in a low-dimensional latent space. We conduct extensive experiments demonstrating that DIFR3CT is better than recent sparse CT reconstruction baselines in terms of standard pixel-level (PSNR, SSIM) on both the public LIDC and in-house post-mastectomy CT datasets. We also show that DIFR3CT supports uncertainty quantification via Monte Carlo sampling, which provides an opportunity to measure reconstruction reliability. Finally, we perform a preliminary pilot study evaluating DIFR3CT for automated breast radiotherapy contouring and planning -- and demonstrate promising feasibility. Our code is available at https://github.com/yransun/DIFR3CT.

8/28/2024